Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to form...
Protein Complexes with Interchangeable Parts01:57

Protein Complexes with Interchangeable Parts

Groups of proteins may form a complex where each protein in this complex has a different role in the overall execution of the complex’s function. Often some of the proteins in the complex can be replaced by a closely related variant to give a complex that contains many of the same components yet is functionally distinct.
The SCF ubiquitin ligase is a protein complex of five individual proteins. This complex attaches ubiquitin to other target proteins to mark them for degradation. In order to...
Protein Complexes with Interchangeable Parts01:57

Protein Complexes with Interchangeable Parts

Groups of proteins may form a complex where each protein in this complex has a different role in the overall execution of the complex’s function. Often some of the proteins in the complex can be replaced by a closely related variant to give a complex that contains many of the same components yet is functionally distinct.
The SCF ubiquitin ligase is a protein complex of five individual proteins. This complex attaches ubiquitin to other target proteins to mark them for degradation. In order to...
Conservation of Protein Domains02:26

Conservation of Protein Domains

Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to form...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Assessment of clinical and logistical contribution in a Norwegian helicopter emergency medical service using integrated data: a retrospective observational study.

Scandinavian journal of trauma, resuscitation and emergency medicine·2026
Same author

Resources for screening the literature for glycan-related terms using PubAnnotation in GlyCosmos.

Glycobiology·2026
Same author

A curation system of rice trait ontology with reliable interoperation by LLM and PubAnnotation.

Genomics & informatics·2025
Same author

Variations in the <sup>129</sup>I/<sup>127</sup>I ratios of pre-world war II algae under long-term exposure to atmospheric iodine isotopes.

Marine pollution bulletin·2025
Same author

Annotated textual dataset PV600 of perovskite bandgaps for information extraction from literature.

Scientific data·2025
Same author

LSD600: the first corpus of biomedical abstracts annotated with lifestyle-disease relations.

Database : the journal of biological databases and curation·2025
Same journal

CNV-ECOD: A copy number variation detection method based on ECOD algorithm using next-generation sequencing data.

Journal of bioinformatics and computational biology·2026
Same journal

ReinVar: A model-free paradigm-based reinforcement learning approach to detect copy number variation.

Journal of bioinformatics and computational biology·2026
Same journal

When pipelines run but coordinates fail: A simple spatial specificity check for false locality in post-GWAS analysis.

Journal of bioinformatics and computational biology·2026
Same journal

Comparative benchmarking of template-based, evolutionary-diffusion, and generative language models for IsPETase structure prediction.

Journal of bioinformatics and computational biology·2026
Same journal

Trap spaces as labelled ideals of SCC posets: A structural-functional theory of reachability in asynchronous boolean networks.

Journal of bioinformatics and computational biology·2026
Same journal

Erratum - DDINet: Drug-drug interaction prediction network based on multi-molecular fingerprint features and multi-head attention centered weighted autoencoder.

Journal of bioinformatics and computational biology·2026
See all related articles

Related Experiment Video

Updated: Jun 7, 2026

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

Improving the inter-corpora compatibility for protein annotations.

Yue Wang1, Jin-Dong Kim, Rune Saetre

  • 1Department of Computer Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-0033, Japan. wangyue@is.s.u-tokyo.ac.jp

Journal of Bioinformatics and Computational Biology
|October 29, 2010
PubMed
Summary
This summary is machine-generated.

This study addresses protein name recognition challenges by improving compatibility between the GENIA and GENETAG corpora. Experimental results demonstrate that most annotation incompatibilities can be resolved, advancing biomedical literature analysis.

More Related Videos

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

Mapping Dysfunctional Protein-Protein Interactions in Disease
09:39

Mapping Dysfunctional Protein-Protein Interactions in Disease

Published on: October 24, 2025

Related Experiment Videos

Last Updated: Jun 7, 2026

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

Mapping Dysfunctional Protein-Protein Interactions in Disease
09:39

Mapping Dysfunctional Protein-Protein Interactions in Disease

Published on: October 24, 2025

Area of Science:

  • Bioinformatics
  • Natural Language Processing
  • Computational Biology

Background:

  • Existing protein-annotated corpora exhibit annotation incompatibility, hindering automatic protein name recognition in biomedical literature.
  • This incompatibility limits the effective use and integration of valuable annotated datasets.

Purpose of the Study:

  • To resolve the annotation incompatibility issue between two major protein-annotated corpora: the GENIA corpus and the GENETAG corpus.
  • To enhance the compatibility of these corpora for improved downstream applications in biomedical text mining.

Main Methods:

  • Conducted a comparative study to gain deeper insights into the structures and annotations of the GENIA and GENETAG corpora.
  • Developed and applied methods to identify and reconcile differences in protein entity annotations between the two datasets.

Main Results:

  • Identified key sources of incompatibility between the GENIA and GENETAG protein annotations.
  • Experimental results indicate that the majority of the incompatibility issues between the two corpora can be effectively removed.
  • Demonstrated improved compatibility, facilitating more robust protein name recognition.

Conclusions:

  • Resolving annotation incompatibility is crucial for advancing automatic protein name recognition in the biomedical domain.
  • The developed methods successfully improved the compatibility between the GENIA and GENETAG corpora.
  • This work provides a foundation for creating more unified and usable annotated resources for biomedical research.