Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Gene Duplication and Divergence02:37

Gene Duplication and Divergence

The seminal work of Ohno in 1970 popularized the idea of gene duplication and divergence. DNA sequence comparison studies reveal that a large portion of the genes in bacteria, archaebacteria, and eukaryotes was  generated by gene duplication and divergence, indicating its critical role in evolution.
The duplicated copies of the gene are called Paralogs. Paralogs with similar sequences and functions form a gene family. Across several species, a large number of gene families are characterized.
Gene Conversion02:08

Gene Conversion

Other than maintaining genome stability via DNA repair, homologous recombination plays an important role in diversifying the genome. In fact, the recombination of sequences forms the molecular basis of genomic evolution. Random and non-random permutations of genomic sequences create a library of new amalgamated sequences. These newly formed genomes can determine the fitness and survival of cells. In bacteria, homologous and non-homologous types of recombination lead to the evolution of new...
Gene Conversion02:08

Gene Conversion

Other than maintaining genome stability via DNA repair, homologous recombination plays an important role in diversifying the genome. In fact, the recombination of sequences forms the molecular basis of genomic evolution. Random and non-random permutations of genomic sequences create a library of new amalgamated sequences. These newly formed genomes can determine the fitness and survival of cells. In bacteria, homologous and non-homologous types of recombination lead to the evolution of new...
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adaptmol: domain adaptation for molecular image recognition with limited supervision.

Journal of cheminformatics·2026
Same author

Dynamic thermodynamic-informational entropic relationship (TIER) models of selective vulnerability to neurodegeneration.

bioRxiv : the preprint server for biology·2026
Same author

Desiderata for a biomedical knowledge network: opportunities, challenges and future directions.

Bioinformatics advances·2026
Same author

Governing real-world health data as a public utility.

Science (New York, N.Y.)·2026
Same author

In Memoriam: Celebrating the Life of Dr. John R. Benfield (1931-2025).

The Annals of thoracic surgery·2026
Same author

The state of standardized musculoskeletal terminology for healthcare reuse:A scoping review.

International journal of medical informatics·2026
Same journal

circ2DGNN: circRNA-Disease Association Prediction via Transformer-Based Graph Neural Network.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Hierarchical Hypergraph Learning in Association- Weighted Heterogeneous Network for miRNA- Disease Association Identification.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Discriminative Domain Adaption Network for Simultaneously Removing Batch Effects and Annotating Cell Types in Single-Cell RNA-Seq.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

MLW-BFECF: A Multi-Weighted Dynamic Cascade Forest Based on Bilinear Feature Extraction for Predicting the Stage of Kidney Renal Clear Cell Carcinoma on Multi-Modal Gene Data.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

An End-to-End Knowledge Graph Fused Graph Neural Network for Accurate Protein-Protein Interactions Prediction.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Generative Biomedical Event Extraction With Constrained Decoding Strategy.

IEEE/ACM transactions on computational biology and bioinformatics·2024
See all related articles

Related Experiment Video

Updated: Jun 10, 2026

Gene Expression Profiling of Infecting Microbes Using a Digital Bar-coding Platform
09:13

Gene Expression Profiling of Infecting Microbes Using a Digital Bar-coding Platform

Published on: January 13, 2016

Exploring species-based strategies for gene normalization.

Karin Verspoor1, Christophe Roeder, Helen L Johnson

  • 1Center for Computational Pharmacology, University of Colorado Denver, Aurora, CO 80045, USA. karin.verspoor@ucdenver.edu

IEEE/ACM Transactions on Computational Biology and Bioinformatics
|July 31, 2010
PubMed
Summary
This summary is machine-generated.

This study presents a novel system for gene normalization in protein information extraction. The fuzzy dictionary lookup and species-aware strategies show promise for accurately identifying and mapping protein mentions.

More Related Videos

Competitive Genomic Screens of Barcoded Yeast Libraries
11:59

Competitive Genomic Screens of Barcoded Yeast Libraries

Published on: August 11, 2011

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

Related Experiment Videos

Last Updated: Jun 10, 2026

Gene Expression Profiling of Infecting Microbes Using a Digital Bar-coding Platform
09:13

Gene Expression Profiling of Infecting Microbes Using a Digital Bar-coding Platform

Published on: January 13, 2016

Competitive Genomic Screens of Barcoded Yeast Libraries
11:59

Competitive Genomic Screens of Barcoded Yeast Libraries

Published on: August 11, 2011

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

Area of Science:

  • Biomedical Informatics
  • Computational Biology
  • Natural Language Processing

Background:

  • Accurate identification and normalization of protein mentions in scientific literature are crucial for building comprehensive knowledge bases.
  • Existing methods for protein information extraction face challenges in handling variations in gene/protein nomenclature and context.

Purpose of the Study:

  • To develop and evaluate a system for the BioCreative II.5 challenge focused on protein and protein interaction information extraction.
  • To specifically address the gene normalization task, which involves recognizing protein mentions and mapping them to database identifiers.

Main Methods:

  • A "fuzzy" dictionary lookup approach for detecting protein mentions by matching regularized text to dictionary entries.
  • Multiple gene normalization strategies were developed, incorporating global and local species/organism context.
  • External knowledge sources were utilized to enhance normalization accuracy.

Main Results:

  • The developed system demonstrated competitive performance in the BioCreative II.5 evaluation.
  • Analysis of system variations highlighted the effectiveness of different normalization strategies and the impact of external knowledge.
  • The gene normalization strategies showed promising results, indicating potential for improvement.

Conclusions:

  • The proposed fuzzy dictionary lookup and context-aware normalization strategies are effective for gene normalization in protein information extraction.
  • The system provides a valuable platform for further research into optimizing information extraction performance.
  • Further exploration of external knowledge integration and normalization techniques can enhance system accuracy.