Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
Genomics02:02

Genomics

Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
Organization of Genes02:07

Organization of Genes

Overview
Organization of Genes02:07

Organization of Genes

Overview
Gene Duplication and Divergence02:37

Gene Duplication and Divergence

The seminal work of Ohno in 1970 popularized the idea of gene duplication and divergence. DNA sequence comparison studies reveal that a large portion of the genes in bacteria, archaebacteria, and eukaryotes was  generated by gene duplication and divergence, indicating its critical role in evolution.
The duplicated copies of the gene are called Paralogs. Paralogs with similar sequences and functions form a gene family. Across several species, a large number of gene families are characterized.
Complementary DNA01:44

Complementary DNA

Overview

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Elevated platelet-to-lymphocyte ratio is associated with increased breast cancer risk: A cross-sectional analysis of the NHANES 2009 to 2020.

Medicine·2026
Same author

Investigating the human anellome across the lifespan reveals sex-specific biphasic trajectories.

npj aging·2026
Same author

Plasma proteomic profiling uncovers the age-related molecular changes in chronic obstructive pulmonary disease.

BMC pulmonary medicine·2026
Same author

Integrated Bulk and Single-Cell Transcriptomic Analysis Reveals Xenobiotic Metabolism Genes Drive Progression From Liver Cirrhosis to Hepatocellular Carcinoma.

Human mutation·2026
Same author

Molecular Transformation and Climate Effects of Water-Soluble Brown Carbon from Urban on-Road Vehicular Emissions.

Environmental science & technology·2026
Same author

Proteomics-decoded capture kill mechanism of CuAgOx/UiO-66-NH<sub>2</sub> for dark inactivation of pathogenic bacteria.

Journal of hazardous materials·2026
Same journal

Invaders taking over-Mollusc faunal change in volcanic barrier lakes of the Albertine Rift biodiversity hotspot.

PloS one·2026
Same journal

AI-driven molecular diversification and ligand-based optimization of macitentan derivatives targeting VEGFR1 and endothelin signaling pathways.

PloS one·2026
Same journal

Performance patterns and records in the world aquatics masters championships: Where do the most frequently represented nations among the top-ten masters swimmers come from?

PloS one·2026
Same journal

Modeling diurnal Temperature-Rainfall relationships under multicollinearity using PLS-SEM: A case study of Ghana.

PloS one·2026
Same journal

Organizational culture, social capital, and emergency capacity in primary healthcare institutions: A cross-sectional structural equation modeling study comparing ordinary and older communities.

PloS one·2026
Same journal

Impact of kidney function on the metabolome in the general population.

PloS one·2026
See all related articles

Related Experiment Video

Updated: May 18, 2026

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms

Published on: May 9, 2017

Integrating various resources for gene name normalization.

Yuncui Hu1, Yanpeng Li, Hongfei Lin

  • 1School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning, China. huyuncui@mail.dlut.edu.cn

Plos One
|September 18, 2012
PubMed
Summary
This summary is machine-generated.

This study introduces a novel system for accurately identifying and normalizing gene names in biomedical texts. The system enhances gene name recognition and mapping, achieving high performance in gene normalization tasks.

More Related Videos

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq
07:09

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

Published on: May 28, 2021

Related Experiment Videos

Last Updated: May 18, 2026

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms

Published on: May 9, 2017

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq
07:09

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

Published on: May 28, 2021

Area of Science:

  • Biomedical text mining
  • Bioinformatics
  • Natural Language Processing

Background:

  • Accurate recognition and normalization of gene mentions are essential for biomedical text mining.
  • Existing methods face challenges in handling the complexity and variability of gene nomenclature.

Purpose of the Study:

  • To develop and evaluate a comprehensive system for extracting gene names from biomedical literature.
  • To normalize recognized gene names to standardized database identifiers.
  • To improve the precision and recall of gene name recognition and normalization.

Main Methods:

  • A four-component system: gene name recognition, entity mapping, disambiguation, and filtering.
  • Gene name recognition utilizes dictionary matching and semi-supervised learning with MEDLINE co-occurrence data.
  • Entity mapping combines exact and approximate matching to link gene names to the EntrezGene database.
  • Disambiguation employs semantic similarity from Gene Ontology and MEDLINE abstracts.
  • Filtering is based on confidence scores to manage precision-recall trade-offs.

Main Results:

  • The system achieved an F-measure of 83% on the BioCreative II Gene Normalization dataset.
  • Precision was 82.5% and recall was 83.5%.
  • Performance is comparable to current state-of-the-art systems.

Conclusions:

  • The developed system effectively extracts and normalizes gene names from biomedical literature.
  • The integrated approach of recognition, mapping, disambiguation, and filtering enhances accuracy.
  • The system offers a robust solution for gene normalization in bioinformatics and text mining applications.