Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Identifying gene-specific variations in biomedical text.

Roman Klinger1, Christoph M Friedrich, Heinz Theodor Mevissen

  • 1Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Schloss Birlinghoven, 53754 Sankt Augustin, Germany. roman.klinger@scai.fhg.de

Journal of Bioinformatics and Computational Biology
|January 4, 2008
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Selecting medical research data platforms for translational biomedical research: a five-tier overview and requirement-weighted assessment framework.

Frontiers in digital health·2026
Same author

Extracting Medical Information From Unstructured Clinical Text Using Large Language Models to Enhance Health Care Interoperability: Proof-of-Concept Study.

Journal of medical Internet research·2026
Same author

Transitional initiatives for advancing the phasing out of the use of animals for drug and chemical safety testing: The IHI VICT3R project for reducing the use of animals by implementing virtual control groups.

NAM journal·2026
Same author

Challenges in AI Based Tumor Board Case Summarization and Recommendations.

Research square·2026
Same author

Multimodal phenotypic classification of generalized anxiety and panic using structural MRI data and psychosocial factors: machine learning results from the German National Cohort (NAKO) study.

Translational psychiatry·2026
Same author

Integrating Genetic Variants and Expression Profiles of Pharmacogenes to Investigate Resistance to Antidepressant Treatment.

Medicina (Kaunas, Lithuania)·2026
Same journal

CNV-ECOD: A copy number variation detection method based on ECOD algorithm using next-generation sequencing data.

Journal of bioinformatics and computational biology·2026
Same journal

ReinVar: A model-free paradigm-based reinforcement learning approach to detect copy number variation.

Journal of bioinformatics and computational biology·2026
Same journal

When pipelines run but coordinates fail: A simple spatial specificity check for false locality in post-GWAS analysis.

Journal of bioinformatics and computational biology·2026
Same journal

Comparative benchmarking of template-based, evolutionary-diffusion, and generative language models for IsPETase structure prediction.

Journal of bioinformatics and computational biology·2026
Same journal

Trap spaces as labelled ideals of SCC posets: A structural-functional theory of reachability in asynchronous boolean networks.

Journal of bioinformatics and computational biology·2026
Same journal

Erratum - DDINet: Drug-drug interaction prediction network based on multi-molecular fingerprint features and multi-head attention centered weighted autoencoder.

Journal of bioinformatics and computational biology·2026
See all related articles

This study improves the automatic extraction of genetic variation information from scientific literature. By integrating gene name recognition with variation entity normalization, it enhances the accuracy of linking textual mentions to databases like dbSNP.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Biomedical research heavily relies on scientific publications to disseminate findings on genetic variations influencing diseases.
  • Accurate extraction of gene names and allelic variants from text is crucial for automated analysis.
  • Previous systems like OSIRIS faced challenges with recall for variation mentions and gene recognition.

Purpose of the Study:

  • To enhance the automatic recognition and normalization of gene and protein names, as well as variation terms in biomedical text.
  • To improve the linkage of identified textual entities to curated databases such as the Single Nucleotide Polymorphism database (dbSNP).
  • To evaluate the performance of an integrated system combining ProMiner and a conditional random field (CRF) model for variation entity recognition.

Related Experiment Videos

Main Methods:

  • Integration of the ProMiner system for gene and protein name recognition and normalization.
  • Application of a conditional random field (CRF) model for recognizing variation terms in biomedical literature.
  • Development of a novel normalization process for variation entities.
  • Linking normalized variation entities to dbSNP entries.

Main Results:

  • The novel approach demonstrates improved performance in recognizing and normalizing gene names and allelic variations.
  • Enhanced recall for variation mentions and gene name recognition compared to previous methods.
  • Successful linking of textual entities to specific dbSNP entries, facilitating data integration.

Conclusions:

  • The integrated system significantly improves the accuracy and recall of extracting genetic variation information from biomedical texts.
  • This approach facilitates a more robust connection between scientific literature and curated genetic variation databases.
  • The developed method represents a state-of-the-art advancement in automated biomedical text mining for genetic variation studies.