Identifying gene-specific variations in biomedical text.

Roman Klinger¹, Christoph M Friedrich, Heinz Theodor Mevissen

¹Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Schloss Birlinghoven, 53754 Sankt Augustin, Germany. roman.klinger@scai.fhg.de

Journal of Bioinformatics and Computational Biology

January 4, 2008

Summary

This summary is machine-generated.

This study improves the automatic extraction of genetic variation information from scientific literature. By integrating gene name recognition with variation entity normalization, it enhances the accuracy of linking textual mentions to databases like dbSNP.

Area of Science:

Genomics
Bioinformatics
Computational Biology

Background:

Biomedical research relies heavily on scientific publications to disseminate findings on genetic variations that influence disease.
Accurate extraction of gene names and allelic variants from text is crucial for automated analysis.
Previous systems such as OSIRIS had limited recall for both variation mentions and gene names.

Purpose of the Study:

To enhance the automatic recognition and normalization of gene and protein names, as well as variation terms in biomedical text.
To improve the linkage of identified textual entities to curated databases such as the Single Nucleotide Polymorphism database (dbSNP).
To evaluate the performance of an integrated system combining ProMiner and a conditional random field (CRF) model for variation entity recognition.

Main Methods:

Integration of the ProMiner system for gene and protein name recognition and normalization.
Application of a conditional random field (CRF) model for recognizing variation terms in biomedical literature.
Development of a novel normalization process for variation entities.
Linking normalized variation entities to dbSNP entries.
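
The normalization and linking steps listed above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary does not describe the actual normalization rules, so the sketch assumes a simple case in which protein-level substitutions written as `A123T` or `Ala123Thr` are mapped to one canonical tuple. The names `normalize_variation`, `link_mention`, and `TOY_INDEX`, the gene identifier `GENE1`, and the identifier `rs0000001` are hypothetical placeholders, not real dbSNP data.

```python
import re

# Standard three-letter to one-letter amino acid abbreviations.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

_AA1 = "ACDEFGHIKLMNPQRSTVWY"
_AA3 = "|".join(THREE_TO_ONE)

# Assumed surface forms: "A123T" and "Ala123Thr" (optionally "Ala-123-Thr").
SHORT = re.compile(rf"^([{_AA1}])(\d+)([{_AA1}])$")
LONG = re.compile(rf"^({_AA3})[- ]?(\d+)[- ]?({_AA3})$", re.IGNORECASE)


def normalize_variation(mention):
    """Map a substitution mention to (wild_type, position, mutant), or None."""
    text = mention.strip()
    m = SHORT.match(text)
    if m:
        return (m.group(1), int(m.group(2)), m.group(3))
    m = LONG.match(text)
    if m:
        wild = THREE_TO_ONE[m.group(1).capitalize()]
        mutant = THREE_TO_ONE[m.group(3).capitalize()]
        return (wild, int(m.group(2)), mutant)
    return None


# Hypothetical in-memory index standing in for a database lookup.
# Keys pair a normalized gene identifier with a normalized variation.
TOY_INDEX = {("GENE1", ("A", 123, "T")): "rs0000001"}


def link_mention(gene_id, mention, index=TOY_INDEX):
    """Return the database identifier for a gene/variation pair, if any."""
    key = normalize_variation(mention)
    if key is None:
        return None
    return index.get((gene_id, key))
```

Keying the lookup on the gene identifier as well as the variation reflects the idea that a variation mention is only unambiguous once the gene it belongs to has been recognized and normalized.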

Main Results:

The novel approach demonstrates improved performance in recognizing and normalizing gene names and allelic variations.
Recall for variation mentions and gene names is higher than in earlier systems such as OSIRIS.
Successful linking of textual entities to specific dbSNP entries, facilitating data integration.
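
Since the results are stated in terms of recall, a short sketch of how precision, recall, and F1 are commonly computed over sets of extracted mentions may help. This is the standard definition rather than the evaluation protocol of the study, which the summary does not describe; the function name `precision_recall_f1` and the toy annotations are hypothetical.

```python
def precision_recall_f1(predicted, gold):
    """Compute precision, recall, and F1 over two collections of annotations."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical annotations: (document id, normalized variation mention).
gold = {("doc1", "A123T"), ("doc1", "G45S"), ("doc2", "P12L"), ("doc2", "R7K")}
predicted = {("doc1", "A123T"), ("doc2", "P12L"), ("doc2", "V9M")}

precision, recall, f1 = precision_recall_f1(predicted, gold)
```

In this toy case two of the four gold mentions are found, so recall is 0.5, while two of the three predictions are correct, so precision is about 0.67.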

Conclusions:

The integrated system significantly improves the accuracy and recall of extracting genetic variation information from biomedical texts.
This approach facilitates a more robust connection between scientific literature and curated genetic variation databases.
The method advances automated biomedical text mining for genetic variation studies.