LSAT: learning about alternative transcripts in MEDLINE.

Parantu K Shah¹, Peer Bork

  • ¹ European Molecular Biology Laboratory, Heidelberg, Germany. shah@embl.de

Bioinformatics (Oxford, England) | January 18, 2006
PubMed
Summary
This summary is machine-generated.

This study developed a two-step machine learning approach to identify and extract information on alternative transcripts, enhancing our understanding of functional diversity in eukaryotes.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Alternative transcripts contribute to functional diversity in eukaryotes.
  • Extracting information on alternative transcripts is crucial for biological research.
  • Current methods for transcript information extraction require improvement.

Purpose of the Study:

  • To develop and evaluate a machine learning-based system for extracting information on alternative transcripts.
  • To identify and categorize semantic information related to transcript diversity in scientific literature.
  • To create a database summarizing findings on alternative transcripts.

Main Methods:

  • A two-step procedure involving machine learning and information extraction was employed.
  • A Support Vector Machine (SVM) classifier was trained to identify relevant sentences in MEDLINE abstracts.
  • Semantic role labeling was performed on identified sentences to extract specific categories.
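The sentence-classification step can be illustrated with a toy example. The summary does not state which features, kernel, or training data the authors used, so the following is only a minimal sketch under stated assumptions: a linear SVM without a bias term, trained by hinge-loss subgradient descent on bag-of-words counts. The function names, hyperparameters, and example sentences are all hypothetical and are not taken from the study.

```python
from collections import defaultdict


def tokenize(sentence):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return sentence.lower().split()


def train_linear_svm(examples, epochs=20, lr=0.1, reg=0.01):
    """Train a linear SVM (no bias term) on bag-of-words counts.

    examples: list of (sentence, label) pairs, label is +1 or -1.
    Uses subgradient descent on the L2-regularized hinge loss.
    All hyperparameters are illustrative assumptions.
    """
    weights = defaultdict(float)
    for _ in range(epochs):
        for sentence, label in examples:
            tokens = tokenize(sentence)
            margin = label * sum(weights[t] for t in tokens)
            # L2 regularization: shrink every weight slightly.
            for t in list(weights):
                weights[t] *= 1.0 - lr * reg
            # Hinge loss is active only when the margin is below 1.
            if margin < 1.0:
                for t in tokens:
                    weights[t] += lr * label
    return weights


def predict(weights, sentence):
    """Return +1 (relevant to transcript diversity) or -1 (not relevant)."""
    score = sum(weights.get(t, 0.0) for t in tokenize(sentence))
    return 1 if score > 0 else -1


# Made-up training sentences, for illustration only.
TRAINING = [
    ("Alternative splicing of exon 4 yields two isoforms", 1),
    ("A shorter splice variant lacks exon 2", 1),
    ("The protein binds DNA in vitro", -1),
    ("Cells were grown at 37 degrees", -1),
]

model = train_linear_svm(TRAINING)
```

Calling `predict(model, "A shorter splice variant lacks exon 2")` returns `1` for this toy model. A real system would need a much larger labeled corpus and richer features than raw word counts.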

Main Results:

  • The SVM classifier achieved a 91% F-beta measure for sentence classification in cross-validation and 74% on a large corpus.
  • Eight frequently present semantic categories were identified in sentences related to transcript diversity.
  • The semantic role labeling step also demonstrated high F-beta measures for all identified categories.
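The F-beta measure cited in these results combines precision and recall into a single score. The summary does not report the value of beta or the underlying precision and recall, so the sketch below only shows the standard definition; the function names and the counts in the usage note are hypothetical.

```python
def precision_recall(true_pos, false_pos, false_neg):
    """Compute precision and recall from raw counts."""
    predicted = true_pos + false_pos
    actual = true_pos + false_neg
    precision = true_pos / predicted if predicted else 0.0
    recall = true_pos / actual if actual else 0.0
    return precision, recall


def f_beta(precision, recall, beta=1.0):
    """Standard F-beta: (1 + beta^2) * P * R / (beta^2 * P + R).

    beta > 1 weights recall more heavily, beta < 1 favors precision,
    and beta = 1 gives the harmonic mean of the two (F1).
    """
    b2 = beta * beta
    denom = b2 * precision + recall
    if denom == 0:
        return 0.0
    return (1 + b2) * precision * recall / denom
```

For example, with hypothetical counts of 8 true positives, 2 false positives, and 2 false negatives, `precision_recall(8, 2, 2)` gives precision and recall of 0.8 each, and `f_beta(0.8, 0.8)` is 0.8.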

Conclusions:

  • The developed two-step procedure effectively extracts information on alternative transcripts.
  • The LSAT database provides a valuable resource for alternative transcript information.
  • This approach advances the study of functional diversity through alternative splicing.