Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Features generated for computational splice-site prediction correspond to functional elements.

Rezarta Islamaj Dogan1, Lise Getoor, W John Wilbur

  • 1Computer Science Department, University of Maryland, College Park, MD 20742, USA. rezarta@cs.umd.edu

BMC Bioinformatics
|October 26, 2007
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Exploring Anti-Aging Literature via <i>ConvexTopics</i> and Large Language Models.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

Exploring Anti-Aging Literature via <i>ConvexTopics</i> and Large Language Models.

ArXiv·2026
Same author

Detection of a sequence feature for recursive splicing.

bioRxiv : the preprint server for biology·2026
Same author

AgentMD: Empowering language agents for risk prediction with large-scale clinical tool learning.

Nature communications·2025
Same author

AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning.

ArXiv·2025
Same author

MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval.

ArXiv·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

This study enhances splice site prediction by extending a feature generation algorithm (FGA) to 5' splice sites. The algorithm identifies known and novel splicing signals, improving accuracy for RNA splicing.

Area of Science:

  • Molecular Biology
  • Bioinformatics
  • Genomics

Background:

  • Accurate splice site selection is crucial for messenger RNA precursor splicing.
  • Requires well-characterized signals at splice sites and auxiliary signals in flanking regions.
  • Previous work developed a feature generation algorithm (FGA) for human 3' splice sites.

Purpose of the Study:

  • Extend splice-site prediction to 5' splice sites.
  • Explore generated features for biologically meaningful splicing signals.
  • Identify novel splicing signals beyond current methods.

Main Methods:

  • Feature generation algorithm (FGA) applied to 5' splice sites.
  • Analysis of sequence intervals flanking splice sites.
  • Classification of splice site accuracy.

Related Experiment Videos

Main Results:

  • Identified features corresponding to known core and auxiliary splicing signals (e.g., branch site, pyrimidine tract, GGG triplets, exon splicing enhancers).
  • Provided evidence that FGA identifies splicing signals missed by other methods.
  • Generated features capture known biological signals within expected sequence intervals.

Conclusions:

  • The FGA effectively captures known biological splicing signals.
  • The method is adaptable to other species and related classification tasks.
  • Potential applications include identifying tissue-specific regulatory elements, polyadenylation sites, and promoters.