Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

19.3K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
19.3K
Improving Translational Accuracy02:07

Improving Translational Accuracy

11.9K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Caregiver-Associated Physical Activity Patterns, Dietary Behaviors and Interventional Beliefs in Individuals with Down Syndrome: Insights from a Large European Survey.

Nutrients·2026
Same author

Understanding Obesity in Individuals with Down Syndrome: Caregiver Perceptions, Awareness, and Motivation.

Nutrients·2026
Same author

De novo design of RNA pseudoknots with deep learning.

bioRxiv : the preprint server for biology·2026
Same author

A susceptibility network analysis of disease trajectories leading to multiple sclerosis: A nationwide cohort study.

Multiple sclerosis (Houndmills, Basingstoke, England)·2026
Same author

Scalable and multiplexed recorders of gene regulation dynamics across weeks.

Nature·2026
Same author

Astrocytes from P301S Tau mice exhibit non-canonical protein secretion and reduced morphological complexity.

Neural regeneration research·2025
Same journal

A human-specific genetic modifier reconfigures large-scale cortical network dynamics underlying behavioral performance.

bioRxiv : the preprint server for biology·2026
Same journal

<i>Staphylococcus aureus</i> uses a eukaryotic-like uridyltransferase to make UDP-GlcNAc for cell wall synthesis.

bioRxiv : the preprint server for biology·2026
Same journal

Dynamic redistribution of eIF4F controls cap-dependent translation initiation.

bioRxiv : the preprint server for biology·2026
Same journal

When does additional information improve accuracy of RNA secondary structure prediction?

bioRxiv : the preprint server for biology·2026
Same journal

Normative brain-state trajectories reveal deviation from healthy aging in Alzheimer's disease.

bioRxiv : the preprint server for biology·2026
Same journal

Noradrenergic infraslow rhythm during sleep is the critical link between heart-rate dynamics and memory consolidation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Sep 15, 2025

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons
08:04

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

518

GhostBuster: A Deep-Learning-based, Literature-Unbiased Gene Prioritization Tool for Gene Annotation Prediction.

Giulio Deangeli1, Maria Grazia Spillantini1, Pietro Liò2

  • 1University of Cambridge, Department of Clinical Neurosciences, Clifford Allbutt Building, Hills Road, CB2 0HA Cambridge, UK.

Biorxiv : the Preprint Server for Biology
|July 16, 2025
PubMed
Summary
This summary is machine-generated.

GhostBuster is a novel machine learning platform that reduces literature bias in gene function prediction. It helps uncover the roles of understudied "ghost genes" in disease and biological networks.

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

913
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

9.9K

Related Experiment Videos

Last Updated: Sep 15, 2025

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons
08:04

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

518
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

913
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

9.9K

Area of Science:

  • Genomics
  • Bioinformatics
  • Machine Learning

Background:

  • A significant number of human protein-coding genes are poorly characterized, referred to as "ghost genes".
  • Research literature exhibits a "bandwagon effect", disproportionately focusing on well-annotated genes, which introduces bias.
  • This literature bias influences machine learning (ML) models, leading to predictions that favor well-studied genes and potentially overestimate biological relevance.

Purpose of the Study:

  • To develop a machine learning (ML) platform, GhostBuster, designed to predict gene functions, disease associations, and interactions while minimizing literature bias.
  • To evaluate the impact of biased (Gene Ontology) versus unbiased training datasets (LINCS, TCGA, STRING) on ML model performance and bias amplification.

Main Methods:

  • Developed GhostBuster, an encoder-decoder ML platform.
  • Compared ML models trained on literature-biased datasets against those trained on unbiased datasets (LINCS, TCGA, STRING).
  • Assessed the models' effectiveness in identifying novel gene annotations and predicting gene functions, disease associations, and interactions.

Main Results:

  • Literature-biased datasets yielded higher ML metrics but amplified existing biases.
  • Models trained on unbiased datasets were 2-3 times more effective at identifying recently discovered gene annotations.
  • The TCGA dataset, with minimal literature bias, demonstrated robust performance (ROC-AUC of 0.8-0.95).

Conclusions:

  • GhostBuster is the first ML framework explicitly designed to counteract literature bias in gene annotation.
  • The platform can predict novel gene functions, refine pathway memberships, and prioritize intergenic GWAS hits.
  • GhostBuster provides a powerful tool for exploring the roles of understudied genes in cellular function, disease, and molecular networks.