Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
DNA Microarrays02:34

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

vir2vec: A Viral Genome-Wide Viral Embedding.

bioRxiv : the preprint server for biology·2025
Same author

Reconstruction of a Risk Analysis Tool That Uses Nursing Data for Frailty Assessment Among Older Adults: Derivation and Internal Evaluation Study.

JMIR aging·2025
Same author

The Burden of Cancer and Precancerous Conditions Among Transgender Individuals in a Large Health Care Network: Retrospective Cohort Study.

JMIR cancer·2025
Same author

SARITA: a large language model for generating the S1 subunit of the SARS-CoV-2 spike protein.

Briefings in bioinformatics·2025
Same author

Leveraging large language models to predict antibiotic resistance in Mycobacterium tuberculosis.

Bioinformatics (Oxford, England)·2025
Same author

Behavioral and Demographic Profiles of HIV Transmission and Exposure Networks in Florida: Network Analysis of HIV Contact Tracing Data.

JMIR public health and surveillance·2025
Same journal

Readability of AI-Generated Patient Information on Glucagon-Like Peptide-1 Receptor Agonists.

JMIR bioinformatics and biotechnology·2026
Same journal

Random Survival Forest Versus Elastic-Net Regularized Cox Regression for Survival Prediction in Acute Myeloid Leukemia at Distinct Treatment Time Points: Model Performance Comparison Study.

JMIR bioinformatics and biotechnology·2026
Same journal

Temporal Reproducibility of a Genetic Algorithm-Derived Health Risk Score: Standardized Out-of-Fold Validation Framework (2021-2023).

JMIR bioinformatics and biotechnology·2026
Same journal

The AudioGene Translational Dashboard for Diagnosing Autosomal Dominant Nonsyndromic Hearing Loss: Phenotypic Data Visualization and Analysis Study.

JMIR bioinformatics and biotechnology·2026
Same journal

A Strategic Partnership to Advance AI Applications in Genomics and Bioinformatics for Health Innovation.

JMIR bioinformatics and biotechnology·2026
Same journal

Prevalence and Associated Risk Factors of Bovine Fasciolosis in Bahir Dar, Ethiopia: Cross-Sectional Study.

JMIR bioinformatics and biotechnology·2026
See all related articles

Related Experiment Video

Updated: Jun 28, 2026

Author Spotlight: Finding New Therapeutic Targets for Malignant Peripheral Nerve Sheath Tumor Through Genome-Scale shRNA Screens
09:33

Author Spotlight: Finding New Therapeutic Targets for Malignant Peripheral Nerve Sheath Tumor Through Genome-Scale shRNA Screens

Published on: August 25, 2023

1.6K

Paired-Sample and Pathway-Anchored MLOps Framework for Robust Transcriptomic Machine Learning in Small Cohorts: Model

Mahdieh Shabanian1, Nima Pouladi1, Liam Wilson1

  • 1Biomedical Informatics, University of Utah, 421 Wakara Suit 140, Salt Lake City, UT, 84108, United States, 1 7736143736.

JMIR Bioinformatics and Biotechnology
|December 4, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a novel machine learning (ML) approach for classifying rare diseases using paired-sample transcriptome data. The method enhances accuracy and overcomes small cohort limitations in transcriptomic classification.

Keywords:
MLOpsN-of-1Random Forest classifierW&Bablation analysismachine learningsmall cohortsweight and biases

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.2K

Related Experiment Videos

Last Updated: Jun 28, 2026

Author Spotlight: Finding New Therapeutic Targets for Malignant Peripheral Nerve Sheath Tumor Through Genome-Scale shRNA Screens
09:33

Author Spotlight: Finding New Therapeutic Targets for Malignant Peripheral Nerve Sheath Tumor Through Genome-Scale shRNA Screens

Published on: August 25, 2023

1.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.2K

Area of Science:

  • Genomics and Computational Biology
  • Machine Learning in Medicine
  • Rare Disease Research

Background:

  • Over 90% of human diseases are rare, affecting millions globally and posing challenges for research.
  • Low disease prevalence limits cohort sizes, hindering the development of robust transcriptome-based machine learning (ML) classifiers.
  • Standard ML models require large cohorts (>100 participants) for accuracy, which is infeasible for rare diseases with small patient groups, leading to overfitting.

Purpose of the Study:

  • To develop an ML classification method that overcomes cohort size limitations in rare disease research.
  • To integrate paired-sample transcriptome dynamics, N-of-1 pathway analytics, and MLOps for robust classification.
  • To enhance the accuracy and interpretability of ML models for high-dimensional transcriptomic data.

Main Methods:

  • Utilized within-subject paired-sample transcriptome data to control for individual variability and improve signal-to-noise ratio.
  • Implemented N-of-1 pathway-level analytics to reduce high-dimensional transcriptomic profiles into interpretable biological features.
  • Integrated reproducible machine learning operations (MLOps) for automated versioning, monitoring, and hyperparameter tuning to enhance model generalization.

Main Results:

  • Achieved 90% precision and recall in breast cancer classification and 92% precision with 90% recall in rhinovirus infection classification.
  • Paired-sample dynamics improved precision by up to 12% and recall by 13% in breast cancer, and 5% in rhinovirus.
  • MLOps workflows increased accuracy by ~14.5% compared to traditional methods, identifying key biological pathways for disease classification.

Conclusions:

  • The integrated approach of intrasubject dynamics, pathway-level feature reduction, and MLOps effectively addresses cohort size limitations in rare disease transcriptomic classification.
  • This method offers a scalable and interpretable solution for analyzing high-dimensional transcriptomic data in rare diseases.
  • Future research will focus on applying these advances to diverse therapeutic areas and small cohort study designs.