Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Resolving the Vagueness of Quantifiers With Explicit Expectations.

Open mind : discoveries in cognitive science·2026
Same author

Snippets from the 28th Congress of the South African Society for Biochemistry and Molecular Biology.

Advances in experimental medicine and biology·2026
Same author

Reducing Supply Chain Dependencies for Viral Genomic Surveillance: Get by with a Little HELP from Commercial Enzymes already in your Lab Freezer.

Wellcome open research·2026
Same author

Tollip antagonizes ESCRT-III-mediated plasma membrane repair and cell recovery.

Cell regeneration (London, England)·2026
Same author

Repurposing of Inhibitors of Plasmodial Aspartate Transcarbamoylase Toward Trypanosoma Cruzi.

ChemMedChem·2026
Same author

Adjuvant Chemotherapy with Traditional Chinese Herbal Granules Versus Placebo in Resected Non-Small Cell Lung Cancer: Updated Survival Analysis of a Multicenter, Randomized, Double-Blind, Placebo-Controlled Trial.

Integrative cancer therapies·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
Same journal

Benchmarking DNA barcode decoding strategies under high error rates.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: May 21, 2026

Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing
11:02

Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing

Published on: October 18, 2013

Machine learning framework for cost effective deep mutational scanning through targeted substitution profiling.

Emily Morgan1, Shaylyn Govender1, Prashant Singh2

  • 1Research Unit in Bioinformatics (RUBi), Department of Biochemistry, Microbiology and Bioinformatics, Rhodes University, Makhanda, 6139, South Africa.

BMC Bioinformatics
|May 20, 2026
PubMed
Summary
This summary is machine-generated.

Machine learning (ML) models accurately predict protein variant effects using targeted deep mutational scanning (DMS). This approach reduces experimental costs and accelerates variant effect prediction for enzymes like SARS-CoV-2 main protease.

Keywords:
Brute-force approachDeep mutational scanningRandom ForestSupport vector machineZero-shot prediction

More Related Videos

Implementation of In Vitro Drug Resistance Assays: Maximizing the Potential for Uncovering Clinically Relevant Resistance Mechanisms
08:46

Implementation of In Vitro Drug Resistance Assays: Maximizing the Potential for Uncovering Clinically Relevant Resistance Mechanisms

Published on: December 9, 2015

A Protocol for Functional Assessment of Whole-Protein Saturation Mutagenesis Libraries Utilizing High-Throughput Sequencing
11:36

A Protocol for Functional Assessment of Whole-Protein Saturation Mutagenesis Libraries Utilizing High-Throughput Sequencing

Published on: July 3, 2016

Related Experiment Videos

Last Updated: May 21, 2026

Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing
11:02

Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing

Published on: October 18, 2013

Implementation of In Vitro Drug Resistance Assays: Maximizing the Potential for Uncovering Clinically Relevant Resistance Mechanisms
08:46

Implementation of In Vitro Drug Resistance Assays: Maximizing the Potential for Uncovering Clinically Relevant Resistance Mechanisms

Published on: December 9, 2015

A Protocol for Functional Assessment of Whole-Protein Saturation Mutagenesis Libraries Utilizing High-Throughput Sequencing
11:36

A Protocol for Functional Assessment of Whole-Protein Saturation Mutagenesis Libraries Utilizing High-Throughput Sequencing

Published on: July 3, 2016

Area of Science:

  • Biochemistry
  • Computational Biology
  • Genomics

Background:

  • Deep mutational scanning (DMS) generates extensive protein variant effect maps but is labor-intensive.
  • Machine learning (ML) offers a promising avenue to predict functional impacts from limited experimental data, potentially reducing experimental costs.

Purpose of the Study:

  • To develop and evaluate an ML classifier for predicting the functional impact of amino acid substitutions.
  • To assess the feasibility of using ML with targeted DMS to reduce experimental burden while maintaining accuracy.

Main Methods:

  • Trained an ML classifier on normalized DMS scores from SARS-CoV-2 main protease (Mpro).
  • Employed brute-force feature selection to identify minimal feature subsets for accurate classification.
  • Evaluated classifier performance using zero-shot prediction on independent datasets, including other enzymes and protein interaction/folding systems.

Main Results:

  • Achieved classification accuracy exceeding 90% with minimal feature subsets (six substitution scores per residue).
  • Demonstrated strong zero-shot prediction performance on other enzymes.
  • Observed more modest performance on systems assessing protein folding and/or protein-protein interactions.

Conclusions:

  • Targeted DMS combined with ML significantly reduces sequencing and reagent costs.
  • This integrated approach offers a practical and efficient method for accelerating variant effect prediction.
  • The findings highlight the potential of ML to streamline functional genomics studies.