Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Complexes with Interchangeable Parts01:57

Protein Complexes with Interchangeable Parts

2.5K
Groups of proteins may form a complex where each protein in this complex has a different role in the overall execution of the complex’s function. Often some of the proteins in the complex can be replaced by a closely related variant to give a complex that contains many of the same components yet is functionally distinct.
The SCF ubiquitin ligase is a protein complex of five individual proteins. This complex attaches ubiquitin to other target proteins to mark them for degradation. In order...
2.5K
Predicting Reaction Outcomes02:24

Predicting Reaction Outcomes

8.2K
Kinetics describes the rate and path by which a reaction occurs. In contrast, thermodynamics deals with state functions and describes the properties, behavior, and components of a system. It is not concerned with the path taken by the process and cannot address the rate at which a reaction occurs. Although it does provide information about what can happen during a reaction process, it does not describe the detailed steps of what appears on an atomic or a molecular level. On the other hand,...
8.2K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

7.0K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
7.0K
Factors Affecting Dissolution: Polymorphism, Amorphism and Pseudopolymorphism01:21

Factors Affecting Dissolution: Polymorphism, Amorphism and Pseudopolymorphism

288
Polymorphism refers to the existence of a drug substance in multiple crystalline forms, known as polymorphs. Recently, this term has been expanded to include solvates (forms containing a solvent), amorphous forms (non-crystalline forms), and desolvated solvates (forms from which the solvent has been removed).
Some polymorphic crystals possess lower aqueous solubility than their amorphous counterparts, leading to incomplete absorption. For instance, the oral suspension of Chloramphenicol, which...
288
Protein Folding01:22

Protein Folding

117.5K
Overview
117.5K
Variability: Analysis01:11

Variability: Analysis

128
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
128

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Plasma GDF15 affects long-term dementia risk and alters neuroimmune signaling.

Science advances·2026
Same author

Integrative machine learning and gene regulatory network analysis identifies novel genes for leafy head formation in Brassica rapa and B. oleracea.

Molecular horticulture·2026
Same author

Genus-wide Homologous Recombination of Tail Fibers Maintains Tailocin Diversity in Pectobacterium.

Genome biology and evolution·2026
Same author

Association between mushroom consumption and digit span performance among middle-aged and older community-dwelling Japanese: the NILS-LSA project.

Food & function·2026
Same author

Multi-trajectories of intrinsic capacity and their impact on incident disability in community-dwelling older adults in Japan.

Archives of gerontology and geriatrics·2026
Same author

Ten-year longitudinal effects of physical activity and apolipoprotein E ..4 genotype on precuneus atrophy in Japanese older adults.

Archives of gerontology and geriatrics·2026
Same journal

Functional Genomic Evidence for Candidate Small Viral RNA-Mediated Epigenetic Interference in SARS-CoV-1 and SARS-CoV-2.

Computational and structural biotechnology journal·2026
Same journal

From Pixels to Patterns: A Multidimensional Framework to Decode Cytoskeletal Organization.

Computational and structural biotechnology journal·2026
Same journal

A Large Concept Model for Mechanistic Simulation of Disease Trajectories: A Hypothesis-Generating Exemplar for Pediatric Acute Lymphoblastic Leukemia.

Computational and structural biotechnology journal·2026
Same journal

Adversarial Sequence Mutations in AlphaFold and ESMFold Reveal Nonphysical Structural Invariance, Confidence Failures, and Concerns for Protein Design.

Computational and structural biotechnology journal·2026
Same journal

High-Throughput Prediction of Protein-Protein Interactions Uncovers Hidden Molecular Networks in Biosynthetic Gene Clusters.

Computational and structural biotechnology journal·2026
Same journal

A Region-Aware Structured Framework Improves Prediction of Gene Expression from DNA Methylation.

Computational and structural biotechnology journal·2026
See all related articles

Related Experiment Video

Updated: Jun 9, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.6K

Enzyme structure correlates with variant effect predictability.

Floris van der Flier1, Dave Estell2, Sina Pricelius3

  • 1Department of Plant Sciences, Wageningen University & Research, Wageningen, 6708 PB, the Netherlands.

Computational and Structural Biotechnology Journal
|October 22, 2024
PubMed
Summary
This summary is machine-generated.

Machine learning models for protein engineering show variable accuracy. Structural characteristics like buriedness and active site proximity significantly impact prediction error, suggesting improvements for variant effect prediction (VEP) models.

Keywords:
Machine learningPredictabilityProtein engineering

More Related Videos

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K
Following the Dynamics of Structural Variants in Experimentally Evolved Populations
04:52

Following the Dynamics of Structural Variants in Experimentally Evolved Populations

Published on: February 3, 2023

920

Related Experiment Videos

Last Updated: Jun 9, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.6K
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K
Following the Dynamics of Structural Variants in Experimentally Evolved Populations
04:52

Following the Dynamics of Structural Variants in Experimentally Evolved Populations

Published on: February 3, 2023

920

Area of Science:

  • Computational biology
  • Protein engineering
  • Machine learning

Background:

  • Machine learning models are increasingly used for pre-screening protein variants.
  • Current models show variable prediction accuracy for different protein variants.
  • The characteristics of variants with large prediction errors are not well understood.

Purpose of the Study:

  • To investigate whether structural characteristics influence the predictability of protein variant effects.
  • To identify specific structural features associated with prediction errors in machine learning models.
  • To guide improvements in machine learning-guided protein engineering.

Main Methods:

  • Created a high-order combinatorial dataset of 3,706 enzyme variants.
  • Partitioned the dataset based on structural characteristics (buriedness, contact residues, active site proximity, secondary structure).
  • Trained and evaluated four supervised variant effect prediction (VEP) models on these partitioned subsets.

Main Results:

  • Predictability strongly depended on all four tested structural characteristics.
  • These dependencies were consistent across all four tested VEP models.
  • The findings indicate that current algorithms do not fully account for structure-function determinants.

Conclusions:

  • Structural characteristics significantly influence the predictability of protein variant effects.
  • Improvements in VEP models can be achieved by incorporating new inductive biases and data modalities.
  • Stratified dataset design is crucial for identifying areas for improvement in machine learning-guided protein engineering.