Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein and Protein Structure02:15

Protein and Protein Structure

87.5K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
87.5K
Structural Protein Function01:56

Structural Protein Function

29.9K
Structural proteins are a category of proteins responsible for functions ranging from cell shape and movement to providing support to major structures such as bones, cartilage, hair, and muscles. This group includes proteins such as collagen, actin, myosin, and keratin.
Collagen, the most abundant protein in mammals, is found throughout the body. In connective tissue, such as skin, ligaments, and tendons, it provides tensile strength and elasticity.  In bones and teeth, it mineralizes to...
29.9K
Structural Protein Function01:56

Structural Protein Function

3.3K
3.3K
Protein and Protein Structures02:15

Protein and Protein Structures

19.1K
19.1K
Activation and Inactivation of G Proteins01:22

Activation and Inactivation of G Proteins

11.4K
Heterotrimeric G proteins are guanine nucleotide-binding proteins. As the name suggests, heterotrimeric G proteins are composed of three subunits: alpha, beta, and gamma. They remain GDP-bound or GTP-bound inside the cells and switch between inactive/active states. The Gα subunit possesses the nucleotide-binding pocket that binds guanine nucleotides and switches between GDP or GTP-bound states. In contrast, the Gꞵ and Gγ subunits are always bound together with high...
11.4K
Structure-Activity Relationships and Drug Design01:28

Structure-Activity Relationships and Drug Design

1.8K
Drug design is a dynamic field that involves discovering and developing new medications based on specific biological targets. This process heavily relies on structure-activity relationships (SAR) and quantitative structure-activity relationships (QSAR) to guide the design and optimization of efficient drugs.
SAR studies the intricate relationship between a drug's chemical structure and biological activity. It focuses on understanding how modifications to a drug's structure can influence...
1.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

SNAC-DB: An ML-ready database for antibody and NANOBODY® VHH-antigen complexes with expanded structural diversity and real-world benchmarking.

Protein science : a publication of the Protein Society·2026
Same author

Evaluation of hydrological models at gauged and ungauged basins using machine learning-based limits-of-acceptability and hydrological signatures.

Journal of hydrology·2026
Same author

Isolated Marginal Mandibular Nerve Injury Due to Prone Positioning During Cervical Spine Surgery: A Case Report.

Cureus·2026
Same author

Advancement in therapeutic application of quantum dots in amyotrophic lateral sclerosis: current opportunities and challenges.

Drug delivery and translational research·2026
Same author

Targeting BCMA in multiple myeloma with a trifunctional NK cell engager.

Cell reports. Medicine·2026
Same author

Exploring the therapeutic potential of crocin, a bioactive component of saffron, in neurocognitive deficit and pulmonary impairment induced by inhalational exposure to petroleum vapours.

Molecular biology reports·2026
Same journal

GMSA: A Graph Matching and Point Cloud Registration-Based Method for Spatial Transcriptomics Data Alignment.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Investigations on Multiple Protein Scaffold Filling.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Cell Type Prediction for Single-Cell RNA Sequencing Utilizing Unsupervised Domain Adaptation and Semi-Supervised Learning.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

PPIGAN: Prediction of Protein-Protein Interactions Using Generative Adversarial Networks.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Deep Structure-Enhanced Cell Clustering Model for Single-Cell RNA Sequencing Data.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Asymmetric Drug-Drug Interaction Prediction Based on Generative Adversarial Networks and Knowledge Graph.

Journal of computational biology : a journal of computational molecular cell biology·2026
See all related articles

Related Experiment Video

Updated: Jan 31, 2026

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

2.6K

Deep Batch Active Learning for Protein Structure Modeling.

Zexin Xue1, Michael Bailey1, Abhinav Gupta2

  • 1R&D Data & Computational Science, Sanofi, Cambridge, Massachusetts, USA.

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology
|January 30, 2026
PubMed
Summary
This summary is machine-generated.

DEWDROP is a new active learning method that strategically selects data to improve molecular structure prediction for underrepresented proteins like VHH antibodies, enhancing model performance efficiently.

Keywords:
active learning for regressionbatch optimizationprotein structure prediction

More Related Videos

Deep Learning-Based Segmentation of Cryo-Electron Tomograms
10:25

Deep Learning-Based Segmentation of Cryo-Electron Tomograms

Published on: November 11, 2022

10.8K
One-step Negative Chromatographic Purification of Helicobacter pylori Neutrophil-activating Protein Overexpressed in Escherichia coli in Batch Mode
10:44

One-step Negative Chromatographic Purification of Helicobacter pylori Neutrophil-activating Protein Overexpressed in Escherichia coli in Batch Mode

Published on: June 18, 2016

10.1K

Related Experiment Videos

Last Updated: Jan 31, 2026

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

2.6K
Deep Learning-Based Segmentation of Cryo-Electron Tomograms
10:25

Deep Learning-Based Segmentation of Cryo-Electron Tomograms

Published on: November 11, 2022

10.8K
One-step Negative Chromatographic Purification of Helicobacter pylori Neutrophil-activating Protein Overexpressed in Escherichia coli in Batch Mode
10:44

One-step Negative Chromatographic Purification of Helicobacter pylori Neutrophil-activating Protein Overexpressed in Escherichia coli in Batch Mode

Published on: June 18, 2016

10.1K

Area of Science:

  • Structural biology and bioinformatics
  • Computational drug discovery and development
  • Machine learning applications in molecular modeling

Background:

  • Accurate molecular structure prediction is crucial for pharmaceutical research and understanding protein function.
  • Current deep learning models, while advanced, exhibit limitations in predicting structures for underrepresented molecules such as VHH antibodies.
  • Experimental structure determination is time-consuming and costly, making large-scale data collection for model training impractical.

Purpose of the Study:

  • To develop a strategic data selection method, DEWDROP, to improve the performance of molecular structure prediction models through iterative fine-tuning.
  • To address the challenge of underrepresented molecular domains in existing training datasets by optimizing the curation of new experimental data.
  • To enable superior model performance with reduced iterations and costs by maximizing the information content of selected structures.

Main Methods:

  • Proposed DEWDROP, an active learning selection method utilizing Monte Carlo dropout to generate prediction ensembles for optimal data selection.
  • Employed a structured prediction model, Equifold, based on coarse-grain molecular representations, independent of multiple sequence alignments.
  • Conducted retrospective iterative fine-tuning experiments and batch selection analysis on VHH antibodies (SAbDab-nano) and *Mycobacterium leprae* proteins (AlphaFold Protein Database).

Main Results:

  • DEWDROP significantly improved model training efficiency through optimized batch selection, outperforming baseline methods in iterative fine-tuning.
  • The method successfully identified and selected structurally informative data with high information content, crucial for enhancing prediction accuracy.
  • Demonstrated DEWDROP's effectiveness and broader applicability across different molecular domains beyond VHH antibodies.

Conclusions:

  • DEWDROP offers a model-agnostic approach for strategic data selection in structural biology, particularly beneficial for underrepresented molecular families.
  • The active learning strategy enhances the efficiency and cost-effectiveness of improving deep learning models for molecular structure prediction.
  • This method facilitates superior model performance by maximizing the value of newly acquired experimental structural data.