Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.9K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.9K
Improving Translational Accuracy02:07

Improving Translational Accuracy

11.6K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.6K
Signal Sequences and Sorting Receptors01:41

Signal Sequences and Sorting Receptors

5.4K
Signal sequences are short amino acid sequences that guide newly synthesized proteins to their proper location within the cell. Classical signal sequences are fifteen to sixty amino acids long and present at the N-terminus of a polypeptide chain. Each signal sequence has a conserved segment of basic residues towards their N terminus, a hydrophobic core, and a C-terminus rich in polar residues. The C-terminus also contains a signal cleavage site and features a -3 -1 sequence motif. The -3-1...
5.4K
Protein and Protein Structure02:15

Protein and Protein Structure

79.7K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
79.7K
From DNA to Protein03:06

From DNA to Protein

18.5K
The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
18.5K
Conservation of Protein Domains02:26

Conservation of Protein Domains

3.1K
3.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

StrucNS reveals interaction-weighted network topology as the driving predictor of absolute stability of natural and de novo proteins.

bioRxiv : the preprint server for biology·2026
Same author

Engineering a β-Sheet Enables Bispecific Binding in Single VHH Domains.

ACS synthetic biology·2026
Same author

Element-wise and Recursive Solutions for the Power Spectral Density of Biological Stochastic Dynamical Systems at Fixed Points.

Physical review research·2026
Same author

Persistently Increased Expression of PKMzeta and Unbiased Gene Expression Profiles Identify Hippocampal Molecular Traces of a Long-Term Active Place Avoidance Memory and "Shadow" Proteins.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026
Same author

Emergent universal long-range structure in random-organizing systems.

Nature communications·2026
Same author

PropMolFlow: property-guided molecule generation with geometry-complete flow matching.

Nature computational science·2026
Same journal

Breaking the Stability-Activity-Selectivity Trilemma in Unspecific Peroxygenase through Computation-Based Cross-Regional Combinatorial Mutagenesis.

ACS synthetic biology·2026
Same journal

Sequential Plasmid Curing and Genome Editing in <i>Escherichia coli</i> Nissle 1917.

ACS synthetic biology·2026
Same journal

An Explainable Deep Learning Framework Integrating DNA Sequence and Transcription Initiation Signals for Gene Expression Prediction.

ACS synthetic biology·2026
Same journal

A Multitask Prediction Framework for CircRNAs, Drugs, and Diseases Based on Multi-View Information Integration and Graph Contrastive Learning.

ACS synthetic biology·2026
Same journal

Engineering Modular Cargo Loading Strategies for Carboxysome-Derived Protein Particles.

ACS synthetic biology·2026
Same journal

Suppression of Salmonella Effectors with CRISPRi Controls the Immune Response to Bacterial Therapies.

ACS synthetic biology·2026
See all related articles

Related Experiment Video

Updated: Jul 17, 2025

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.3K

Predicting and Interpreting Protein Developability Via Transfer of Convolutional Sequence Representation.

Alexander W Golinski1, Zachary D Schmitz1, Gregory H Nielsen1

  • 1Department of Chemical Engineering and Materials Science, University of Minnesota, Minneapolis, Minnesota 55455, United States.

ACS Synthetic Biology
|August 29, 2023
PubMed
Summary
This summary is machine-generated.

Neural networks predict protein developability from amino acid sequences using high-throughput data. This approach improves recombinant expression prediction and visualizes protein fitness landscapes.

Keywords:
developabilitylandscapemodelpredictiveproteinsequence

More Related Videos

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

89
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.7K

Related Experiment Videos

Last Updated: Jul 17, 2025

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.3K
Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

89
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.7K

Area of Science:

  • Protein Engineering
  • Computational Biology
  • Biophysics

Background:

  • Engineered proteins are valuable tools but often suffer from poor developability (expression, solubility, stability).
  • Predicting protein developability from amino acid sequence can streamline candidate selection and reduce experimental costs.
  • High-throughput screening has generated large datasets for protein developability, enabling machine learning approaches.

Purpose of the Study:

  • To evaluate neural networks' ability to learn protein developability representations from high-throughput data.
  • To predict recombinant expression for unseen protein sequences.
  • To visualize and understand the protein fitness landscape and key amino acid properties influencing developability.

Main Methods:

  • Development of a neural network model using a high-throughput developability dataset for the Gp2 protein scaffold.
  • Convolutional approach to learn amino acid properties and predict expression levels.
  • Nonlinear dimensionality reduction and nested sampling for fitness landscape analysis.

Main Results:

  • The neural network model predicted expression levels 44% closer to experimental variance than a control.
  • Learned amino acid embeddings revealed the significance of cysteine, hydrophobicity, and charge, while aromaticity was less important for small protein developability.
  • Direct visualization of the protein fitness landscape identified evolutionary bottlenecks and distinct subpopulations.

Conclusions:

  • Neural networks can effectively predict protein expression and interpret developability from limited data.
  • Understanding amino acid contributions and fitness landscapes aids in designing improved protein scaffolds.
  • This work bridges applied protein engineering with foundational biophysics for characterizing protein developability.