Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Signal Sequences and Sorting Receptors01:41

Signal Sequences and Sorting Receptors

5.4K
Signal sequences are short amino acid sequences that guide newly synthesized proteins to their proper location within the cell. Classical signal sequences are fifteen to sixty amino acids long and present at the N-terminus of a polypeptide chain. Each signal sequence has a conserved segment of basic residues towards their N terminus, a hydrophobic core, and a C-terminus rich in polar residues. The C-terminus also contains a signal cleavage site and features a -3 -1 sequence motif. The -3-1...
5.4K
Overview of Protein Sorting and Transport01:45

Overview of Protein Sorting and Transport

11.3K
Eukaryotic cells have different membrane-bound organelles with distinct protein requirements. The process by which proteins are targeted to a specific organelle is called protein sorting.
Protein sorting can be of two types: signal-based sorting and vesicle-based trafficking. In signal-based sorting, specific amino acid sequences called sorting signals target proteins to the proper location inside the cell either via gated transport or by protein translocation.  In gated transport, folded...
11.3K
Protein Networks02:26

Protein Networks

3.9K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
3.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Epidural anesthesia not associated with decreased 30-day surgical site infection occurrence after open colorectal surgery.

Antimicrobial stewardship & healthcare epidemiology : ASHE·2026
Same author

Analysis of Patient Presentation to the ED within 90-days of Infra-inguinal Bypass.

Annals of vascular surgery·2026
Same author

Pre-Saccadic Suppression is Reduced for Anti-Saccades.

Journal of neurophysiology·2026
Same author

Oligodendrocyte Piezo2 is a regulator of age-dependent myelin integrity and dysregulated in multiple sclerosis.

Communications biology·2026
Same author

Aetiology and Management of Acute Septic Arthritis and Prosthetic Joint Infection Presentations to Orthopaedics: An Evaluation of Tertiary Centre Performance.

Cureus·2026
Same author

Images in Clinical Neurology: A Case of Poorly Differentiated Chondrosarcoma.

The Neurohospitalist·2026
Same journal

Chemotactic self-organization captures the dynamics of mammalian hair follicle patterning.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Tomographic imaging of superconducting order using particle-hole interference.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Inhibitory potential of autologous neutralizing antibodies sets quantitative limits on the rebound-competent HIV-1 reservoir.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Inferring epidemiological parameters under an infectious phylogeography model with visitor dynamics.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Analytical modeling for suction cup designs for skin-interfaced wearable devices.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Improving cell-free metabolism through direct integration of artificial respiratory chains.

Proceedings of the National Academy of Sciences of the United States of America·2026
See all related articles

Related Experiment Video

Updated: Jul 1, 2025

Single-Cell Factor Localization on Chromatin using Ultra-Low Input Cleavage Under Targets and Release using Nuclease
09:20

Single-Cell Factor Localization on Chromatin using Ultra-Low Input Cleavage Under Targets and Release using Nuclease

Published on: February 1, 2022

2.7K

Machine learning to predict continuous protein properties from binary cell sorting data and map unseen sequence

Marshall Case1, Matthew Smith1,2, Jordan Vinh3

  • 1Chemical Engineering, University of Michigan, Ann Arbor, MI 48109.

Proceedings of the National Academy of Sciences of the United States of America
|March 7, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a machine learning framework to predict protein properties from directed evolution experiments. The approach uses linear models to efficiently identify optimized protein variants with improved functions.

Keywords:
directed evolutionmachine learningprotein engineering

More Related Videos

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.4K
Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens
09:14

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens

Published on: June 28, 2018

7.2K

Related Experiment Videos

Last Updated: Jul 1, 2025

Single-Cell Factor Localization on Chromatin using Ultra-Low Input Cleavage Under Targets and Release using Nuclease
09:20

Single-Cell Factor Localization on Chromatin using Ultra-Low Input Cleavage Under Targets and Release using Nuclease

Published on: February 1, 2022

2.7K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.4K
Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens
09:14

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens

Published on: June 28, 2018

7.2K

Area of Science:

  • Biochemistry and Molecular Biology
  • Protein Engineering
  • Computational Biology

Background:

  • Proteins are essential biomolecules with diverse cellular functions.
  • Protein engineering aims to rapidly evolve proteins for improved properties.
  • High-throughput methods enhance directed evolution, but data interpretation remains challenging.

Purpose of the Study:

  • To develop a framework for predicting continuous protein properties from directed evolution data.
  • To improve protein optimization by leveraging interpretable, linear machine learning models.
  • To identify lead protein candidates with enhanced functions.

Main Methods:

  • Developed a framework using interpretable, linear machine learning models.
  • Utilized data from simple, imprecise experimental estimates of protein fitness.
  • Applied the framework to predict affinity and specificity from cell sorting data for stapled peptides.
  • Coupled integer linear programming with machine learning mutation scores for optimization.

Main Results:

  • Linear machine learning models accurately predict continuous protein properties from directed evolution data.
  • The framework's predictive capabilities approach those of more precise but expensive methods.
  • Protein fitness space is reasonably modeled by linear relationships among sequence mutations.
  • Successfully identified protein variants with improved and co-optimal properties in prospective tests.

Conclusions:

  • The developed framework offers a versatile tool for analyzing and identifying improved protein variants.
  • Predicting continuous protein properties from readily available deep sequencing data is feasible.
  • Linear relationships effectively model protein fitness landscapes, enabling efficient optimization.