Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Optimal data collection for correlated mutation analysis.

Haim Ashkenazy1, Ron Unger, Yossef Kliger

  • 1Compugen LTD, Tel Aviv 69512, Israel.

Proteins
|July 26, 2008
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Biosynthetic gene clusters in <i>Pseudomonas viridiflava</i> have a fitness cost during <i>Arabidopsis thaliana</i> infection.

mSystems·2026
Same author

Clustering the protein universe of life using DIAMOND DeepClust.

Nature methods·2026
Same author

Using machine learning to predict and analyze complex trait diseases: Lessons from a simple abstract model.

PloS one·2026
Same author

Machine learning-based prediction of large-for-gestational-age neonates in diabetic and non-diabetic pregnancies.

International journal of gynaecology and obstetrics: the official organ of the International Federation of Gynaecology and Obstetrics·2025
Same author

A comparison of 27 Arabidopsis thaliana genomes and the path toward an unbiased characterization of genetic polymorphism.

Nature genetics·2025
Same author

Targeting the Cargo Receptor TMED9 as a Therapeutic Strategy Against Brain Tumors.

Cells·2025
Same journal

Engineered HSP90-MP65 Bivalent Fusion Antigen: A Novel Vaccine Candidate Against Invasive Candidiasis.

Proteins·2026
Same journal

Physics-Based Energy Functions for Computational Protein Design.

Proteins·2026
Same journal

Impact of Stabilizing Osmolytes on the Conformational Dynamics of Human and Rat Islet Amyloid Polypeptides.

Proteins·2026
Same journal

Stabilization of Bone Morphogenetic Protein-2 at Physiological pH: Contrasting Roles of CHAPS and Arginine in Aggregation Inhibition.

Proteins·2026
Same journal

Structural Insights Into the Function of Leishmania major Adenylosuccinate Lyase.

Proteins·2026
Same journal

Generalizing the Gaussian Network Model: Spanning-Tree Thermodynamics Shows Entropy-Driven KRAS Activation.

Proteins·2026
See all related articles

Correlated mutation analysis (CMA) performance improves by including more homologs, including paralogs, in multiple sequence alignments (MSAs). This data collection strategy enhances accuracy for predicting protein residue interactions from sequence data.

Area of Science:

  • Structural Biology
  • Bioinformatics
  • Computational Biology

Background:

  • Correlated mutation analysis (CMA) aims to predict intraprotein residue-residue interactions using only protein sequence data.
  • Current CMA methods exhibit limited performance despite advancements in algorithms and computational power.

Purpose of the Study:

  • To investigate the impact of sequence selection within multiple sequence alignments (MSAs) on CMA method performance.
  • To determine the extent to which MSA quality influences the accuracy of predicting residue-residue interactions.

Main Methods:

  • Systematic examination of the relationship between MSA composition (homologs, orthologs, paralogs) and CMA prediction strength.
  • Development and evaluation of an automated data collection procedure for MSA generation.

Related Experiment Videos

Main Results:

  • A strong positive correlation was observed between the number of homologs in an MSA and CMA prediction accuracy.
  • Including both orthologs and paralogs, even remote homologs, significantly benefits CMA performance compared to using only orthologs.
  • An automated data collection procedure, requiring 50% coverage, enhances accuracy without manual curation.

Conclusions:

  • The performance of CMA methods is highly dependent on the quality and diversity of sequences within the MSA.
  • Expanding MSAs to include diverse homologs, including paralogs, is crucial for improving predictive accuracy.
  • CMA holds significant untapped potential in structural biology, with current methods not yet reaching their performance limits.