Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

14.8K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
14.8K
Protein and Protein Structure02:15

Protein and Protein Structure

90.4K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
90.4K
Protein and Protein Structures02:15

Protein and Protein Structures

19.7K
19.7K
Conservation of Protein Domains02:26

Conservation of Protein Domains

4.3K
4.3K
Protein Families02:47

Protein Families

4.6K
4.6K
Protein Families02:47

Protein Families

17.3K
Protein families are groups of homologous proteins; that is, they have similarities in amino acid sequences and three-dimensional structures. Protein families usually occur because of gene duplication, where an additional copy of a gene is inserted into the genome of an organism.   Mutations that change the amino acids but still allow the protein to be properly synthesized, will lead to new protein family members.   If these new proteins contain similar amino acids in key...
17.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Ten simple rules for writing a peer review.

PLoS computational biology·2026
Same author

An educator framework for organizing Wikipedia editathons for computational biology.

Bioinformatics (Oxford, England)·2025
Same author

Robust Optimal Metabolic Factories.

Journal of computational biology : a journal of computational molecular cell biology·2024
Same author

Closing the computational biology 'knowledge gap': Spanish Wikipedia as a case study.

Bioinformatics (Oxford, England)·2024
Same author

Sketching methods with small window guarantee using minimum decycling sets.

ArXiv·2023
Same author

Shortest Hyperpaths in Directed Hypergraphs for Reaction Pathway Inference.

Journal of computational biology : a journal of computational molecular cell biology·2023
Same journal

Haplotype-aware long-read error correction.

Algorithms for molecular biology : AMB·2026
Same journal

Extension of partial atom-to-atom maps: uniqueness and algorithms.

Algorithms for molecular biology : AMB·2026
Same journal

Lossless pangenome indexing using tag arrays.

Algorithms for molecular biology : AMB·2026
Same journal

Dolphyin: a combinatorial algorithm for identifying 1-Dollo phylogenies in cancer.

Algorithms for molecular biology : AMB·2026
Same journal

Probing transcription factor subsets in gene regulatory networks.

Algorithms for molecular biology : AMB·2026
Same journal

Comparing the ability of embedding methods on metabolic hypergraphs for capturing taxonomy-based features.

Algorithms for molecular biology : AMB·2026
See all related articles

Related Experiment Video

Updated: Mar 3, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

70.0K

Core column prediction for protein multiple sequence alignments.

Dan DeBlasio1,2, John Kececioglu1

  • 1Department of Computer Science, The University of Arizona, Tucson, AZ 85721 USA.

Algorithms for Molecular Biology : AMB
|April 25, 2017
PubMed
Summary
This summary is machine-generated.

We developed a novel predictor for column coreness in protein multiple sequence alignments. This tool accurately estimates alignment accuracy by identifying reliable columns, outperforming existing methods.

Keywords:
Accuracy estimationAlignment accuracyCore blocksMachine learningMultiple sequence alignmentParameter advisingRegression

More Related Videos

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

2.7K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.5K

Related Experiment Videos

Last Updated: Mar 3, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

70.0K
Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

2.7K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.5K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Structural Biology

Background:

  • Column coreness in protein multiple sequence alignments (MSAs) quantifies the reliability of alignment columns based on reference alignments.
  • Reference alignments use 3D structures to identify confident core columns, but this is unavailable for practical alignments.
  • Current methods typically measure MSA accuracy only against these core columns.

Purpose of the Study:

  • To develop the first predictor for column coreness in protein MSAs.
  • To enable accurate estimation of MSA accuracy when no reference alignment is known.
  • To improve the selection of alignment parameters for enhanced accuracy.

Main Methods:

  • Developed a novel coreness predictor using a nearest-neighbor classification approach.
  • Transformed nearest-neighbor distances into coreness predictions via a regression function.
  • Optimized a distance function using large-scale linear programming.

Main Results:

  • The developed coreness predictor accurately identifies reliable columns in computed MSAs.
  • Applied to parameter advising, the predictor significantly outperforms existing column-confidence estimators.
  • Achieved a substantial boost in overall protein multiple sequence alignment accuracy.

Conclusions:

  • The novel coreness predictor enables reliable estimation of MSA accuracy without a reference.
  • This method offers a significant improvement for parameter advising in protein alignment.
  • The approach provides a robust tool for assessing and enhancing the quality of computed MSAs.