Selecting the Right Similarity-Scoring Matrix.

William R Pearson¹

¹University of Virginia School of Medicine, Charlottesville, Virginia.

Current Protocols in Bioinformatics

February 11, 2014

Summary

This summary is machine-generated.

Choosing the right protein similarity scoring matrix (for example, BLOSUM62) is crucial for evolutionary analysis. Deep matrices suit sensitive searches with full-length sequences, while shallow matrices are better for short domains or recent evolutionary relationships.

Keywords:

BLOSUM matrices; PAM matrices; sequence alignment; similarity scoring matrices

Area of Science:

Bioinformatics
Computational Biology
Evolutionary Biology

Background:

Protein similarity searching programs like BLASTP, SSEARCH, and FASTA utilize scoring matrices to identify evolutionary relationships.
Different scoring matrices are most effective at different evolutionary distances: "deep" matrices (e.g., BLOSUM62, BLOSUM50) target distant relationships, while "shallow" matrices (e.g., VTML10 through VTML80) target recent ones.

Purpose of the Study:

To discuss the theoretical foundations guiding the selection of protein and DNA similarity scoring matrices and gap penalties.
To provide guidance on choosing appropriate scoring matrices for different evolutionary scales and search objectives.
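
As a pointer to the theory involved, similarity scoring matrices are conventionally read as log-odds scores. The notation below is the standard textbook form and is not quoted from the article: q_ij is the target frequency of residues i and j being aligned in related sequences, p_i and p_j are background frequencies, and lambda is a scale factor fixed by the normalization condition on the right.

```latex
s_{ij} = \frac{1}{\lambda} \ln \frac{q_{ij}}{p_i \, p_j},
\qquad
\sum_{i,j} p_i \, p_j \, e^{\lambda s_{ij}} = 1
```

Under this reading, a matrix is deep or shallow according to the evolutionary distance at which its target frequencies q_ij were estimated.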

Main Methods:

The study discusses the principles behind different scoring matrices, including BLOSUM62, BLOSUM50, and the VTML series.
It examines the trade-offs between sensitivity, alignment length, and potential overextension associated with deep versus shallow matrices.
The role of match/mismatch parameters in DNA searches for defining evolutionary look-back times and domain boundaries is also considered.
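
The link between DNA match/mismatch scores and evolutionary look-back can be sketched numerically. The example below is a minimal illustration rather than the article's own procedure: it assumes the standard log-odds view of scoring, uniform base frequencies of 0.25, and a single penalty for every kind of mismatch. The function names `solve_lambda` and `target_identity` are hypothetical.

```python
import math


def solve_lambda(match: float, mismatch: float) -> float:
    """Find lam > 0 with 0.25*exp(lam*match) + 0.75*exp(lam*mismatch) = 1.

    Assumption: all four bases have frequency 0.25, so a random pair of
    bases matches with probability 0.25 and mismatches with probability 0.75.
    """
    if match <= 0 or mismatch >= 0:
        raise ValueError("need match > 0 and mismatch < 0")
    if 0.25 * match + 0.75 * mismatch >= 0:
        raise ValueError("expected score per position must be negative")

    def f(lam: float) -> float:
        return 0.25 * math.exp(lam * match) + 0.75 * math.exp(lam * mismatch) - 1.0

    # f(0) = 0, f dips below zero, then rises; bracket the positive root.
    lo, hi = 0.0, 1.0
    while f(hi) <= 0:
        hi *= 2.0
    for _ in range(200):  # plain bisection
        mid = (lo + hi) / 2.0
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def target_identity(match: float, mismatch: float) -> float:
    """Fraction of identical positions the scoring scheme is tuned for."""
    lam = solve_lambda(match, mismatch)
    return 0.25 * math.exp(lam * match)


if __name__ == "__main__":
    for m, n in [(1, -3), (1, -2), (1, -1)]:
        print(f"match {m:+d} / mismatch {n:+d}: "
              f"target identity {target_identity(m, n):.3f}")
```

In this simplified model, +1/-1 is tuned for alignments that are 75% identical (lambda equals ln 3), while +1/-3 is tuned for roughly 99% identity, so a harsher mismatch penalty corresponds to a shorter look-back time.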

Main Results:

"Deep" scoring matrices (BLOSUM62, BLOSUM50) offer high sensitivity for full-length protein sequences but may require longer alignments and risk overextension.
"Shallow" scoring matrices are more effective for identifying short protein domains or for searches focusing on recently diverged organisms.
The choice of scoring matrix and gap penalties directly impacts the accuracy and scope of evolutionary inferences.
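
Why deeper scoring tends to need longer alignments can be illustrated with the information content per aligned position. The sketch below is an assumption-laden stand-in, not the article's analysis: it uses DNA match/mismatch scores under uniform base frequencies (real protein matrices need full 20 x 20 tables, which are not reproduced here), and the 40-bit significance threshold is an arbitrary illustrative value. The names `bits_per_position` and `min_aligned_positions` are hypothetical.

```python
import math


def _solve_lambda(match: float, mismatch: float) -> float:
    """Positive root of 0.25*exp(lam*match) + 0.75*exp(lam*mismatch) = 1."""
    if match <= 0 or mismatch >= 0 or 0.25 * match + 0.75 * mismatch >= 0:
        raise ValueError("need match > 0, mismatch < 0, negative expected score")

    def f(lam: float) -> float:
        return 0.25 * math.exp(lam * match) + 0.75 * math.exp(lam * mismatch) - 1.0

    lo, hi = 0.0, 1.0
    while f(hi) <= 0:
        hi *= 2.0
    for _ in range(200):  # plain bisection
        mid = (lo + hi) / 2.0
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def bits_per_position(match: float, mismatch: float) -> float:
    """Relative entropy (bits) per aligned position at the target distance."""
    lam = _solve_lambda(match, mismatch)
    q_match = 0.25 * math.exp(lam * match)        # target match frequency
    q_mismatch = 0.75 * math.exp(lam * mismatch)  # target mismatch frequency
    expected_raw = q_match * match + q_mismatch * mismatch
    return lam * expected_raw / math.log(2)


def min_aligned_positions(match: float, mismatch: float,
                          target_bits: float = 40.0) -> int:
    """Ungapped positions needed to reach target_bits (illustrative threshold)."""
    return math.ceil(target_bits / bits_per_position(match, mismatch))


if __name__ == "__main__":
    for m, n in [(1, -3), (1, -1)]:
        print(f"{m:+d}/{n:+d}: {bits_per_position(m, n):.2f} bits/position, "
              f"about {min_aligned_positions(m, n)} positions for 40 bits")
```

In this toy model the shallower +1/-3 scheme carries more bits per aligned position than the deeper +1/-1 scheme, so it reaches the same bit score over a shorter alignment; the same trade-off is what makes shallow matrices attractive for short domains.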

Conclusions:

Deep scoring matrices are recommended for sensitive searches using full-length protein sequences.
Shallow scoring matrices are preferable for analyzing short domains or when a restricted evolutionary perspective is needed.
Optimal selection of scoring matrices and gap penalties is essential for accurate protein and DNA similarity searches across various evolutionary scales.