Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Wilcoxon Signed-Ranks Test for Matched Pairs

Wilcoxon Signed-Ranks Test for Matched Pairs

The Wilcoxon signed-rank test for matched pairs evaluates the null hypothesis by combining the ranks of differences with their signs. It essentially tests whether the median of the differences in a population of matched pairs is zero. Since the test incorporates more information than the sign test, it generally yields more trustable conclusions. This test also does not require the data to follow a normal distribution, but two conditions must be met for it to be applicable: (1) the data must...

Kendall's Coefficient of Concordance

Kendall's Coefficient of Concordance

Kendall's Coefficient of Concordance (W), also known as Kendall's W, is a non-parametric statistical measure used to assess the agreement or concordance between multiple raters or judges when they rank a set of items. It is often used when you have ordinal data (ranks) and you want to see if there is consistency or consensus among the raters. It is widely applied in research areas such as psychology, medicine, and social sciences, where multiple judges are asked to rank or rate subjects...

Position Vectors

Position Vectors

A position vector is a fundamental concept in mathematics that helps determine the position of one point with respect to another point in space. It is a vector that describes the direction and distance between two points. Position vectors are highly useful in the field of math and science, as they help represent spatial relationships and make calculations easier.
For instance, we want to locate a point P(x, y, z) relative to the origin of coordinates O. In that case, we can define a position...

Sign Test for Matched Pairs

Sign Test for Matched Pairs

The sign test for matched pairs offers a robust method for comparing two paired samples, often for the effects of an intervention in one of them. This method is very useful in situations where the underlying distribution of the data is unknown. The test compares two related samples—often pre- and post-treatment measurements on the same subjects—to determine if there are significant differences in their median values.
To conduct the sign test, we first calculate the differences in...

Spearman's Rank Correlation Test

Spearman's Rank Correlation Test

Spearman's rank correlation test, also known as Spearman's rho, is a nonparametric method for assessing the strength and direction of association between two variables. This test is particularly valuable when the data distribution is unknown or when the assumption of normality does not hold. Named after the English psychologist and statistician Dr. Charles Edward Spearman, it serves as the nonparametric counterpart to Pearson's correlation coefficient.
Spearman's test calculates...

Vector Product (Cross Product)

Vector Product (Cross Product)

Vector multiplication of two vectors yields a vector product, with the magnitude equal to the product of the individual vectors multiplied by the sine of the angle between both the vectors and the direction perpendicular to both the individual vectors. As there are always two directions perpendicular to a given plane, one on each side, the direction of the vector product is governed by the right-hand thumb rule.
Consider the cross product of two vectors. Imagine rotating the first vector about...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Systemic Acquired Resistance signaling molecule N-hydroxypipecolic acid is involved in Age-Related Resistance in Arabidopsis thaliana.

Plant physiology·2026

Same author

Ankh-Score Produces Better Sequence Alignments Than AlphaFold3.

Proteins·2026

Same author

Protein embeddings and local alignments.

Computational and structural biotechnology journal·2026

Same author

Component puzzle protein-protein interaction prediction.

Briefings in bioinformatics·2025

Same author

Explainability of Protein Deep Learning Models.

International journal of molecular sciences·2025

Same author

Reply to: Insufficient evidence for natural selection associated with the Black Death.

Nature·2025

Same journal

STED: flexible cross-modal topic modeling infers cell-type-specific regulatory landscapes from bulk epigenomics.

Briefings in bioinformatics·2026

Same journal

A knowledge-guided deep learning framework for quantitative nucleic acid testing.

Briefings in bioinformatics·2026

Same journal

Optimal transport for label transfer in single-cell multi-omics integration.

Briefings in bioinformatics·2026

Same journal

Continuous multi-omics pathway enrichment analysis resolves hidden functional heterogeneity.

Briefings in bioinformatics·2026

Same journal

Evaluating completeness, coherence, and consistency of genome-scale function annotations.

Briefings in bioinformatics·2026

Same journal

Transformers for single-cell RNA sequencing: a survey.

Briefings in bioinformatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 27, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Scoring alignments by embedding vector similarity.

Sepehr Ashrafzadeh¹, G Brian Golding², Silvana Ilie³

¹Department of Computer Science, University of Western Ontario, London, N6A 5B7, Ontario, Canada.

Briefings in Bioinformatics

|May 2, 2024

Summary

This summary is machine-generated.

This study introduces a novel E-score method for amino acid similarity, outperforming traditional BLOSUM matrices in sequence alignment. This deep learning approach leverages contextual embeddings for more accurate biological sequence analysis.

Keywords:

alignment distance amino acid scoring matrices sequence alignment sequence similarity word embedding

More Related Videos

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

A Protocol for Computer-Based Protein Structure and Function Prediction

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

Related Experiment Videos

Last Updated: Jun 27, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

A Protocol for Computer-Based Protein Structure and Function Prediction

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

Area of Science:

Bioinformatics
Computational Biology
Machine Learning

Background:

Sequence similarity is vital for understanding protein function and evolutionary relationships.
Existing scoring matrices (e.g., PAM, BLOSUM) are context-independent, limiting their accuracy.
Deep learning offers a way to create context-dependent representations.

Purpose of the Study:

To develop a novel, context-dependent scoring method for amino acid similarity.
To improve the accuracy of biological sequence alignment.
To leverage deep learning embeddings for protein sequence analysis.

Main Methods:

Utilized deep learning architectures with self-supervised learning on large unlabeled protein sequence datasets.
Generated contextual embedding vectors for individual amino acid residues.
Defined the E-score as the cosine similarity between residue embedding vectors.

Main Results:

Alignments generated using the E-score method, particularly ProtT5-score, showed significant improvement over BLOSUM-based alignments.
The new method demonstrated superior performance across various reference multiple sequence alignments.
The E-score effectively captures context-dependent amino acid similarity.

Conclusions:

The E-score offers a more accurate and context-aware approach to sequence similarity scoring.
This method has the potential to revolutionize sequence alignment and related bioinformatics tasks.
The developed tool is accessible via a web server and open-source code.