Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Wilcoxon Signed-Ranks Test for Matched Pairs01:09

Wilcoxon Signed-Ranks Test for Matched Pairs

121
The Wilcoxon signed-rank test for matched pairs evaluates the null hypothesis by combining the ranks of differences with their signs. It essentially tests whether the median of the differences in a population of matched pairs is zero. Since the test incorporates more information than the sign test, it generally yields more trustable conclusions. This test also does not require the data to follow a normal distribution, but two conditions must be met for it to be applicable: (1) the data must...
121
Kendall's Coefficient of Concordance01:20

Kendall's Coefficient of Concordance

329
Kendall's Coefficient of Concordance (W), also known as Kendall's W, is a non-parametric statistical measure used to assess the agreement or concordance between multiple raters or judges when they rank a set of items. It is often used when you have ordinal data (ranks) and you want to see if there is consistency or consensus among the raters. It is widely applied in research areas such as psychology, medicine, and social sciences, where multiple judges are asked to rank or rate subjects...
329
Position Vectors01:29

Position Vectors

877
A position vector is a fundamental concept in mathematics that helps determine the position of one point with respect to another point in space. It is a vector that describes the direction and distance between two points. Position vectors are highly useful in the field of math and science, as they help represent spatial relationships and make calculations easier.
For instance, we want to locate a point P(x, y, z) relative to the origin of coordinates O. In that case, we can define a position...
877
Sign Test for Matched Pairs01:17

Sign Test for Matched Pairs

131
The sign test for matched pairs offers a robust method for comparing two paired samples, often for the effects of an intervention in one of them. This method is very useful in situations where the underlying distribution of the data is unknown. The test compares two related samples—often pre- and post-treatment measurements on the same subjects—to determine if there are significant differences in their median values.
To conduct the sign test, we first calculate the differences in...
131
Spearman's Rank Correlation Test01:20

Spearman's Rank Correlation Test

771
Spearman's rank correlation test, also known as Spearman's rho, is a nonparametric method for assessing the strength and direction of association between two variables. This test is particularly valuable when the data distribution is unknown or when the assumption of normality does not hold. Named after the English psychologist and statistician Dr. Charles Edward Spearman, it serves as the nonparametric counterpart to Pearson's correlation coefficient.
Spearman's test calculates...
771
Vector Product (Cross Product)01:17

Vector Product (Cross Product)

9.5K
Vector multiplication of two vectors yields a vector product, with the magnitude equal to the product of the individual vectors multiplied by the sine of the angle between both the vectors and the direction perpendicular to both the individual vectors. As there are always two directions perpendicular to a given plane, one on each side, the direction of the vector product is governed by the right-hand thumb rule.
Consider the cross product of two vectors. Imagine rotating the first vector about...
9.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Systemic Acquired Resistance signaling molecule N-hydroxypipecolic acid is involved in Age-Related Resistance in Arabidopsis thaliana.

Plant physiology·2026
Same author

Ankh-Score Produces Better Sequence Alignments Than AlphaFold3.

Proteins·2026
Same author

Protein embeddings and local alignments.

Computational and structural biotechnology journal·2026
Same author

Component puzzle protein-protein interaction prediction.

Briefings in bioinformatics·2025
Same author

Explainability of Protein Deep Learning Models.

International journal of molecular sciences·2025
Same author

Reply to: Insufficient evidence for natural selection associated with the Black Death.

Nature·2025
Same journal

STED: flexible cross-modal topic modeling infers cell-type-specific regulatory landscapes from bulk epigenomics.

Briefings in bioinformatics·2026
Same journal

A knowledge-guided deep learning framework for quantitative nucleic acid testing.

Briefings in bioinformatics·2026
Same journal

Optimal transport for label transfer in single-cell multi-omics integration.

Briefings in bioinformatics·2026
Same journal

Continuous multi-omics pathway enrichment analysis resolves hidden functional heterogeneity.

Briefings in bioinformatics·2026
Same journal

Evaluating completeness, coherence, and consistency of genome-scale function annotations.

Briefings in bioinformatics·2026
Same journal

Transformers for single-cell RNA sequencing: a survey.

Briefings in bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jun 27, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

548

Scoring alignments by embedding vector similarity.

Sepehr Ashrafzadeh1, G Brian Golding2, Silvana Ilie3

  • 1Department of Computer Science, University of Western Ontario, London, N6A 5B7, Ontario, Canada.

Briefings in Bioinformatics
|May 2, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a novel E-score method for amino acid similarity, outperforming traditional BLOSUM matrices in sequence alignment. This deep learning approach leverages contextual embeddings for more accurate biological sequence analysis.

Keywords:
alignment distanceamino acid scoring matricessequence alignmentsequence similarityword embedding

More Related Videos

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

443
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.7K

Related Experiment Videos

Last Updated: Jun 27, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

548
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

443
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.7K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Machine Learning

Background:

  • Sequence similarity is vital for understanding protein function and evolutionary relationships.
  • Existing scoring matrices (e.g., PAM, BLOSUM) are context-independent, limiting their accuracy.
  • Deep learning offers a way to create context-dependent representations.

Purpose of the Study:

  • To develop a novel, context-dependent scoring method for amino acid similarity.
  • To improve the accuracy of biological sequence alignment.
  • To leverage deep learning embeddings for protein sequence analysis.

Main Methods:

  • Utilized deep learning architectures with self-supervised learning on large unlabeled protein sequence datasets.
  • Generated contextual embedding vectors for individual amino acid residues.
  • Defined the E-score as the cosine similarity between residue embedding vectors.

Main Results:

  • Alignments generated using the E-score method, particularly ProtT5-score, showed significant improvement over BLOSUM-based alignments.
  • The new method demonstrated superior performance across various reference multiple sequence alignments.
  • The E-score effectively captures context-dependent amino acid similarity.

Conclusions:

  • The E-score offers a more accurate and context-aware approach to sequence similarity scoring.
  • This method has the potential to revolutionize sequence alignment and related bioinformatics tasks.
  • The developed tool is accessible via a web server and open-source code.