Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Causes of Similarity-Dissimilarity Effect01:26

Causes of Similarity-Dissimilarity Effect

240
The similarity-dissimilarity effect, a fundamental concept in social psychology, explains how interpersonal similarities and differences influence attraction and social interactions. This effect is supported by three key psychological perspectives: balance theory, social comparison theory, and consensual validation.Balance Theory and Cognitive ConsistencyBalance theory, developed by Fritz Heider, posits that individuals seek cognitive consistency in their relationships. When two people share...
240
Associative Learning01:27

Associative Learning

1.2K
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
1.2K
Distance Corrections01:15

Distance Corrections

257
To achieve precise distance measurements, especially in surveying and construction, certain corrections must be applied to account for potential sources of error like the standardization errors, temperature variations, and slope adjustments.Standardization error emerges when measurement equipment undergoes changes, such as wear, repairs, or weather impacts. To address this, surveyors compare the equipment’s readings to a standard. This process identifies any deviation that might lead to...
257
Collisions in Multiple Dimensions: Introduction01:05

Collisions in Multiple Dimensions: Introduction

6.5K
It is far more common for collisions to occur in two dimensions; that is, the initial velocity vectors are neither parallel nor antiparallel to each other. Let's see what complications arise from this. The first idea is that momentum is a vector. Like all vectors, it can be expressed as a sum of perpendicular components (usually, though not always, an x-component and a y-component, and a z-component if necessary). Thus, when the statement of conservation of momentum is written for a...
6.5K
Collisions in Multiple Dimensions: Problem Solving01:06

Collisions in Multiple Dimensions: Problem Solving

5.2K
In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...
5.2K
Improving Translational Accuracy02:07

Improving Translational Accuracy

14.0K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
14.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Augmenting transcriptome annotations through the lens of splicing evolution.

Genome research·2026
Same author

Hash functions in nucleotide sequence analysis.

Genome research·2026
Same author

Minimum flow decomposition guided by saturating subflows.

bioRxiv : the preprint server for biology·2025
Same author

Amaranth: Enhanced Single-Cell Transcript Assembly via Discriminative Modeling of UMI Reads and Internal Reads.

bioRxiv : the preprint server for biology·2025
Same author

Accurate Reconstruction of Circular RNAs from Complex Rolling Circular Long Reads with CircPlex.

bioRxiv : the preprint server for biology·2025
Same author

Boosting Transcript Assembly via Delineating Transcript Start and End Sites.

bioRxiv : the preprint server for biology·2025
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jan 9, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

994

MELO-ED: learning locality-sensitive multi-embeddings for edit distance.

Xin Yuan1,2, Ke Chen1, Ajmain Yasar Ahmed1

  • 1Department of Computer Science and Engineering, The Pennsylvania State University, PA 16803, USA.

Biorxiv : the Preprint Server for Biology
|December 8, 2025
PubMed
Summary
This summary is machine-generated.

MELO-ED, a new framework, enhances biological sequence similarity searches by approximating edit distance using multi-dimensional embeddings. This method achieves high accuracy and scalability for massive genomic datasets.

Related Experiment Videos

Last Updated: Jan 9, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

994

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Machine Learning

Background:

  • Edit distance is crucial for biological sequence similarity but computationally expensive.
  • Previous methods like learned locality-sensitive bucketing (LSB) had accuracy limitations due to one-dimensional hash values.
  • Scalable and accurate sequence comparison is vital for large-scale genomic analysis.

Purpose of the Study:

  • Introduce MELO-ED, a novel framework to efficiently approximate edit distance for biological sequences.
  • Improve accuracy and scalability of sequence similarity searches beyond existing methods.
  • Enable fast classification of similar and dissimilar sequences in massive datasets.

Main Methods:

  • Developed MELO-ED, a multi-embedding locality-sensitive framework.
  • Employed a Siamese convolutional neural architecture for learning complementary embeddings.
  • Integrated locality-sensitive bucketing with higher-dimensional embeddings.
  • Utilized mature indexing methods in the embedding space for similarity searches.

Main Results:

  • MELO-ED achieves near-perfect accuracy in approximating edit distance.
  • The framework demonstrates superior performance and efficiency compared to traditional and machine learning methods.
  • Evaluations on simulated DNA and real barcode data confirm MELO-ED's effectiveness.
  • MELO-ED significantly speeds up edit distance computations for large databases.

Conclusions:

  • MELO-ED offers a state-of-the-art solution for fast and accurate biological sequence classification.
  • The multi-embedding approach overcomes limitations of previous LSB functions.
  • MELO-ED enables scalable similarity searches in massive genomic databases.