Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Acceleration Vectors01:30

Acceleration Vectors

20.0K
In everyday conversation, accelerating means speeding up. Acceleration is a vector in the same direction as the change in velocity, Δv, therefore the greater the acceleration, the greater the change in velocity over a given time. Since velocity is a vector, it can change in magnitude, direction, or both. Thus acceleration is a change in speed or direction, or both. For example, if a runner traveling at 10 km/h due east slows to a stop, reverses direction, and continues their run at 10 km/h...
20.0K
Per-Unit Sequence Models01:26

Per-Unit Sequence Models

336
An ideal Y-Y transformer, grounded through neutral impedances, displays per-unit sequence networks akin to those of a single-phase ideal transformer when subjected to balanced positive- or negative-sequence currents. These currents do not produce neutral currents, and their associated voltage drops.
Zero-sequence currents, which are identical in magnitude and phase, generate a neutral current, resulting in voltage drops across the neutral impedance and the low-voltage winding. If the...
336
Improving Translational Accuracy02:07

Improving Translational Accuracy

13.0K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
13.0K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.4K
3.4K
Associative Learning01:27

Associative Learning

924
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
924
Relative Motion Analysis - Acceleration01:10

Relative Motion Analysis - Acceleration

658
A slider-crank mechanism converts rotational motion from the crank into linear motion of the slider or vice versa. This mechanism consists of three main parts: the crank, the connecting rod, and the slider. The movement of the slider-crank is an example of general plane motion as the fluctuating angle between the crank and the connecting rod. Consider a segment AB where point A is at the end of the slider and point B is on the diametrically opposite end to point A, on a crack. The variance in...
658

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The Common Fund Data Ecosystem (CFDE).

bioRxiv : the preprint server for biology·2026
Same author

Learning Explainable Imaging-Genetics Associations Related to a Neurological Disorder.

Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention·2026
Same author

Assembling unmapped reads reveals hidden variation in South Asian genomes.

Nature communications·2026
Same author

Perseus: Lineage-Aware Refinement of Kraken2 Taxonomic Classification for Long Read Metagenomes.

bioRxiv : the preprint server for biology·2026
Same author

A neofunctionalized flowering antagonist created an evolutionary contingency that channeled Solanaceae adaptation.

bioRxiv : the preprint server for biology·2026
Same author

AniAnn's: alignment-free annotation of tandem repeat arrays using fast average nucleotide identity estimates.

bioRxiv : the preprint server for biology·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Dec 3, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

875

Sapling: accelerating suffix array queries with learned data models.

Melanie Kirsche1, Arun Das1, Michael C Schatz1,2,3

  • 1Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA.

Bioinformatics (Oxford, England)
|October 27, 2020
PubMed
Summary
This summary is machine-generated.

Sapling accelerates sequence alignment by augmenting suffix arrays with learned data models. This novel approach speeds up genomic data processing by over twofold with minimal memory overhead.

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

997

Related Experiment Videos

Last Updated: Dec 3, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

875
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

997

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Genomic data expansion necessitates efficient sequence alignment algorithms.
  • Suffix arrays accelerate alignment but suffer from cache misses due to binary search on large datasets.

Purpose of the Study:

  • To develop and evaluate Sapling, a novel algorithm for faster sequence alignment.
  • To improve the efficiency of suffix array queries using learned data models.

Main Methods:

  • Developed Sapling, an algorithm integrating learned data models with suffix arrays.
  • Investigated various neural network models and implemented a compact, piecewise linear model.
  • Evaluated Sapling against optimized binary search and existing read aligners.

Main Results:

  • Sapling achieved more than twofold speedup in sequence alignment across diverse genomes (human, bacteria, plants).
  • The memory footprint increase was less than 1% of the suffix array size.
  • Outperformed optimized binary search and widely used read aligners.

Conclusions:

  • Sapling offers a significant improvement in sequence alignment speed and efficiency.
  • The learned data model approach effectively reduces cache misses and accelerates queries.
  • Sapling provides a practical and efficient solution for large-scale genomic data analysis.