Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Bootstrapping01:24

Bootstrapping

577
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
577
RNA-seq03:21

RNA-seq

9.8K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
9.8K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.7K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.7K
Maxam-Gilbert Sequencing01:05

Maxam-Gilbert Sequencing

11.0K
In the same year as the discovery of the Sanger sequencing method, another group of scientists, Allan Maxam and Walter Gilbert, demonstrated their chemical-cleavage method for DNA sequencing. The Maxam-Gilbert method relies on using different chemicals that can cleave the DNA sequence at specific sites, the separation of resulting DNA fragments of variable size using electrophoresis, and deciphering the DNA sequence from the resulting gel bands.
Challenges of the Maxam-Gilbert Method
The...
11.0K
Random Sampling Method01:09

Random Sampling Method

11.0K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
11.0K
Wald-Wolfowitz Runs Test II01:17

Wald-Wolfowitz Runs Test II

172
The Wald-Wolfowitz runs test, commonly referred to as the runs test, is a nonparametric test used to assess the randomness of ordered data. The test evaluates the number of runs, which are consecutive sequences of similar elements within the data. If the number of runs is significantly higher or lower than expected, the data is considered non-random, indicating a detectable pattern or structure.
For binary data, runs are identified using symbols such as + and −, or equivalently, 1s and...
172

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Augmenting transcriptome annotations through the lens of splicing evolution.

Genome research·2026
Same author

Hash functions in nucleotide sequence analysis.

Genome research·2026
Same author

Minimum flow decomposition guided by saturating subflows.

bioRxiv : the preprint server for biology·2025
Same author

MELO-ED: learning locality-sensitive multi-embeddings for edit distance.

bioRxiv : the preprint server for biology·2025
Same author

Amaranth: Enhanced Single-Cell Transcript Assembly via Discriminative Modeling of UMI Reads and Internal Reads.

bioRxiv : the preprint server for biology·2025
Same author

Accurate Reconstruction of Circular RNAs from Complex Rolling Circular Long Reads with CircPlex.

bioRxiv : the preprint server for biology·2025
Same journal

A human-specific genetic modifier reconfigures large-scale cortical network dynamics underlying behavioral performance.

bioRxiv : the preprint server for biology·2026
Same journal

<i>Staphylococcus aureus</i> uses a eukaryotic-like uridyltransferase to make UDP-GlcNAc for cell wall synthesis.

bioRxiv : the preprint server for biology·2026
Same journal

Dynamic redistribution of eIF4F controls cap-dependent translation initiation.

bioRxiv : the preprint server for biology·2026
Same journal

When does additional information improve accuracy of RNA secondary structure prediction?

bioRxiv : the preprint server for biology·2026
Same journal

Normative brain-state trajectories reveal deviation from healthy aging in Alzheimer's disease.

bioRxiv : the preprint server for biology·2026
Same journal

Noradrenergic infraslow rhythm during sleep is the critical link between heart-rate dynamics and memory consolidation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: May 27, 2025

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

8.6K

Sequence similarity estimation by random subsequence sketching.

Ke Chen1, Vinamratha Pattar2, Mingfu Shao1,3

  • 1Department of Computer Science and Engineering, The Pennsylvania State University, PA 16801.

Biorxiv : the Preprint Server for Biology
|February 20, 2025
PubMed
Summary
This summary is machine-generated.

SubseqSketch introduces a new alignment-free method for sequence similarity estimation using dynamic random subsequences. This approach efficiently approximates edit distances, improving bioinformatics tasks like phylogenetic analysis.

Keywords:
Alignment-free sequence comparisonEdit distance embeddingNearest neighbor searchPhylogenetic clustering

More Related Videos

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.0K
Demonstration of the Sequence Alignment to Predict Across Species Susceptibility Tool for Rapid Assessment of Protein Conservation
16:02

Demonstration of the Sequence Alignment to Predict Across Species Susceptibility Tool for Rapid Assessment of Protein Conservation

Published on: February 10, 2023

2.6K

Related Experiment Videos

Last Updated: May 27, 2025

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

8.6K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.0K
Demonstration of the Sequence Alignment to Predict Across Species Susceptibility Tool for Rapid Assessment of Protein Conservation
16:02

Demonstration of the Sequence Alignment to Predict Across Species Susceptibility Tool for Rapid Assessment of Protein Conservation

Published on: February 10, 2023

2.6K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Sequence similarity estimation is crucial for bioinformatics tasks like functional annotation and phylogenetic analysis.
  • Alignment-free methods offer efficient large-scale sequence comparison by approximating edit distances.
  • Existing methods like k-mers face trade-offs, while subsequence methods are computationally intensive.

Purpose of the Study:

  • To introduce SubseqSketch, a novel alignment-free scheme for sequence similarity estimation.
  • To develop a method that overcomes the limitations of fixed-length k-mers and computationally demanding subsequence methods.
  • To demonstrate the efficiency and effectiveness of SubseqSketch in various bioinformatics applications.

Main Methods:

  • SubseqSketch maps sequences to integer vectors representing dynamic lengths of random subsequences.
  • Cosine similarity is used to compare these vectors, correlating strongly with edit similarity.
  • The method was evaluated on benchmark datasets for alignment-free tasks.

Main Results:

  • SubseqSketch demonstrates strong correlation between vector cosine similarity and original sequence edit similarity.
  • The method proved efficient and effective in nearest neighbor search and phylogenetic clustering.
  • Experimental results validate SubseqSketch's performance across diverse alignment-free applications.

Conclusions:

  • SubseqSketch offers an efficient and effective alignment-free approach for sequence similarity estimation.
  • The dynamic subsequence mapping overcomes limitations of traditional k-mer and fixed-length subsequence methods.
  • The open-source implementation facilitates broader adoption in bioinformatics research.