Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.6K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.6K
Leaky Scanning02:28

Leaky Scanning

5.4K
During most eukaryotic translation processes, the small 40S ribosome subunit scans an mRNA from its 5' end until it encounters the first start AUG codon. The large 60S ribosomal subunit then joins the smaller one to initiate protein synthesis. The location of the translation initiation is largely determined by the nucleotides near the start codon as there may be multiple translation initiation sites present on the mRNA.  Marilyn Kozak discovered that the sequence RCCAUGG (where R...
5.4K
Next-generation Sequencing03:00

Next-generation Sequencing

96.2K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
96.2K
Mismatch Repair01:36

Mismatch Repair

42.8K
Overview
42.8K
Sanger Sequencing01:57

Sanger Sequencing

768.1K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
768.1K
Gene Duplication and Divergence02:37

Gene Duplication and Divergence

7.5K
The seminal work of Ohno in 1970 popularized the idea of gene duplication and divergence. DNA sequence comparison studies reveal that a large portion of the genes in bacteria, archaebacteria, and eukaryotes was  generated by gene duplication and divergence, indicating its critical role in evolution.
The duplicated copies of the gene are called Paralogs. Paralogs with similar sequences and functions form a gene family. Across several species, a large number of gene families are...
7.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

App-SpaM: phylogenetic placement of short reads without sequence alignment.

Bioinformatics advances·2023
Same author

Insertions and deletions as phylogenetic signal in an alignment-free context.

PLoS computational biology·2022
Same author

Genome and Methylome analysis of a phylogenetic novel <i>Campylobacter coli</i> cluster with <i>C. jejuni</i> introgression.

Microbial genomics·2021
Same author

A 20-kb lineage-specific genomic region tames virulence in pathogenic amphidiploid Verticillium longisporum.

Molecular plant pathology·2021
Same author

S-conLSH: alignment-free gapped mapping of noisy long reads.

BMC bioinformatics·2021
Same author

'Multi-SpaM': a maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees.

NAR genomics and bioinformatics·2021
Same journal

Mapping the 3D Chromosome Organization of a Biosynthetic Gene Cluster by Capture Hi-C (CHi-C).

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

Mapping the 3D Chromosome Organization of Streptomyces by Hi-C.

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

CUT&Tag Epigenomic Profiling of Biosynthetic Gene Clusters in Arabidopsis thaliana.

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

Rhizobium rhizogenes-Mediated Hairy Root Transformation Protocol for Lotus japonicus and Other Legumes.

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

Characterization of Bioactive Saponins from Sea Cucumbers.

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

Methods for Functional Validation of Terpenoid Metabolic Clusters in Nicotiana benthamiana and Aspergillus oryzae.

Methods in molecular biology (Clifton, N.J.)·2026
See all related articles

Related Experiment Video

Updated: Nov 26, 2025

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.5K

Sequence Comparison Without Alignment: The SpaM Approaches.

Burkhard Morgenstern1

  • 1University of Göttingen, Department of Bioinformatics (IMG), Göttingen, Germany. bmorgen@gwdg.de.

Methods in Molecular Biology (Clifton, N.J.)
|December 8, 2020
PubMed
Summary
This summary is machine-generated.

Fast alignment-free methods using spaced-word matches (SpaM) enable accurate phylogenetic distance estimation for large DNA and protein sequence datasets. These novel approaches overcome limitations of traditional alignment methods for modern sequencing data.

Keywords:
Alignment freeFSWMGenome comparisonPhylogenomicsPhylogenySpaMSpaced words

More Related Videos

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.3K
Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.2K

Related Experiment Videos

Last Updated: Nov 26, 2025

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.5K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.3K
Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.2K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Traditional pairwise and multiple sequence alignment methods are computationally intensive for massive datasets from modern sequencing technologies.
  • Existing alignment-free methods often use ad hoc similarity measures that are difficult to interpret.
  • There is a need for fast and interpretable methods for large-scale sequence comparison.

Purpose of the Study:

  • To introduce novel alignment-free methods for DNA and protein sequence analysis.
  • To develop approaches that can accurately estimate phylogenetic distances.
  • To address the computational challenges posed by large sequencing data volumes.

Main Methods:

  • Development of alignment-free methods based on spaced-word matches (SpaM).
  • Utilizing inexact word matches with pre-defined mismatch positions.
  • Application of a stochastic model of molecular evolution for distance estimation.

Main Results:

  • The developed SpaM approaches provide accurate phylogenetic distance estimations.
  • These methods are significantly faster than traditional alignment techniques for large datasets.
  • The approaches offer a more interpretable basis for sequence similarity assessment.

Conclusions:

  • Spaced-word matches offer a powerful and efficient alternative to traditional sequence alignment.
  • The developed methods accurately capture evolutionary relationships in DNA and protein sequences.
  • These alignment-free techniques are well-suited for analyzing data from high-throughput sequencing projects.