Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

154
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
154
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.3K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.3K
Other Unique Bacteria01:18

Other Unique Bacteria

86
Magnetic bacteria exhibit a directed movement called magnetotaxis, driven by structures called magnetosomes. These magnetosomes consist of chains of magnetic particles made of either magnetite (Fe₃O₄) or greigite (Fe₃S₄) and are organized in a linear conformation by a protein scaffold within invaginations of the cell membrane. The bacteria align along the north–south magnetic field lines, much like a compass needle. They are typically microaerophilic or anaerobic...
86
Second Uniqueness Theorem01:16

Second Uniqueness Theorem

1.1K
Consider a region consisting of several individual conductors with a definite charge density in the region between these conductors. The second uniqueness theorem states that if the total charge on each conductor and the charge density in the in-between region are known, then the electric field can be uniquely determined.
In contrast, consider that the electric field is non-unique and apply Gauss's law in divergence form in the region between the conductors and the integral form to the...
1.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Integrated flexible DNA methylation-chromatin segmentation modeling enhances epigenomic state annotation.

Nucleic acids research·2026
Same author

Cleanifier: contamination removal from microbial sequences using spaced seeds of a human pangenome index.

Bioinformatics (Oxford, England)·2025
Same author

Sustainable data analysis with Snakemake.

F1000Research·2025
Same author

Activation of NF-κB Signaling by Optogenetic Clustering of IKKα and β.

Advanced biology·2025
Same author

A comprehensive review and evaluation of species richness estimation.

Briefings in bioinformatics·2025
Same author

Tumor Necrosis Factor-Alpha Inhibits the Replication of Patient-Derived Archetype BK Polyomavirus While Activating Rearranged Strains.

Journal of medical virology·2025
Same journal

Haplotype-aware long-read error correction.

Algorithms for molecular biology : AMB·2026
Same journal

Extension of partial atom-to-atom maps: uniqueness and algorithms.

Algorithms for molecular biology : AMB·2026
Same journal

Lossless pangenome indexing using tag arrays.

Algorithms for molecular biology : AMB·2026
Same journal

Dolphyin: a combinatorial algorithm for identifying 1-Dollo phylogenies in cancer.

Algorithms for molecular biology : AMB·2026
Same journal

Probing transcription factor subsets in gene regulatory networks.

Algorithms for molecular biology : AMB·2026
Same journal

Comparing the ability of embedding methods on metabolic hypergraphs for capturing taxonomy-based features.

Algorithms for molecular biology : AMB·2026
See all related articles

Related Experiment Video

Updated: Sep 15, 2025

Use of MALDI-TOF Mass Spectrometry and a Custom Database to Characterize Bacteria Indigenous to a Unique Cave Environment Kartchner Caverns, AZ, USA
11:09

Use of MALDI-TOF Mass Spectrometry and a Custom Database to Characterize Bacteria Indigenous to a Unique Cave Environment Kartchner Caverns, AZ, USA

Published on: January 2, 2015

14.1K

Swiftly identifying strongly unique k-mers.

Jens Zentgraf1,2,3, Sven Rahmann4,5

  • 1Algorithmic Bioinformatics, Department of Computer Science, Saarland University, Campus E2.1, Saarbrücken, 66123, Saarland, Germany.

Algorithms for Molecular Biology : AMB
|July 13, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces efficient algorithms to identify strongly unique k-mers, which are DNA sequences robust against single basepair changes. These methods rapidly distinguish k-mers with conflicting neighbors in large genomic datasets.

Keywords:
k-merAlgorithm engineeringHamming distanceParallelizationStrong uniqueness

More Related Videos

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles
10:23

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles

Published on: July 11, 2025

221
Identifying Amino Acid Overproducers Using Rare-Codon-Rich Markers
10:41

Identifying Amino Acid Overproducers Using Rare-Codon-Rich Markers

Published on: June 24, 2019

8.5K

Related Experiment Videos

Last Updated: Sep 15, 2025

Use of MALDI-TOF Mass Spectrometry and a Custom Database to Characterize Bacteria Indigenous to a Unique Cave Environment Kartchner Caverns, AZ, USA
11:09

Use of MALDI-TOF Mass Spectrometry and a Custom Database to Characterize Bacteria Indigenous to a Unique Cave Environment Kartchner Caverns, AZ, USA

Published on: January 2, 2015

14.1K
A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles
10:23

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles

Published on: July 11, 2025

221
Identifying Amino Acid Overproducers Using Rare-Codon-Rich Markers
10:41

Identifying Amino Acid Overproducers Using Rare-Codon-Rich Markers

Published on: June 24, 2019

8.5K

Area of Science:

  • Bioinformatics
  • Genomics
  • Computational Biology

Background:

  • Unique k-mers are short DNA sequences found at a single genomic location, useful for DNA fragment placement.
  • However, unique k-mers can be sensitive to single basepair changes, leading to ambiguous data.
  • Strongly unique k-mers offer greater robustness by excluding those with conflicting Hamming-distance-1 neighbors.

Purpose of the Study:

  • To develop efficient algorithms for identifying k-mers with Hamming-distance-1 neighbors within a given set.
  • To distinguish robust strongly unique k-mers from less reliable unique k-mers.
  • To enable faster and more accurate analysis of large-scale genomic data.

Main Methods:

  • Developed two engineered algorithms: one using recursive 4-way comparison on sorted intervals, and another using bucketing with bit-parallel Hamming distance tests.
  • Both algorithms process canonical k-mers, considering reverse complements.
  • Algorithms are designed for efficient parallelization.

Main Results:

  • Successfully identified and marked k-mers with Hamming-distance-1 neighbors in large datasets.
  • An optimized combined approach processed 2.5 billion distinct 31-mers from the human T2T genome in under 20 seconds using 16 threads.
  • Demonstrated practical application and efficiency on billions of k-mers.

Conclusions:

  • The presented algorithms efficiently identify k-mers with conflicting neighbors, crucial for defining strongly unique k-mers.
  • These methods significantly improve the robustness and reliability of k-mer-based genomic analysis.
  • The implementation provides a valuable tool for large-scale genomics research, as demonstrated by the human T2T genome analysis.