Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.3K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.3K
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

206
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
206
RNA-seq03:21

RNA-seq

10.5K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
10.5K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.3K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.3K
Applications of Molecular Taxonomy01:20

Applications of Molecular Taxonomy

159
Molecular taxonomy has revolutionized the understanding and classification of bacteria, providing precise insights into their diversity, evolutionary relationships, and ecological roles. By utilizing molecular techniques such as DNA sequencing and fingerprinting, researchers have made significant strides in various fields related to bacterial studies.Resolving Taxonomic AmbiguitiesMolecular taxonomy has been instrumental in distinguishing closely related bacterial species initially thought to...
159
Maxam-Gilbert Sequencing01:05

Maxam-Gilbert Sequencing

11.6K
In the same year as the discovery of the Sanger sequencing method, another group of scientists, Allan Maxam and Walter Gilbert, demonstrated their chemical-cleavage method for DNA sequencing. The Maxam-Gilbert method relies on using different chemicals that can cleave the DNA sequence at specific sites, the separation of resulting DNA fragments of variable size using electrophoresis, and deciphering the DNA sequence from the resulting gel bands.
Challenges of the Maxam-Gilbert Method
The...
11.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Timely bespoke phage-antibiotic combination to treat refractory Pseudomonas aeruginosa mediastinitis and vascular graft infection.

Nature communications·2026
Same author

Transmission dynamics of Escherichia coli sequence type 131 in households-a one health prospective cohort study.

Nature communications·2025
Same author

Characterization of mobile resistance elements in extended-spectrum β-lactamase producing gram-negative bacteria from aquatic environment.

The Science of the total environment·2025
Same author

Advancing pathogen genomics in resource-limited settings.

Cell genomics·2023
Same author

Evolution and transmission of antibiotic resistance is driven by Beijing lineage <i>Mycobacterium tuberculosis</i> in Vietnam.

Microbiology spectrum·2023
Same author

Whole genome sequencing of multidrug resistant Enterobacterales identified in children and their household members within Siem Reap, Cambodia.

JAC-antimicrobial resistance·2023
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Sep 28, 2025

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles
10:23

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles

Published on: July 11, 2025

251

Clustering biological sequences with dynamic sequence similarity threshold.

Jimmy Ka Ho Chiu1, Rick Twee-Hee Ong2

  • 1Saw Swee Hock School of Public Health, National University of Singapore and National University Health System, Singapore, 117549, Singapore.

BMC Bioinformatics
|March 31, 2022
PubMed
Summary
This summary is machine-generated.

ALFATClust offers a novel approach to biological sequence clustering, improving accuracy by dynamically adjusting similarity thresholds for each cluster. This method enhances cluster quality without requiring manual parameter tuning.

Keywords:
Graph clusteringHomologous sequencesMetagenomicsSequence clustering

More Related Videos

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.1K
A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

35.5K

Related Experiment Videos

Last Updated: Sep 28, 2025

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles
10:23

A Concoction Pipeline for Generating Molecular Operational Taxonomic Units (MOTUs) Among Riparian and Aquatic Beetles

Published on: July 11, 2025

251
Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.1K
A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

35.5K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Biological sequence clustering faces challenges due to high computational costs of sequence alignments and difficulties in parameter selection.
  • Existing methods often rely on a single, fixed sequence identity threshold, potentially leading to low-quality clusters.
  • Limited user support exists for selecting appropriate thresholds for diverse biological sequence datasets.

Purpose of the Study:

  • To develop a novel, efficient, and robust biological sequence clustering approach.
  • To overcome limitations of fixed-threshold methods in sequence clustering.
  • To provide automated threshold determination for improved cluster quality.

Main Methods:

  • Implemented ALFATClust, a novel sequence clustering approach.
  • Utilized rapid, alignment-free pairwise sequence distance calculations.
  • Employed community detection in graphs for cluster generation.
  • Dynamically determined cluster-specific cut-off thresholds based on cluster separation and intra-cluster similarity.

Main Results:

  • ALFATClust outperforms existing sequence clustering approaches on benchmark datasets.
  • The method simultaneously maintains cluster robustness and substantial cluster separation.
  • Dynamically determined thresholds enhance the quality of generated clusters.
  • An evaluation report is provided for verifying non-singleton cluster quality.

Conclusions:

  • ALFATClust effectively generates high-quality sequence clusters with strong intra-cluster similarity and clear inter-cluster separation.
  • The approach eliminates the need for users to manually specify precise similarity cut-off thresholds.
  • ALFATClust offers an automated and robust solution for biological sequence clustering.