Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

7.2K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
7.2K
Applications of Molecular Taxonomy01:20

Applications of Molecular Taxonomy

650
Molecular taxonomy has revolutionized the understanding and classification of bacteria, providing precise insights into their diversity, evolutionary relationships, and ecological roles. By utilizing molecular techniques such as DNA sequencing and fingerprinting, researchers have made significant strides in various fields related to bacterial studies.Resolving Taxonomic AmbiguitiesMolecular taxonomy has been instrumental in distinguishing closely related bacterial species initially thought to...
650

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Evaluating splicing factor and kinase network crosstalk through global phosphoproteomics.

bioRxiv : the preprint server for biology·2026
Same author

Critical role of cell competition in gliomagenesis.

bioRxiv : the preprint server for biology·2026
Same author

Electron-Polaron-Enriched Fe-Doped Co<sub>3</sub>O<sub>4</sub> Nanoneedle Arrays Enabled by Tip Curvature and Vacancy Engineering for Efficient Oxygen Evolution.

Small (Weinheim an der Bergstrasse, Germany)·2025
Same author

Co<sub>3</sub>O<sub>4</sub>/C-NFs Induced 3D Electric Field Enhancement for Dual-Regulation of Polysulfides and Li<sup>+</sup> Transport in Lithium-Sulfur Batteries.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2025
Same author

CoDIAC: A comprehensive approach for interaction analysis provides insights into SH2 domain function and regulation.

Science signaling·2025
Same author

Reconstructing the Electric Double Layer via Ni-HAB MOF-Derived Conductive Ligands: Modulating Debye Length and Reducing the Deprotonation Energy Barrier for the Oxygen Evolution Reaction.

Inorganic chemistry·2025
Same journal

Diacylglycerol kinase ζ in B lymphocytes supports CD40-mediated immune synapse formation, mTORC1 signaling, and plasma cell fate.

Science signaling·2026
Same journal

The APC/C adaptor Cdh1 stabilizes STING to potentiate innate immune activation in renal cell carcinoma.

Science signaling·2026
Same journal

Fattening mother's milk with oxytocin.

Science signaling·2026
Same journal

Virion display reveals MD-1 as an endogenous agonist for the orphan receptor GPRC5B.

Science signaling·2026
Same journal

Cyclin-dependent kinase CDK1 targets cell-cell junction components and governs epithelial monolayer integrity throughout the cell cycle.

Science signaling·2026
Same journal

Designer binders.

Science signaling·2026
See all related articles

Related Experiment Video

Updated: Mar 19, 2026

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.9K

Avoiding common pitfalls when clustering biological data.

Tom Ronan1, Zhijie Qi1, Kristen M Naegle2

  • 1Department of Biomedical Engineering, Center for Biological Systems Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA.

Science Signaling
|June 16, 2016
PubMed
Summary
This summary is machine-generated.

This study reviews common pitfalls in applying clustering, an unsupervised learning method, to biological data. Awareness and ensemble clustering methods can improve the accuracy of biological inferences from complex datasets.

More Related Videos

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.5K
Visualization and Quantification of High-Dimensional Cytometry Data using Cytofast and the Upstream Clustering Methods FlowSOM and Cytosplore
06:01

Visualization and Quantification of High-Dimensional Cytometry Data using Cytofast and the Upstream Clustering Methods FlowSOM and Cytosplore

Published on: December 12, 2019

9.0K

Related Experiment Videos

Last Updated: Mar 19, 2026

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.9K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.5K
Visualization and Quantification of High-Dimensional Cytometry Data using Cytofast and the Upstream Clustering Methods FlowSOM and Cytosplore
06:01

Visualization and Quantification of High-Dimensional Cytometry Data using Cytofast and the Upstream Clustering Methods FlowSOM and Cytosplore

Published on: December 12, 2019

9.0K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Data Science

Background:

  • Clustering is a key unsupervised learning technique for uncovering patterns in complex biological data.
  • High-throughput biological experiments generate multidimensional datasets requiring robust analytical methods.
  • Misapplication of clustering can lead to wasted research resources and flawed biological inferences.

Purpose of the Study:

  • To identify and review common pitfalls in the application of clustering to molecular biology data.
  • To provide practical solutions and methods for avoiding these pitfalls.
  • To enhance the reliability of biological insights derived from clustering analyses.

Main Methods:

  • Review of common clustering pitfalls from published molecular biology literature.
  • Presentation of toy problems and real biological data examples to illustrate issues.
  • Discussion of ensemble clustering as a method to improve robustness and explore multiple solutions.

Main Results:

  • Identified common pitfalls related to high-dimensional data, single method application, and result interpretation.
  • Demonstrated solutions using concrete examples for both simulated and real biological data.
  • Highlighted ensemble clustering as an effective strategy for robust analysis.

Conclusions:

  • Increased awareness of clustering pitfalls is crucial for accurate interpretation of biological data.
  • Employing multiple clustering methods and ensemble techniques can mitigate risks.
  • Proper application of clustering enhances the validity of biological inferences and experimental planning.