Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Data clustering in life sciences.

Ying Zhao1, George Karypis

  • 1Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55455, USA.

Molecular Biotechnology
|August 25, 2005
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

PropMolFlow: property-guided molecule generation with geometry-complete flow matching.

Nature computational science·2026
Same author

TraverseNet: Unifying Space and Time in Message Passing for Traffic Forecasting.

IEEE transactions on neural networks and learning systems·2022
Same author

A knowledge graph of clinical trials ([Formula: see text]).

Scientific reports·2022
Same author

Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces.

Molecular informatics·2022
Same author

DGL-LifeSci: An Open-Source Toolkit for Deep Learning on Graphs in Life Science.

ACS omega·2021
Same author

Anomaly Detection on Attributed Networks via Contrastive Self-Supervised Learning.

IEEE transactions on neural networks and learning systems·2021
Same journal

Cloning and Functional Characterization of AhyAP-T65Lig, an ATP-Dependent DNA Ligase from Trabzonvirus AP-T65.

Molecular biotechnology·2026
Same journal

Overexpression of the ATP-Citrate Lyase Gene Enhances Ganoderic Acid Biosynthesis in Ganoderma lingzhi.

Molecular biotechnology·2026
Same journal

CRISPR/Cas9 Mediated Genome Editing for Enhancing Abiotic Stress Tolerance in Rice: An Omics Guided Perspective.

Molecular biotechnology·2026
Same journal

Functionalization of Chitosan with Silver Oxide and Berberis aristata for Bioactive Composites with In Vivo Wound Healing Evaluation.

Molecular biotechnology·2026
Same journal

Resilience in Tribolium Castaneum Through Antioxidant Priming Under Phosphine-Induced Oxidative Stress.

Molecular biotechnology·2026
Same journal

MicroRNA-181a-5p promotes papillary thyroid carcinoma progress via the PTEN/AKT pathway.

Molecular biotechnology·2026
See all related articles

This article overviews clustering methods for large biological datasets, covering applications in genomics and proteomics. It introduces CLUTO, a toolkit for life science data analysis.

Area of Science:

  • Life Sciences
  • Bioinformatics
  • Computational Biology

Background:

  • Clustering is widely applied in life sciences for analyzing clinical information, phylogeny, genomics, and proteomics.
  • The increasing size of biological datasets presents unique challenges for clustering analysis.

Purpose of the Study:

  • To provide an overview of issues in clustering large biological datasets.
  • To describe common clustering approaches, their merits, and assumptions.
  • To offer insights for clustering diverse life science datasets.

Main Methods:

  • Review of established clustering algorithms.
  • Discussion of challenges in large-scale biological data analysis.
  • Introduction to the CLUTO toolkit and its life science applications.

Related Experiment Videos

Main Results:

  • Identification of key considerations for effective biological data clustering.
  • Evaluation of different clustering techniques for specific life science domains.
  • Demonstration of CLUTO's utility in analyzing complex biological data.

Conclusions:

  • Effective clustering of large biological datasets requires careful consideration of methodology.
  • CLUTO offers a versatile solution for various life science data analysis needs.
  • This overview aids researchers in selecting and applying appropriate clustering strategies.