Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a...

Aggregates Classification

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

Sampling Plans

Sampling Plans

Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...

Causes of Similarity-Dissimilarity Effect

Causes of Similarity-Dissimilarity Effect

The similarity-dissimilarity effect, a fundamental concept in social psychology, explains how interpersonal similarities and differences influence attraction and social interactions. This effect is supported by three key psychological perspectives: balance theory, social comparison theory, and consensual validation.Balance Theory and Cognitive ConsistencyBalance theory, developed by Fritz Heider, posits that individuals seek cognitive consistency in their relationships. When two people share...

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Self-organizing neural network-based generative AI with embedded error inflation control enhances effective knowledge extraction from preclinical studies with reduced sample size.

Pharmacological research·2026

Same author

A model-agnostic framework for dataset-specific selection of missing value imputation methods in pain-related numerical data.

Canadian journal of pain = Revue canadienne de la douleur·2026

Same author

Sleep and Aging. A Polysomnographic Follow-Up Study, Some 40 Years Later.

Journal of sleep research·2025

Same author

Augmenting small biomedical datasets using generative AI methods based on self-organizing neural networks.

Briefings in bioinformatics·2024

Same author

Revisiting Fold-Change Calculation: Preference for Median or Geometric Mean over Arithmetic Mean-Based Methods.

Biomedicines·2024

Same author

Development of an explainable AI system using routine clinical parameters for rapid differentiation of inflammatory conditions.

Frontiers in immunology·2024

Same journal

A harmonized fast-fashion garment-variant dataset for textile circularity and sustainability assessment.

Data in brief·2026

Same journal

Terahertz reflectivity dataset: Reading text on both sides of the page.

Data in brief·2026

Same journal

High-quality draft genome sequence data of <i>Levilactobacillus brevis</i> 3LB isolated from fermented milk koumiss.

Data in brief·2026

Same journal

Interview dataset: Encouraging the development of industrial symbiosis networks in Slovenia - transition to the circular economy.

Data in brief·2026

Same journal

Timeseries of multispectral and radar data and vegetation indices from Sentinel-1, Sentinel-2 and Landsat-8 at field scale.

Data in brief·2026

Same journal

BACI-VI-Bench: A dataset of variational inequality benchmark instances for multi-agent trade-network equilibrium.

Data in brief·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 22, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Clustering benchmark datasets exploiting the fundamental clustering problems.

Michael C Thrun^1,2, Alfred Ultsch¹

¹Databionics Research Group, Philipps-University of Marburg, Hans-Meerwein-Straße 6, D-35032 Marburg, Germany.

|May 7, 2020

Summary

This summary is machine-generated.

The Fundamental Clustering Problems Suite (FCPS) provides datasets for evaluating clustering algorithms. It helps identify shortcomings in algorithms and dimensionality reduction for complex, high-dimensional data.

Keywords:

Cluster analysis Dimensionality reduction Pattern recognition Projection methods

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

Related Experiment Videos

Last Updated: Dec 22, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

Area of Science:

Data Science
Machine Learning
Computer Science

Background:

Clustering algorithms are essential for data analysis, but their performance varies across different data structures.
Existing benchmarks may not fully capture the complexities of real-world, high-dimensional datasets.
The Fundamental Clustering Problems Suite (FCPS) was developed to address these limitations.

Purpose of the Study:

To introduce the Fundamental Clustering Problems Suite (FCPS) as a comprehensive benchmark for clustering algorithms.
To provide datasets designed to challenge and evaluate the capabilities of clustering and dimensionality reduction methods.
To facilitate the investigation of algorithm shortcomings, particularly in higher dimensions.

Main Methods:

The FCPS comprises datasets with known classifications, designed for visualization in 2D or 3D.
Datasets are intentionally crafted to represent specific clustering challenges.
Includes user-defined sample sizes via an R package and distance matrices for high-dimensional datasets (Leukemia, Tetragonula).

Main Results:

The FCPS datasets highlight varying success rates of known clustering algorithms.
Demonstrates the utility of FCPS in revealing limitations of dimensionality reduction techniques for datasets beyond 3D.
Provides a standardized suite for comparative analysis of clustering algorithm performance.

Conclusions:

The FCPS is a valuable resource for assessing the robustness and limitations of clustering algorithms.
It serves as a critical tool for advancing the development of more effective clustering and dimensionality reduction methods.
The suite is particularly relevant for tackling challenges posed by high-dimensional and complex data structures.