Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

13.8K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
13.8K
Survival Tree01:19

Survival Tree

333
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
333
Aggregates Classification01:29

Aggregates Classification

887
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
887
Sampling Plans01:23

Sampling Plans

808
Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...
808
Causes of Similarity-Dissimilarity Effect01:26

Causes of Similarity-Dissimilarity Effect

190
The similarity-dissimilarity effect, a fundamental concept in social psychology, explains how interpersonal similarities and differences influence attraction and social interactions. This effect is supported by three key psychological perspectives: balance theory, social comparison theory, and consensual validation.Balance Theory and Cognitive ConsistencyBalance theory, developed by Fritz Heider, posits that individuals seek cognitive consistency in their relationships. When two people share...
190
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.4K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Self-organizing neural network-based generative AI with embedded error inflation control enhances effective knowledge extraction from preclinical studies with reduced sample size.

Pharmacological research·2026
Same author

A model-agnostic framework for dataset-specific selection of missing value imputation methods in pain-related numerical data.

Canadian journal of pain = Revue canadienne de la douleur·2026
Same author

Sleep and Aging. A Polysomnographic Follow-Up Study, Some 40 Years Later.

Journal of sleep research·2025
Same author

Augmenting small biomedical datasets using generative AI methods based on self-organizing neural networks.

Briefings in bioinformatics·2024
Same author

Revisiting Fold-Change Calculation: Preference for Median or Geometric Mean over Arithmetic Mean-Based Methods.

Biomedicines·2024
Same author

Development of an explainable AI system using routine clinical parameters for rapid differentiation of inflammatory conditions.

Frontiers in immunology·2024
Same journal

A harmonized fast-fashion garment-variant dataset for textile circularity and sustainability assessment.

Data in brief·2026
Same journal

Terahertz reflectivity dataset: Reading text on both sides of the page.

Data in brief·2026
Same journal

High-quality draft genome sequence data of <i>Levilactobacillus brevis</i> 3LB isolated from fermented milk koumiss.

Data in brief·2026
Same journal

Interview dataset: Encouraging the development of industrial symbiosis networks in Slovenia - transition to the circular economy.

Data in brief·2026
Same journal

Timeseries of multispectral and radar data and vegetation indices from Sentinel-1, Sentinel-2 and Landsat-8 at field scale.

Data in brief·2026
Same journal

BACI-VI-Bench: A dataset of variational inequality benchmark instances for multi-agent trade-network equilibrium.

Data in brief·2026
See all related articles

Related Experiment Video

Updated: Dec 22, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.2K

Clustering benchmark datasets exploiting the fundamental clustering problems.

Michael C Thrun1,2, Alfred Ultsch1

  • 1Databionics Research Group, Philipps-University of Marburg, Hans-Meerwein-Straße 6, D-35032 Marburg, Germany.

Data in Brief
|May 7, 2020
PubMed
Summary
This summary is machine-generated.

The Fundamental Clustering Problems Suite (FCPS) provides datasets for evaluating clustering algorithms. It helps identify shortcomings in algorithms and dimensionality reduction for complex, high-dimensional data.

Keywords:
Cluster analysisDimensionality reductionPattern recognitionProjection methods

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.8K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.8K

Related Experiment Videos

Last Updated: Dec 22, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.2K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.8K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.8K

Area of Science:

  • Data Science
  • Machine Learning
  • Computer Science

Background:

  • Clustering algorithms are essential for data analysis, but their performance varies across different data structures.
  • Existing benchmarks may not fully capture the complexities of real-world, high-dimensional datasets.
  • The Fundamental Clustering Problems Suite (FCPS) was developed to address these limitations.

Purpose of the Study:

  • To introduce the Fundamental Clustering Problems Suite (FCPS) as a comprehensive benchmark for clustering algorithms.
  • To provide datasets designed to challenge and evaluate the capabilities of clustering and dimensionality reduction methods.
  • To facilitate the investigation of algorithm shortcomings, particularly in higher dimensions.

Main Methods:

  • The FCPS comprises datasets with known classifications, designed for visualization in 2D or 3D.
  • Datasets are intentionally crafted to represent specific clustering challenges.
  • Includes user-defined sample sizes via an R package and distance matrices for high-dimensional datasets (Leukemia, Tetragonula).

Main Results:

  • The FCPS datasets highlight varying success rates of known clustering algorithms.
  • Demonstrates the utility of FCPS in revealing limitations of dimensionality reduction techniques for datasets beyond 3D.
  • Provides a standardized suite for comparative analysis of clustering algorithm performance.

Conclusions:

  • The FCPS is a valuable resource for assessing the robustness and limitations of clustering algorithms.
  • It serves as a critical tool for advancing the development of more effective clustering and dimensionality reduction methods.
  • The suite is particularly relevant for tackling challenges posed by high-dimensional and complex data structures.