Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.
The...

Calibration Curves: Correlation Coefficient

Calibration Curves: Correlation Coefficient

In a linear calibration curve, there is a value called the calibration coefficient, denoted by 'r,' which measures the strength and the direction of association between two variables. The correlation coefficient value ranges from −1 to +1. A value of +1 indicates a perfect positive linear correlation, −1 denotes a perfect negative correlation, and 0 implies no correlation between the two variables. A positive correlation value establishes that as one variable increases, the other increases, and...

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Calibration Curves: Linear Least Squares

Calibration Curves: Linear Least Squares

A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same journal

Invaders taking over-Mollusc faunal change in volcanic barrier lakes of the Albertine Rift biodiversity hotspot.

PloS one·2026

Same journal

AI-driven molecular diversification and ligand-based optimization of macitentan derivatives targeting VEGFR1 and endothelin signaling pathways.

PloS one·2026

Same journal

Performance patterns and records in the world aquatics masters championships: Where do the most frequently represented nations among the top-ten masters swimmers come from?

PloS one·2026

Same journal

Modeling diurnal Temperature-Rainfall relationships under multicollinearity using PLS-SEM: A case study of Ghana.

PloS one·2026

Same journal

Organizational culture, social capital, and emergency capacity in primary healthcare institutions: A cross-sectional structural equation modeling study comparing ordinary and older communities.

PloS one·2026

Same journal

Impact of kidney function on the metabolome in the general population.

PloS one·2026

See all related articles

Search research articles

Related Experiment Videos

PEC-CDC: A prediction error-based calibration framework for robust unsupervised deep clustering.

¹College of Art, Northeast Agricultural University, Heilongjiang, Harbin, China.

|June 25, 2026

Summary

This summary is machine-generated.

This study introduces PEC-CDC, a new deep clustering framework that uses regression instead of probabilities to overcome overconfidence. This novel approach significantly improves clustering accuracy and semantic boundary definition.

Related Experiment Videos

Area of Science:

Computer Science
Artificial Intelligence
Machine Learning

Background:

Deep clustering methods often suffer from overconfidence due to probability-based calibration.
Existing frameworks like Calibrated Deep Clustering (CDC) still rely on relative metrics, limiting performance on ambiguous samples.
Inter-class normalization in probability metrics can lead to inaccurate high confidence scores for poorly distinguished data points.

Purpose of the Study:

To address the overconfidence issue in deep clustering by proposing a novel calibration approach.
To enhance the robustness and accuracy of deep clustering by moving beyond probability-based metrics.
To introduce a new framework, PEC-CDC, that utilizes prediction error for more reliable cluster assignment.

Main Methods:

Reconfigured the Calibrated Deep Clustering (CDC) architecture by replacing the probability-based calibration head with a regression-based Prediction Error-based Classification (PEC) head.
Employed a frozen Teacher-Student module to shift model evaluation from relative confidence to absolute sample-cluster distribution consistency.
Utilized t-SNE and error distribution visualizations for qualitative analysis of semantic boundaries and cluster compactness.

Main Results:

The proposed PEC-CDC framework demonstrates superior semantic boundaries and compact cluster manifolds.
Achieved state-of-the-art performance in deep clustering tasks.
On the CIFAR-100 dataset, PEC-CDC attained an Adjusted Rand Index (ARI) of 45.61%, significantly outperforming leading baselines.

Conclusions:

Prediction error serves as an absolute measure resistant to overconfidence, outperforming relative probability metrics.
The PEC-CDC framework effectively mitigates overconfidence in deep clustering.
The novel regression-based approach offers a significant advancement in clustering accuracy and reliability.