Optimal Bayesian estimators for latent variable cluster models.

Riccardo Rastelli¹, Nial Friel²,³

¹Institute for Statistics and Mathematics, WU Vienna University of Economics and Business, Vienna, Austria.

Statistics and Computing

September 18, 2018

Summary

This summary is machine-generated.

This study introduces a novel Bayesian approach for cluster analysis, offering a fast algorithm to identify optimal group partitions and automatically determine the number of clusters. This method enhances the interpretation of complex clustering models.

Keywords:

Bayesian clustering; Cluster analysis; Greedy optimisation; Latent variable models; Markov chain Monte Carlo

Area of Science:

Statistics
Machine Learning
Data Mining

Background:

Cluster analysis aims to group similar individuals or items.
Bayesian methods yield a posterior distribution over cluster allocations, but scalable tools for summarising and interpreting these latent allocation variables are lacking.
Existing methods struggle with the categorical nature of the clustering variables and with determining the optimal number of groups.

Purpose of the Study:

To develop a scalable Bayesian decision-theoretic framework for cluster analysis.
To propose a fast, context-independent greedy algorithm for optimal cluster allocation.
To simultaneously solve clustering and model-choice problems by automatically selecting the optimal number of groups.
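
In general decision-theoretic terms, the aims above can be written as a single minimisation. The notation here is assumed for illustration and is not taken from the paper: $y$ is the data, $z$ the unknown vector of cluster labels, $a$ a candidate partition, $L$ a loss function comparing two partitions, and $z^{(1)}, \dots, z^{(T)}$ posterior draws of the allocations (for example, MCMC output).

```latex
\hat{z}
  \;=\; \arg\min_{a} \; \mathbb{E}\!\left[ L(a, z) \mid y \right]
  \;\approx\; \arg\min_{a} \; \frac{1}{T} \sum_{t=1}^{T} L\!\left(a, z^{(t)}\right)
```

Because the minimisation ranges over partitions with any number of groups, the number of non-empty groups in $\hat{z}$ is obtained as a by-product rather than fixed in advance.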

Main Methods:

Used a Bayesian decision-theoretic approach to define an optimality criterion for clusterings.
Developed a fast and context-independent greedy algorithm for finding optimal allocations.
Incorporated various loss functions to compare and evaluate different partitions.
Applied the approach to Gaussian mixtures, stochastic block models, and latent block models.
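
The steps listed above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: Binder's pairwise loss is chosen only as an example (the summary does not say which loss functions the paper uses), the sweep-based search is a generic greedy scheme, and the names `binder_loss`, `expected_posterior_loss`, `greedy_partition`, and the toy `draws` are all hypothetical.

```python
import numpy as np


def binder_loss(a, b):
    """Count the item pairs that one partition groups together and the other splits."""
    a, b = np.asarray(a), np.asarray(b)
    together_a = a[:, None] == a[None, :]  # co-clustering matrix of partition a
    together_b = b[:, None] == b[None, :]  # co-clustering matrix of partition b
    # The strict upper triangle counts each unordered pair exactly once.
    return int(np.triu(together_a != together_b, k=1).sum())


def expected_posterior_loss(candidate, samples, loss=binder_loss):
    """Monte Carlo estimate: average loss of `candidate` against posterior draws."""
    return float(np.mean([loss(candidate, s) for s in samples]))


def greedy_partition(samples, max_groups, loss=binder_loss, max_sweeps=50):
    """Reassign one item at a time to the label with the lowest expected loss."""
    samples = np.asarray(samples)
    n_items = samples.shape[1]
    z = np.arange(n_items) % max_groups  # arbitrary starting partition (assumption)
    current = expected_posterior_loss(z, samples, loss)
    for _ in range(max_sweeps):
        moved = False
        for i in range(n_items):
            for k in range(max_groups):
                if k == z[i]:
                    continue
                trial = z.copy()
                trial[i] = k
                value = expected_posterior_loss(trial, samples, loss)
                if value < current - 1e-12:  # accept strict improvements only
                    z, current, moved = trial, value, True
        if not moved:  # a full sweep with no move: local optimum reached
            break
    # Relabel so that the groups actually used are numbered 0, 1, 2, ...
    labels = np.unique(z, return_inverse=True)[1]
    return labels, current


# Toy posterior draws for six items (made-up data, including label switching).
draws = [[0, 0, 0, 1, 1, 1]] * 5 + [[1, 1, 1, 0, 0, 0]] * 3 + [[0, 0, 1, 1, 1, 1]] * 2
z_hat, epl = greedy_partition(draws, max_groups=3)
print("partition:", z_hat, "groups used:", len(set(z_hat.tolist())), "expected loss:", epl)
```

The loss compares co-clustering patterns rather than raw labels, so draws that differ only by a relabelling of the groups count as identical. `max_groups` is only an upper bound: groups left empty by the search disappear in the final relabelling, which is how the number of groups is chosen alongside the allocation. A greedy search of this kind is only guaranteed to stop at a local optimum, and this naive version recomputes the full loss for every trial move, so it is suitable for small examples only.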

Main Results:

The proposed greedy algorithm efficiently searches for the cluster allocation that is optimal under the chosen loss function.
The method automatically selects the optimal number of groups, addressing model-choice uncertainty.
Demonstrated effectiveness across diverse clustering models and datasets (artificial and real).

Conclusions:

The developed Bayesian framework provides a robust and scalable solution for interpreting clustering results.
The automatic selection of the number of clusters simplifies model selection in cluster analysis.
This approach offers a versatile tool for various applications requiring probabilistic partitioning.