Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

What are Estimates?01:06

What are Estimates?

8.8K
It isn't easy to measure a parameter such as the mean height or the mean weight of a population. So, we draw samples from the population and calculate the mean height or mean weight of the individuals in the sample. This sample data acts as a representative measure of the population parameter. These sample statistics are known as estimates. 
The estimate for the mean of a sample is denoted by ͞x, whereas the mean of the population is designated as μ. Further, parameters such...
8.8K
Variability: Analysis01:11

Variability: Analysis

519
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
519
Random Variables01:09

Random Variables

17.9K
A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
17.9K
Graphs of Equations in Two Variables01:30

Graphs of Equations in Two Variables

249
An equation with two variables, typically written in the form y = f(x) or Ax + By = C, describes a relationship between quantities represented by x and y. Each solution to such an equation is an ordered pair (x, y) that satisfies the equation when substituted. These pairs can be represented graphically to understand the variables' relationship visually.A common technique for constructing the graph of a two-variable equation is to create a value table. Begin by choosing several values for the...
249
Variables Affecting Phosphorescence and Fluorescence01:26

Variables Affecting Phosphorescence and Fluorescence

1.5K
Fluorescence and phosphorescence are essential phenomena in fields like analytical chemistry, biological imaging, and materials science, where they detect molecular properties and visualize cellular structures. Understanding the variables that influence these luminescent behaviors is crucial for maximizing accuracy and efficiency in their applications. These variables can broadly be grouped into chemical structure, solvent properties, and external conditions, each playing a distinct role in...
1.5K
Cluster Sampling Method01:20

Cluster Sampling Method

14.7K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
14.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A stochastic block prior for clustering in graphical models.

Psychological methods·2026
Same author

Bayesian clustering of spatially distributed compositional data with application to the Great Barrier Reef.

Scientific reports·2024
Same author

Assessing epidemic curves for evidence of superspreading.

Journal of the Royal Statistical Society. Series A, (Statistics in Society)·2023
Same author

Calibrating COVID-19 susceptible-exposed-infected-removed models with time-varying effectivecontact rates.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2021
Same author

A Model-Based Approach to Assess Epidemic Risk.

Statistics in biosciences·2021
Same author

Adaptive Incremental Mixture Markov Chain Monte Carlo.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2020
Same journal

Neural posterior estimation on exponential random graph models: evaluating bias and implementation challenges.

Statistics and computing·2026
Same journal

Subgroup Analysis of Differential Networks with Latent Variables.

Statistics and computing·2026
Same journal

Non-negative matrix factorization algorithms generally improve topic model fits.

Statistics and computing·2026
Same journal

Approximating evidence via bounded harmonic means.

Statistics and computing·2026
Same journal

Efficient Inference in First Passage Time Models.

Statistics and computing·2026
Same journal

Optimal <i>F</i>-score Matching for Bipartite Record Linkage.

Statistics and computing·2026
See all related articles

Related Experiment Video

Updated: Feb 5, 2026

Optimization of the Retinal Vein Occlusion Mouse Model to Limit Variability
07:23

Optimization of the Retinal Vein Occlusion Mouse Model to Limit Variability

Published on: August 6, 2021

3.2K

Optimal Bayesian estimators for latent variable cluster models.

Riccardo Rastelli1, Nial Friel2,3

  • 11Institute for Statistics and Mathematics, WU Vienna University of Economics and Business, Vienna, Austria.

Statistics and Computing
|September 18, 2018
PubMed
Summary
This summary is machine-generated.

This study introduces a novel Bayesian approach for cluster analysis, offering a fast algorithm to identify optimal group partitions and automatically determine the number of clusters. This method enhances the interpretation of complex clustering models.

Keywords:
Bayesian clusteringCluster analysisGreedy optimisationLatent variable modelsMarkov chain Monte Carlo

More Related Videos

A Modeling and Simulation Method for Preliminary Design of an Electro-Variable Displacement Pump
09:04

A Modeling and Simulation Method for Preliminary Design of an Electro-Variable Displacement Pump

Published on: June 1, 2022

3.7K
A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.7K

Related Experiment Videos

Last Updated: Feb 5, 2026

Optimization of the Retinal Vein Occlusion Mouse Model to Limit Variability
07:23

Optimization of the Retinal Vein Occlusion Mouse Model to Limit Variability

Published on: August 6, 2021

3.2K
A Modeling and Simulation Method for Preliminary Design of an Electro-Variable Displacement Pump
09:04

A Modeling and Simulation Method for Preliminary Design of an Electro-Variable Displacement Pump

Published on: June 1, 2022

3.7K
A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.7K

Area of Science:

  • Statistics
  • Machine Learning
  • Data Mining

Background:

  • Cluster analysis aims to group similar individuals or items.
  • Bayesian methods offer probabilistic clustering but lack scalable interpretation tools for latent allocation variables.
  • Existing methods struggle with categorical clustering variables and determining the optimal number of groups.

Purpose of the Study:

  • To develop a scalable Bayesian decision-theoretic framework for cluster analysis.
  • To propose a fast, context-independent greedy algorithm for optimal cluster allocation.
  • To simultaneously solve clustering and model-choice problems by automatically selecting the optimal number of groups.

Main Methods:

  • Utilized a Bayesian decision-theoretic approach to define an optimality criterion for clusterings.
  • Developed a fast and context-independent greedy algorithm for finding optimal allocations.
  • Incorporated various loss functions to compare and evaluate different partitions.
  • Applied the approach to Gaussian mixtures, stochastic block models, and latent block models.

Main Results:

  • The proposed greedy algorithm efficiently finds optimal cluster allocations.
  • The method automatically selects the optimal number of groups, addressing model-choice uncertainty.
  • Demonstrated effectiveness across diverse clustering models and datasets (artificial and real).

Conclusions:

  • The developed Bayesian framework provides a robust and scalable solution for interpreting clustering results.
  • The automatic selection of the number of clusters simplifies model selection in cluster analysis.
  • This approach offers a versatile tool for various applications requiring probabilistic partitioning.