Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

11.5K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
11.5K
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

47
Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
47
Multicompartment Models: Overview01:14

Multicompartment Models: Overview

65
Multicompartment models are mathematical constructs that depict how drugs are distributed and eliminated within the body. They segment the body into several compartments, symbolizing various physiological or anatomical areas connected through drug transfer processes such as absorption, metabolism, distribution, and elimination.
These models offer a more comprehensive representation of drug behavior in the body than one-compartment models. They accommodate the complexity of drug distribution,...
65
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

233
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
233
Collisions in Multiple Dimensions: Problem Solving01:06

Collisions in Multiple Dimensions: Problem Solving

3.5K
In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...
3.5K
Collisions in Multiple Dimensions: Introduction01:05

Collisions in Multiple Dimensions: Introduction

4.4K
It is far more common for collisions to occur in two dimensions; that is, the initial velocity vectors are neither parallel nor antiparallel to each other. Let's see what complications arise from this. The first idea is that momentum is a vector. Like all vectors, it can be expressed as a sum of perpendicular components (usually, though not always, an x-component and a y-component, and a z-component if necessary). Thus, when the statement of conservation of momentum is written for a...
4.4K
  1. Home
  2. Research Domains
  3. Mathematical Sciences
  4. Statistics
  5. Large And Complex Data Theory
  6. Escaping The Curse Of Dimensionality In Bayesian Model-based Clustering.
  1. Home
  2. Research Domains
  3. Mathematical Sciences
  4. Statistics
  5. Large And Complex Data Theory
  6. Escaping The Curse Of Dimensionality In Bayesian Model-based Clustering.

Related Experiment Video

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.4K

Escaping The Curse of Dimensionality in Bayesian Model-Based Clustering.

Noirrit Kiran Chandra1, Antonio Canale2, David B Dunson3

  • 1Department of Mathematical Sciences The University of Texas at Dallas Richardson, TX, USA.

Journal of Machine Learning Research : JMLR
|April 16, 2025

View abstract on PubMed

Summary
This summary is machine-generated.

Bayesian mixture models struggle with high-dimensional data clustering. This study explains why and introduces Latent Mixtures for Bayesian Clustering (Lamb) to overcome these challenges in dimensionality.

Keywords:
Big dataClusteringDirichlet processExchangeable partition probability function

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.9K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.2K

Related Experiment Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.4K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.9K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.2K

Area of Science:

  • Statistics
  • Machine Learning
  • Computational Biology

Background:

  • Bayesian mixture models are standard for clustering high-dimensional data.
  • High dimensionality can lead to incorrect cluster counts in posterior inference.

Purpose of the Study:

  • To explain the tendency of Bayesian clustering to produce too many or too few clusters in high dimensions.
  • To propose a novel Bayesian clustering method that addresses high-dimensionality issues.

Main Methods:

  • Analysis of the random partition posterior in a fixed-sample, increasing-dimension setting.
  • Development of Latent Mixtures for Bayesian Clustering (Lamb) using low-dimensional latent variables.
  • Scalable posterior inference techniques for the proposed model.
High dimensional
Latent variables
Mixture model

Main Results:

  • Identified conditions for posterior inference favoring extreme clustering (all separate or all together) as dimension increases.
  • Demonstrated that these conditions are prior-choice independent.
  • Showcased Lamb's ability to avoid high-dimensionality pitfalls under mild assumptions.

Conclusions:

  • The proposed Latent Mixtures for Bayesian Clustering (Lamb) offers a robust solution for high-dimensional data.
  • Lamb demonstrates strong performance in simulations and real-world applications like single-cell RNA sequencing data analysis.