Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Aggregates Classification01:29

Aggregates Classification

666
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
666
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

348
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
348
Cluster Sampling Method01:20

Cluster Sampling Method

13.8K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
13.8K
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

190
Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
190
Introduction to Nonparametric Statistics01:28

Introduction to Nonparametric Statistics

1.1K
Nonparametric statistics offer a powerful alternative to traditional parametric methods, useful when assumptions about the population distribution cannot be made. Unlike parametric tests, which require data to follow a specific distribution with well-defined parameters (such as the mean and standard deviation), nonparametric tests do not require such constraints. This makes them particularly valuable when dealing with small sample sizes, skewed data, or ordinal and categorical variables.
One of...
1.1K
Classification of Systems-I01:26

Classification of Systems-I

470
Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
470

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Shorebird loss increases soil CO<sub>2</sub> emissions in coastal wetlands under restoration.

Fundamental research·2026
Same author

Evidence for an Electronically Driven Charge Density Wave in a 1D Metallic MOF.

ACS central science·2026
Same author

Redox-Active Doubly Boron-Doped Indenofluorenes: Isomer-Directed Access to Open- and Closed-Shell Electronic States Induces Strong Near-Infrared Activity.

Journal of the American Chemical Society·2026
Same author

Mammalian-like steroidogenesis in plants gives rise to endocrine-mimetic cardenolides.

Science advances·2026
Same author

Sterol trafficking in yeast studied by one- and two-photon live-cell imaging of an intrinsically fluorescent ergosterol analog.

Methods and applications in fluorescence·2026
Same author

The risk of psychosis associated with cannabis use by people with attention-deficit hyperactivity disorder: A systematic review.

Australasian psychiatry : bulletin of Royal Australian and New Zealand College of Psychiatrists·2026
Same journal

Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

fastkqr: A Fast Algorithm for Kernel Quantile Regression.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Empirical Bayes Covariance Decomposition, and a Solution to the Multiple Tuning Problem in Sparse PCA.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Joint Registration and Conformal Prediction for Partially Observed Functional Data.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Efficient Decision Trees for Tensor Regressions.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Distributed Nonparametric Regression with Heterogeneity Through Prediction-Based Aggregation.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
See all related articles

Related Experiment Video

Updated: Dec 7, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.2K

Scalable Bayesian Nonparametric Clustering and Classification.

Yang Ni1,2, Peter Müller3, Maurice Diesendruck2

  • 1Department of Statistics, Texas A&M University.

Journal of Computational and Graphical Statistics : a Joint Publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America
|September 28, 2020
PubMed
Summary
This summary is machine-generated.

We present a scalable Monte Carlo algorithm for Bayesian nonparametric models, enabling efficient clustering and classification on large datasets. This parallelizable method enhances analysis of complex data, including electronic health records.

Keywords:
Electronic health recordsnon-conjugate modelsparallel computingproduct partition models

More Related Videos

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.6K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.8K

Related Experiment Videos

Last Updated: Dec 7, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.2K
A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.6K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.8K

Area of Science:

  • Computational Statistics
  • Machine Learning
  • Bayesian Inference

Background:

  • Bayesian nonparametric models are powerful for clustering and classification but often computationally intensive.
  • Scalable inference methods are crucial for applying these models to large, complex datasets.

Purpose of the Study:

  • To develop a scalable, multi-step Monte Carlo algorithm for inference in a broad class of Bayesian nonparametric models.
  • To demonstrate the algorithm's applicability to large datasets for clustering and classification tasks.

Main Methods:

  • A novel, embarrassingly parallel multi-step Monte Carlo algorithm.
  • Utilizes a single Markov chain Monte Carlo sampler across all steps.
  • Applied to a product partition model with regression on covariates.

Main Results:

  • The algorithm enables efficient inference for Bayesian nonparametric mixture models on large datasets.
  • Achieved competitive classification performance on electronic health records and bank telemarketing datasets.
  • Identified meaningful clusters within the analyzed data.

Conclusions:

  • The proposed scalable algorithm significantly enhances the applicability of Bayesian nonparametric models to large-scale data.
  • Offers a general and simple approach for inference in complex mixture models.
  • Demonstrates practical utility in real-world applications like healthcare and marketing.