Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Probability Distributions

Probability Distributions

The probability of a random variable x is the likelihood of its occurrence. A probability distribution represents the probabilities of a random variable using a formula, graph, or table. There are two types of probability distribution– discrete probability distribution and continuous probability distribution.
A discrete probability distribution is a probability distribution of discrete random variables. It can be categorized into binomial probability distribution and Poisson probability...

How Data are Classified: Categorical Data

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...

Censoring Survival Data

Censoring Survival Data

Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different reasons...

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...

Data: Types and Distribution

Data: Types and Distribution

In biostatistics, data are the observations collected for analysis. There are two main types: parametric and non-parametric. Parametric data, which include continuous (e.g., weight) and discrete numerical data (e.g., number of tablets), assume a particular distribution pattern, often the normal distribution. Non-parametric data do not adhere to a specific distribution and typically comprise nominal (e.g., gender) and ordinal categorical data (e.g., pain scale ratings).
Distributions in...

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance, comparing...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

KAT2A-IGF2BP1-CXCL2 axis in the high lactate tumor microenvironment facilitates resistance to anti-PD-1 therapy in lung adenocarcinoma by recruiting myeloid-derived suppressor cells.

Cell death & disease·2026

Same author

Sequential Gibbs posteriors with applications to principal component analysis.

Biometrika·2026

Same author

Scalable and robust regression models for continuous proportional data.

Journal of the American Statistical Association·2026

Same author

Enhanced nitrogen removal via simultaneous nitrification and denitrification by a newly isolated strain Enterobacter cloacae GW6 from estuarine sediment.

PloS one·2026

Same author

Local graph estimation with pathwise false discovery control.

Nature communications·2026

Same author

Single-cell transcriptomic analysis reveals that the circRNA circGCLM promotes tumorigenesis and confers cisplatin resistance in NSCLC through the miR-505-3p/ERBB4 axis.

Translational oncology·2026

Same journal

Classification Under Local Differential Privacy with Model Reversal and Model Averaging.

Journal of machine learning research : JMLR·2026

Same journal

Sparse Semiparametric Discriminant Analysis for High-dimensional Zero-inflated Data.

Journal of machine learning research : JMLR·2026

Same journal

Heterogeneity-aware Clustered Distributed Learning for Multi-source Data Analysis.

Journal of machine learning research : JMLR·2026

Same journal

Unsupervised Tree Boosting for Learning Probability Distributions.

Journal of machine learning research : JMLR·2026

Same journal

A Two-Stage Penalized Least Squares Method for Constructing Large Systems of Structural Equations.

Journal of machine learning research : JMLR·2026

Same journal

Bayesian Multinomial Logistic Normal Models through Marginally Latent Matrix-T Processes.

Journal of machine learning research : JMLR·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 8, 2026

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Published on: November 2, 2012

Classification with Incomplete Data Using Dirichlet Process Priors.

Chunping Wang¹, Xuejun Liao, Lawrence Carin

¹Department of Electrical and Computer Engineering, Duke University, Durham, NC 27708-0291, USA.

Journal of Machine Learning Research : JMLR

|August 31, 2013

Summary

This summary is machine-generated.

This study introduces a flexible Bayesian framework for building classifiers using multiple simple "experts." This approach efficiently handles incomplete data and enables multi-task learning across datasets, offering a powerful new tool for machine learning.

Keywords:

Dirichlet process classification expert incomplete data multitask learning variational Bayesian

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Related Experiment Videos

Last Updated: May 8, 2026

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Published on: November 2, 2012

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Area of Science:

Machine Learning
Statistical Modeling
Computational Statistics

Background:

Classifier design often relies on rigid models.
Handling incomplete data and multi-dataset learning presents significant challenges.
Existing non-parametric methods may lack flexibility or efficient inference.

Purpose of the Study:

To develop a non-parametric hierarchical Bayesian framework for classifier design.
To enable analytical handling of incomplete data through simple "expert" classifiers.
To extend the framework for non-parametric multi-task learning.

Main Methods:

A Dirichlet process formulation to define experts and their construction.
Incorporation of linear classifiers as local "experts."
Variational Bayesian (VB) analysis for fast inference.
Gibbs sampling for comparative inference.

Main Results:

The proposed framework successfully designs classifiers using a mixture of experts.
Analytical handling of incomplete data was demonstrated.
The model effectively performs non-parametric multi-task learning.
VB inference provided fast and comparable results to Gibbs sampling.

Conclusions:

The developed Bayesian framework offers a flexible and robust approach to classifier design.
The method excels in handling incomplete data and multi-task learning scenarios.
Variational Bayesian inference enables efficient application of this powerful technique.