Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Probability Distributions01:32

Probability Distributions

The probability of a random variable x  is the likelihood of its occurrence. A probability distribution represents the probabilities of a random variable using a formula, graph, or table. There are two types of probability distribution– discrete probability distribution and continuous probability distribution.
A discrete probability distribution is a probability distribution of discrete random variables. It can be categorized into binomial probability distribution and Poisson probability...
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
Censoring Survival Data01:09

Censoring Survival Data

Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different reasons...
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
Data: Types and Distribution01:19

Data: Types and Distribution

In biostatistics, data are the observations collected for analysis. There are two main types: parametric and non-parametric. Parametric data, which include continuous (e.g., weight) and discrete numerical data (e.g., number of tablets), assume a particular distribution pattern, often the normal distribution. Non-parametric data do not adhere to a specific distribution and typically comprise nominal (e.g., gender) and ordinal categorical data (e.g., pain scale ratings).
Distributions in...
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance, comparing...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

KAT2A-IGF2BP1-CXCL2 axis in the high lactate tumor microenvironment facilitates resistance to anti-PD-1 therapy in lung adenocarcinoma by recruiting myeloid-derived suppressor cells.

Cell death & disease·2026
Same author

Sequential Gibbs posteriors with applications to principal component analysis.

Biometrika·2026
Same author

Scalable and robust regression models for continuous proportional data.

Journal of the American Statistical Association·2026
Same author

Enhanced nitrogen removal via simultaneous nitrification and denitrification by a newly isolated strain Enterobacter cloacae GW6 from estuarine sediment.

PloS one·2026
Same author

Local graph estimation with pathwise false discovery control.

Nature communications·2026
Same author

Single-cell transcriptomic analysis reveals that the circRNA circGCLM promotes tumorigenesis and confers cisplatin resistance in NSCLC through the miR-505-3p/ERBB4 axis.

Translational oncology·2026
Same journal

Classification Under Local Differential Privacy with Model Reversal and Model Averaging.

Journal of machine learning research : JMLR·2026
Same journal

Sparse Semiparametric Discriminant Analysis for High-dimensional Zero-inflated Data.

Journal of machine learning research : JMLR·2026
Same journal

Heterogeneity-aware Clustered Distributed Learning for Multi-source Data Analysis.

Journal of machine learning research : JMLR·2026
Same journal

Unsupervised Tree Boosting for Learning Probability Distributions.

Journal of machine learning research : JMLR·2026
Same journal

A Two-Stage Penalized Least Squares Method for Constructing Large Systems of Structural Equations.

Journal of machine learning research : JMLR·2026
Same journal

Bayesian Multinomial Logistic Normal Models through Marginally Latent Matrix-T Processes.

Journal of machine learning research : JMLR·2026
See all related articles

Related Experiment Video

Updated: May 8, 2026

Creating Objects and Object Categories for Studying Perception and Perceptual Learning
14:38

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Published on: November 2, 2012

Classification with Incomplete Data Using Dirichlet Process Priors.

Chunping Wang1, Xuejun Liao, Lawrence Carin

  • 1Department of Electrical and Computer Engineering, Duke University, Durham, NC 27708-0291, USA.

Journal of Machine Learning Research : JMLR
|August 31, 2013
PubMed
Summary
This summary is machine-generated.

This study introduces a flexible Bayesian framework for building classifiers using multiple simple "experts." This approach efficiently handles incomplete data and enables multi-task learning across datasets, offering a powerful new tool for machine learning.

Keywords:
Dirichlet processclassificationexpertincomplete datamultitask learningvariational Bayesian

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Related Experiment Videos

Last Updated: May 8, 2026

Creating Objects and Object Categories for Studying Perception and Perceptual Learning
14:38

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Published on: November 2, 2012

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Area of Science:

  • Machine Learning
  • Statistical Modeling
  • Computational Statistics

Background:

  • Classifier design often relies on rigid models.
  • Handling incomplete data and multi-dataset learning presents significant challenges.
  • Existing non-parametric methods may lack flexibility or efficient inference.

Purpose of the Study:

  • To develop a non-parametric hierarchical Bayesian framework for classifier design.
  • To enable analytical handling of incomplete data through simple "expert" classifiers.
  • To extend the framework for non-parametric multi-task learning.

Main Methods:

  • A Dirichlet process formulation to define experts and their construction.
  • Incorporation of linear classifiers as local "experts."
  • Variational Bayesian (VB) analysis for fast inference.
  • Gibbs sampling for comparative inference.

Main Results:

  • The proposed framework successfully designs classifiers using a mixture of experts.
  • Analytical handling of incomplete data was demonstrated.
  • The model effectively performs non-parametric multi-task learning.
  • VB inference provided fast and comparable results to Gibbs sampling.

Conclusions:

  • The developed Bayesian framework offers a flexible and robust approach to classifier design.
  • The method excels in handling incomplete data and multi-task learning scenarios.
  • Variational Bayesian inference enables efficient application of this powerful technique.