Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Factorial Design02:01

Factorial Design

Factorial Analysis is an experimental design that applies Analysis of Variance (ANOVA) statistical procedures to examine a change in a dependent variable due to more than one independent variable, also known as factors. Changes in worker productivity can be reasoned, for example, to be influenced by salary and other conditions, such as skill level. One way to test this hypothesis is by categorizing salary into three levels (low, moderate, and high) and skills sets into two levels (entry level...
Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least squares (OLS)...
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures from...
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
Multicompartment Models: Overview01:14

Multicompartment Models: Overview

Multicompartment models are mathematical constructs that depict how drugs are distributed and eliminated within the body. They segment the body into several compartments, symbolizing various physiological or anatomical areas connected through drug transfer processes such as absorption, metabolism, distribution, and elimination.
These models offer a more comprehensive representation of drug behavior in the body than one-compartment models. They accommodate the complexity of drug distribution,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Sequential Gibbs posteriors with applications to principal component analysis.

Biometrika·2026
Same author

Scalable and robust regression models for continuous proportional data.

Journal of the American Statistical Association·2026
Same author

Local graph estimation with pathwise false discovery control.

Nature communications·2026
Same author

Bayesian Transfer Learning.

Statistical science : a review journal of the Institute of Mathematical Statistics·2026
Same author

Domain Adaptive Bootstrap Aggregating.

IEEE transactions on signal processing : a publication of the IEEE Signal Processing Society·2026
Same author

Logistic-Beta Processes for Dependent Random Probabilities with Beta Marginals.

Bayesian analysis·2026
Same journal

Instrumental Variable Estimation of Marginal Structural Mean Models for Time-Varying Treatment.

Journal of the American Statistical Association·2026
Same journal

Semiparametric Joint Modeling for Survival Analysis with Longitudinal Covariates.

Journal of the American Statistical Association·2026
Same journal

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Association·2026
Same journal

Facilitating Heterogeneous Effect Estimation via Statistically Efficient Categorical Modifiers.

Journal of the American Statistical Association·2026
Same journal

Nonparametric Density Estimation of a Long-Term Trend from Repeated Semicontinuous Data.

Journal of the American Statistical Association·2026
Same journal

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Clinicogenomic Data.

Journal of the American Statistical Association·2026
See all related articles

Related Experiment Video

Updated: May 9, 2026

Application of Unsupervised Multi-Omic Factor Analysis to Uncover Patterns of Variation and Molecular Processes Linked to Cardiovascular Disease
08:51

Application of Unsupervised Multi-Omic Factor Analysis to Uncover Patterns of Variation and Molecular Processes Linked to Cardiovascular Disease

Published on: September 20, 2024

Simplex Factor Models for Multivariate Unordered Categorical Data.

Anirban Bhattacharya1, David B Dunson

  • 1Department of Statistical Science, Duke University, NC 27708.

Journal of the American Statistical Association
|August 3, 2013
PubMed
Summary
This summary is machine-generated.

We introduce simplex factor models as a powerful alternative for analyzing unordered categorical data. These models offer flexible dependence structures and efficient computation, outperforming Gaussian latent factor models.

Keywords:
ClassificationContingency tableFactor analysisLatent variableMutual informationNonnegative tensor factorizationNonparametric BayesPolytomous regression

More Related Videos

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits
08:27

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Published on: September 27, 2019

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

Related Experiment Videos

Last Updated: May 9, 2026

Application of Unsupervised Multi-Omic Factor Analysis to Uncover Patterns of Variation and Molecular Processes Linked to Cardiovascular Disease
08:51

Application of Unsupervised Multi-Omic Factor Analysis to Uncover Patterns of Variation and Molecular Processes Linked to Cardiovascular Disease

Published on: September 20, 2024

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits
08:27

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Published on: September 27, 2019

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

Area of Science:

  • Statistics
  • Machine Learning
  • Computational Biology

Background:

  • Gaussian latent factor models are standard for continuous, binary, and ordered categorical data.
  • These models present computational challenges and complex structures for unordered categorical variables.

Purpose of the Study:

  • Propose a novel class of simplex factor models for unordered categorical data.
  • Develop a Bayesian approach with a scalable Markov chain Monte Carlo (MCMC) algorithm.
  • Address limitations of Gaussian models in high-dimensional categorical data analysis.

Main Methods:

  • Introduced simplex factor models for flexible dependence modeling.
  • Employed a Bayesian framework with an MCMC algorithm for computation and inference.
  • Developed an efficient proposal for updating base probability vectors in hierarchical Dirichlet models.

Main Results:

  • Simplex factor models effectively characterize flexible dependence structures with few factors.
  • The proposed MCMC algorithm demonstrates scalability with increasing data dimensions.
  • The model accurately approximates multivariate categorical data distributions as factors increase.

Conclusions:

  • Simplex factor models provide a computationally efficient and flexible alternative for unordered categorical data.
  • The approach is suitable for modeling dependencies in nucleotide sequences and high-dimensional categorical features.
  • This work advances statistical modeling for complex categorical data analysis.