Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Longitudinal Studies01:26

Longitudinal Studies

156
Longitudinal studies are also widely used in other medical and social science fields. For instance, in cardiovascular research, they can monitor patients' health over decades to identify risk factors for heart disease, such as high cholesterol or smoking, and evaluate the long-term effectiveness of preventive measures. Similarly, in mental health studies, researchers might follow individuals from adolescence into adulthood to understand the development and progression of conditions like...
156
Longitudinal Research02:20

Longitudinal Research

11.9K
Sometimes we want to see how people change over time, as in studies of human development and lifespan. When we test the same group of individuals repeatedly over an extended period of time, we are conducting longitudinal research. Longitudinal research is a research design in which data-gathering is administered repeatedly over an extended period of time. For example, we may survey a group of individuals about their dietary habits at age 20, retest them a decade later at age 30, and then again...
11.9K
Factorial Design02:01

Factorial Design

13.0K
Factorial Analysis is an experimental design that applies Analysis of Variance (ANOVA) statistical procedures to examine a change in a dependent variable due to more than one independent variable, also known as factors. Changes in worker productivity can be reasoned, for example, to be influenced by salary and other conditions, such as skill level. One way to test this hypothesis is by categorizing salary into three levels (low, moderate, and high) and skills sets into two levels (entry level...
13.0K
Cluster Sampling Method01:20

Cluster Sampling Method

11.9K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
11.9K
Survival Tree01:19

Survival Tree

79
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
79
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

180
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
180

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Bayesian Nonparametric Common Atoms Regression for Generating Synthetic Controls in Clinical Trials.

Journal of the American Statistical Association·2026
Same author

Multilevel predictors of ultra-processed food intake in Canadian preschoolers.

Communications medicine·2026
Same author

A fast integrative clustering and feature selection approach for high-dimensional multiview data.

Statistical methods in medical research·2026
Same author

Inferring Covariance Structure from Multiple Data Sources via Subspace Factor Analysis.

Journal of the American Statistical Association·2025
Same author

Escaping The Curse of Dimensionality in Bayesian Model-Based Clustering.

Journal of machine learning research : JMLR·2025
Same author

Functional connectivity across the human subcortical auditory system using an autoregressive matrix-Gaussian copula graphical model approach with partial correlations.

Imaging neuroscience (Cambridge, Mass.)·2024
Same journal

Optimal Weighted Tests for Replication Studies and the 'Two-Trials Rule' With Multiple Hypotheses.

Statistics in medicine·2026
Same journal

Identifiable Copula-Double-Cox Models: A Fully Parametric Framework for Dependent Right-Censored Survival Data.

Statistics in medicine·2026
Same journal

Moving From Individualized Risk-Based Prevention to Benefit-Based Prevention: Estimating Individualized Life-Years Gained From Prevention Services as a Basis for Eligibility.

Statistics in medicine·2026
Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026
Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026
Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026
See all related articles

Related Experiment Video

Updated: Jun 23, 2025

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

6.3K

A sparse factor model for clustering high-dimensional longitudinal data.

Zihang Lu1,2, Noirrit Kiran Chandra3

  • 1Department of Public Health Sciences, Queen's University, Kingston, Ontario, Canada.

Statistics in Medicine
|June 17, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a novel Bayesian nonparametric mixture model for clustering complex, high-dimensional longitudinal data. The method effectively identifies disease patterns by clustering latent factors, overcoming common analytical challenges.

Keywords:
Bayesian nonparametric modelclusteringhigh dimensional datalongitudinal datamixture model

More Related Videos

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

1.2K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.5K

Related Experiment Videos

Last Updated: Jun 23, 2025

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

6.3K
Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

1.2K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.5K

Area of Science:

  • Biostatistics
  • Computational Biology
  • Data Science

Background:

  • Engineering advances generate large longitudinal datasets, offering insights into disease mechanisms.
  • Analyzing high-dimensional, heterogeneous longitudinal data presents significant computational challenges.

Purpose of the Study:

  • To propose a Bayesian nonparametric mixture model for clustering high-dimensional, mixed-type longitudinal features.
  • To address the curse of dimensionality by clustering at the latent factor level.

Main Methods:

  • A sparse factor model is applied to the joint distribution of random effects.
  • Clustering is induced at the latent factor level, not the original data.
  • A Dirichlet process prior estimates the number of clusters.
  • An efficient Gibbs sampler is used for posterior distribution estimation.

Main Results:

  • The proposed model effectively clusters high-dimensional longitudinal data with mixed feature types.
  • The approach overcomes the curse of dimensionality inherent in such datasets.
  • Analysis of real and simulated data validates the model's utility.

Conclusions:

  • The developed Bayesian nonparametric mixture model is a valuable tool for analyzing complex longitudinal data.
  • This method facilitates deeper investigation into disease mechanisms using high-dimensional feature data.