Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Statistical Analysis: Overview

Statistical Analysis: Overview

When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...

One-Way ANOVA: Unequal Sample Sizes

One-Way ANOVA: Unequal Sample Sizes

One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:

One-Way ANOVA: Equal Sample Sizes

One-Way ANOVA: Equal Sample Sizes

One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...

Comparing the Survival Analysis of Two or More Groups

Comparing the Survival Analysis of Two or More Groups

Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...

Friedman Two-way Analysis of Variance by Ranks

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

scSketch: Interactive Sketch-based Trajectory Exploration and Pathway-Aware Analysis of Single-Cell Data.

bioRxiv : the preprint server for biology·2026

Same author

Separating selection from mutation in antibody language models.

eLife·2026

Same author

Interactive visualization of metric distortion in nonlinear data embeddings using the distortions package.

Briefings in bioinformatics·2026

Same author

Separating selection from mutation in antibody language models.

bioRxiv : the preprint server for biology·2025

Same author

Microbiota effects and predictors of <i>Lactobacillus crispatu</i>s colonization after treatment with a vaginal live biotherapeutic: results from a randomized, double-blinded, placebo-controlled trial.

medRxiv : the preprint server for health sciences·2025

Same author

Thrifty wide-context models of B cell receptor somatic hypermutation.

eLife·2025

Same journal

A Bayesian functional concurrent zero-inflated Dirichlet-multinomial regression model with application to infant microbiome.

Biostatistics (Oxford, England)·2026

Same journal

Towards optimal environmental policies: policy learning under arbitrary bipartite network interference.

Biostatistics (Oxford, England)·2026

Same journal

Multilevel functional quantile principal component analysis.

Biostatistics (Oxford, England)·2026

Same journal

Adaptive transfer learning for time-to-event modeling with applications in disease risk assessment.

Biostatistics (Oxford, England)·2026

Same journal

High-dimensional test for one-sided hypotheses.

Biostatistics (Oxford, England)·2026

Same journal

NBSR: a Negative Binomial Softmax Regression model for microRNA-seq data analysis.

Biostatistics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 21, 2025

Author Spotlight: Alignment of Synchronized Time-Series Data Using the Characterizing Loss of Cell Cycle Synchrony Model for Cross-Experiment Comparisons

Author Spotlight: Alignment of Synchronized Time-Series Data Using the Characterizing Loss of Cell Cycle Synchrony Model for Cross-Experiment Comparisons

Published on: June 9, 2023

Multiscale analysis of count data through topic alignment.

Julia Fukuyama¹, Kris Sankaran², Laura Symul³

¹Department of Statistics, Indiana University Bloomington, 919 E 10th Street, Bloomington, IN 47408, USA.

Biostatistics (Oxford, England)

|June 3, 2022

Summary

This summary is machine-generated.

Topic modeling helps analyze biological data, but choosing the number of topics (K) is challenging. Our new topic alignment method reveals consistent patterns across different K values, offering deeper biological insights.

Keywords:

Community analysis Microbiota Mixed membership models Multiresolution Topic model

More Related Videos

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

Related Experiment Videos

Last Updated: Sep 21, 2025

Author Spotlight: Alignment of Synchronized Time-Series Data Using the Characterizing Loss of Cell Cycle Synchrony Model for Cross-Experiment Comparisons

Author Spotlight: Alignment of Synchronized Time-Series Data Using the Characterizing Loss of Cell Cycle Synchrony Model for Cross-Experiment Comparisons

Published on: June 9, 2023

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

Area of Science:

Bioinformatics
Computational Biology
Statistical Genomics

Background:

Topic modeling is widely used for analyzing biological count data.
Selecting the optimal number of topics (K) for topic models is a significant challenge in data analysis.
A definitive method for choosing K is lacking, and a true optimal value may not exist.

Purpose of the Study:

To develop a novel method, termed topic alignment, for studying relationships between topic models with varying numbers of topics (K).
To introduce three new diagnostics based on topic alignment to assess topic consistency and evolution.
To provide a more insightful approach to biological data interpretation than selecting a single K value.

Main Methods:

Topic alignment: a method to compare topic models with different K values.
Development of three alignment-based diagnostics to identify consistent, transient, or splitting topics.
Visual representation of cross-model topic relationships.
Application to simulated and real biological count data.
Release of the 'alto' R package for implementing these methods.

Main Results:

Topic alignment effectively visualizes relationships between models with different K.
The diagnostics successfully identify topics that are consistently present, transient, or split as K increases.
The approach provides enhanced biological insights into data-generating processes.
Demonstrated effectiveness on both simulated and real biological datasets.

Conclusions:

Topic alignment offers a robust framework for exploring topic model structures across different K values.
The developed diagnostics enhance the interpretability of topic models in biological data analysis.
This strategy provides a more comprehensive understanding of biological count data compared to single-model approaches.