Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

72
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
72
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

150
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
150
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

89
Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
89
Statistical Analysis: Overview01:11

Statistical Analysis: Overview

6.6K
When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
6.6K
Column Efficiency: Rate Theory01:12

Column Efficiency: Rate Theory

373
The rate theory of chromatography provides quantitative insight into the shapes and widths of elution bands. These bands are based on the random-walk mechanism governing molecular migration within a column. The Gaussian profile of chromatographic bands arises from the cumulative effect of random molecular motions as they progress through the column.
During elution, a solute molecule experiences numerous transitions between stationary and mobile phases, exhibiting irregular residence times in...
373
Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

64
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
64

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Research progress and prospects of nutrient delivery systems in food design for the elderly.

Critical reviews in food science and nutrition·2026
Same author

Postnatal Maxillofacial "Developing" Decellularized Extracellular Matrix Orchestrates Hierarchical Cross-Organ Regeneration via Macrophage Integrin αvβ5-Mediated Efferocytosis-Driven Developmental Recapitulation.

Research (Washington, D.C.)·2026
Same author

AI-powered TargetMap: Enabling system-level target discovery through full-path reasoning on a unified knowledge graph.

iScience·2026
Same author

Chemically Reprogramming Liquid Metals With Polyphenols for a Self-Reinforcing Assault on Biofilms.

Small (Weinheim an der Bergstrasse, Germany)·2026
Same author

AI approaches for predicting progression to acute coronary syndrome among stable coronary heart disease patients.

NPJ cardiovascular health·2026
Same author

Noninvasive evaluation and clinical value prediction of tumor-infiltrating neutrophil-to-T-cell ratio in pancreatic ductal adenocarcinoma.

NPJ digital medicine·2026
Same journal

Instrumental Variable Estimation of Marginal Structural Mean Models for Time-Varying Treatment.

Journal of the American Statistical Association·2026
Same journal

Semiparametric Joint Modeling for Survival Analysis with Longitudinal Covariates.

Journal of the American Statistical Association·2026
Same journal

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Association·2026
Same journal

Facilitating Heterogeneous Effect Estimation via Statistically Efficient Categorical Modifiers.

Journal of the American Statistical Association·2026
Same journal

Nonparametric Density Estimation of a Long-Term Trend from Repeated Semicontinuous Data.

Journal of the American Statistical Association·2026
Same journal

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Clinicogenomic Data.

Journal of the American Statistical Association·2026
See all related articles

Related Experiment Video

Updated: Jul 15, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K

Sparse Topic Modeling: Computational Efficiency, Near-Optimal Algorithms, and Statistical Inference.

Ruijia Wu1, Linjun Zhang2, T Tony Cai1

  • 1Department of Statistics and Data Science, University of Pennsylvania.

Journal of the American Statistical Association
|September 29, 2023
PubMed
Summary
This summary is machine-generated.

New algorithms for sparse topic modeling using probabilistic latent semantic indexing (pLSI) are computationally efficient and accurate. These methods offer optimal performance for estimating word-topic and topic-document matrices, outperforming existing approaches.

Keywords:
Topic modelingconfidence intervalshigh-dimensional statisticsmatrix factorizationsparsity

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.6K
3D Modeling of Dendritic Spines with Synaptic Plasticity
07:13

3D Modeling of Dendritic Spines with Synaptic Plasticity

Published on: May 18, 2020

6.9K

Related Experiment Videos

Last Updated: Jul 15, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.6K
3D Modeling of Dendritic Spines with Synaptic Plasticity
07:13

3D Modeling of Dendritic Spines with Synaptic Plasticity

Published on: May 18, 2020

6.9K

Area of Science:

  • Computational statistics
  • Machine learning
  • Natural language processing

Background:

  • Probabilistic latent semantic indexing (pLSI) is a foundational model for topic modeling.
  • Existing methods for sparse pLSI often face computational challenges and limitations in accuracy.
  • Efficient estimation and inference for sparse topic models are crucial for large-scale text analysis.

Purpose of the Study:

  • To develop novel, computationally fast algorithms for sparse topic modeling under the pLSI framework.
  • To theoretically analyze the properties of the proposed estimation and inference algorithms.
  • To provide a method for constructing valid confidence intervals for model parameters.

Main Methods:

  • Development of new algorithms for estimating word-topic and topic-document matrices in sparse pLSI.
  • Theoretical investigation involving minimax upper and lower bounds to establish rate-optimality.
  • Introduction of a refitting algorithm to ensure asymptotic normality and enable confidence interval construction.
  • Empirical validation through simulation studies and analysis of the COVID-19 Open Research Dataset (CORD-19).

Main Results:

  • Proposed algorithms demonstrate computational efficiency and high accuracy in estimating sparse topic models.
  • Theoretical analysis confirms the rate-optimality of the algorithms, up to a logarithmic factor.
  • Simulation studies show superior numerical performance and accuracy compared to existing literature.
  • Successful application to the CORD-19 dataset, demonstrating practical utility.

Conclusions:

  • The novel algorithms offer significant improvements in speed and accuracy for sparse topic modeling with pLSI.
  • The theoretical guarantees and practical performance validate the proposed methods for text data analysis.
  • These advancements facilitate more robust and reliable insights from large text corpora, including scientific literature.