Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Quadratic Models01:23

Quadratic Models

Quadratic models are mathematical representations used to describe relationships in which the rate of change changes at a constant rate. These models appear in a wide variety of natural and engineered systems, especially those involving motion, forces, and optimization. One common application is analyzing the vertical motion of objects influenced by gravity, such as a ball thrown into the air.In such scenarios, the object's height changes over time in a curved pattern, rising to a maximum point...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Empirical Bayes Covariance Decomposition, and a Solution to the Multiple Tuning Problem in Sparse PCA.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same author

Transient Receptor Potential Channels in Modulating Synaptic Plasticity and Cognitive Function: Potential Role of Pharmacological Agents.

Basic & clinical pharmacology & toxicology·2026
Same author

Generation of aptamers for the selective detection and neutralization of soluble lymphotoxin alpha.

Molecular therapy. Nucleic acids·2026
Same author

Differential disease tolerance mediates sex-biased illness severity in sepsis.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Integrating multi-omics and multi-context QTL data with GWAS reveals the genetic architecture of complex traits and improves the discovery of risk genes.

medRxiv : the preprint server for health sciences·2026
Same author

Fast and flexible joint fine-mapping of multiple traits via the Sum of Single Effects model.

Nature genetics·2026
Same journal

Neural posterior estimation on exponential random graph models: evaluating bias and implementation challenges.

Statistics and computing·2026
Same journal

Subgroup Analysis of Differential Networks with Latent Variables.

Statistics and computing·2026
Same journal

Approximating evidence via bounded harmonic means.

Statistics and computing·2026
Same journal

Efficient Inference in First Passage Time Models.

Statistics and computing·2026
Same journal

Optimal <i>F</i>-score Matching for Bipartite Record Linkage.

Statistics and computing·2026
Same journal

Accelerated inference for stochastic compartmental models with over-dispersed partial observations.

Statistics and computing·2026
See all related articles

Related Experiment Videos

Non-negative matrix factorization algorithms generally improve topic model fits.

Peter Carbonetto1, Abhishek Sarkar1,2, Zihao Wang3

  • 1Department of Human Genetics, University of Chicago, Chicago, IL USA.

Statistics and Computing
|May 8, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces efficient topic modeling by leveraging non-negative matrix factorization (NMF) optimization. These new methods significantly improve fitting speed and accuracy for large datasets.

Keywords:
Expectation maximizationMaximum-likelihood estimationNon-negative matrix factorizationNonconvex optimizationTopic models

Related Experiment Videos

Area of Science:

  • Computational statistics
  • Machine learning
  • Natural language processing

Background:

  • Topic models are essential for analyzing large text datasets.
  • Maximum-likelihood estimation (MLE) is a standard approach for fitting topic models.
  • Existing MLE methods can be computationally intensive for large datasets.

Purpose of the Study:

  • To develop faster and more accurate topic modeling methods for large datasets.
  • To formally connect topic modeling MLE with non-negative matrix factorization (NMF).
  • To leverage advances in NMF optimization for topic model fitting.

Main Methods:

  • Revisiting maximum-likelihood estimation in topic models.
  • Establishing a formal connection between topic model MLE and non-negative matrix factorization (NMF).
  • Applying recent NMF optimization techniques to fit topic models.

Main Results:

  • Demonstrated that advances in NMF optimization can efficiently fit topic models.
  • Achieved significantly better fits in less time compared to existing topic modeling algorithms.
  • Showed that the Expectation-Maximization (EM) algorithm for topic models is equivalent to classic NMF multiplicative updates.

Conclusions:

  • Novel methods based on NMF provide a highly efficient approach for topic modeling.
  • The "fastTopics" R package implements these advanced NMF-based topic modeling techniques.
  • This work bridges the gap between NMF and topic modeling, offering practical benefits for large-scale text analysis.