Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...

Statistical Methods for Analyzing Epidemiological Data

Statistical Methods for Analyzing Epidemiological Data

Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:

Statistical Analysis: Overview

Statistical Analysis: Overview

When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...

Biostatistics: Overview

Biostatistics: Overview

Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

Mechanistic Models: Compartment Models in Individual and Population Analysis

Mechanistic Models: Compartment Models in Individual and Population Analysis

Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least squares (OLS)...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Long-term stability of language with remapping in patients with medically refractory epilepsy.

Journal of neurosurgery·2026

Same author

Comment on Roberts at al: Historical cosmetic talc consumption and incidence of mesothelioma in the United States. International Journal of Environmental Health Research, 35:4, 972-980.

International journal of environmental health research·2026

Same author

Bayesian Posterior Interval Calibration to Improve the Interpretability of Observational Studies.

Statistical analysis and data mining·2025

Same author

Effect of Abandoned Housing Interventions on Gun Violence, Perceptions of Safety, and Substance Use in Black Neighborhoods: A Citywide Cluster Randomized Trial.

JAMA internal medicine·2022

Same author

Evaluating bias control strategies in observational studies using frequentist model averaging.

Journal of biopharmaceutical statistics·2022

Same author

A Multicentered Randomized Controlled Trial Comparing the Effectiveness of Pain Treatment Communication Tools in Emergency Department Patients With Back or Kidney Stone Pain.

American journal of public health·2022

Same journal

Topology only pre-training: towards generalised multi-domain graph models.

Data mining and knowledge discovery·2026

Same journal

Detection and evaluation of clusters within sequential data.

Data mining and knowledge discovery·2025

Same journal

Universal representation learning for multivariate time series using the instance-level and cluster-level supervised contrastive learning.

Data mining and knowledge discovery·2025

Same journal

Missing value replacement in strings and applications.

Data mining and knowledge discovery·2025

Same journal

Robust explainer recommendation for time series classification.

Data mining and knowledge discovery·2024

Same journal

Somtimes: self organizing maps for time series clustering and its application to serious illness conversations.

Data mining and knowledge discovery·2024

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 19, 2026

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

A Sequential Monte Carlo Method for Bayesian Analysis of Massive Datasets.

Greg Ridgeway¹, David Madigan

¹RAND, PO Box 2138, Santa Monica, CA 90407-2138, gregr@rand.org.

Data Mining and Knowledge Discovery

|October 1, 2009

Summary

This summary is machine-generated.

This study introduces a novel algorithm for Bayesian analysis of massive datasets, significantly reducing data access needs. The method enhances computational feasibility for large-scale data mining while maintaining estimation efficiency.

More Related Videos

A User-friendly and Powerful R Analysis of Large-scale Datasets

A User-friendly and Powerful R Analysis of Large-scale Datasets

Published on: November 4, 2025

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

Related Experiment Videos

Last Updated: Jun 19, 2026

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

A User-friendly and Powerful R Analysis of Large-scale Datasets

A User-friendly and Powerful R Analysis of Large-scale Datasets

Published on: November 4, 2025

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

Area of Science:

Statistical computing
Bayesian inference
Data mining

Background:

Markov chain Monte Carlo (MCMC) methods are computationally intensive for large datasets.
Current MCMC techniques require full dataset scans per iteration, limiting their use in data mining.
Massive datasets necessitate scalable and statistically sound analytical methods.

Purpose of the Study:

To develop a computationally feasible method for Bayesian analysis of massive datasets.
To adapt MCMC techniques for large-scale data mining applications.
To reduce the computational burden of Bayesian inference on large data.

Main Methods:

Simulating posterior distributions from a subset of data.
Incorporating remaining data via importance sampling with reweighting.
Utilizing a rejuvenation step from particle filters to maintain estimation efficiency.

Main Results:

Demonstrated proof-of-concept on mixture transition models and Bayesian logistic regression.
Achieved a 99% reduction in data accesses for mixture models without loss of efficiency.
Achieved a 98% reduction in data accesses for Bayesian logistic regression.

Conclusions:

The proposed method makes Bayesian analysis computationally feasible for massive datasets.
The algorithm significantly reduces data access requirements for large-scale statistical modeling.
This approach offers a scalable solution for applying Bayesian methods in data mining.