Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
Statistical Analysis: Overview01:11

Statistical Analysis: Overview

When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
Biostatistics: Overview01:20

Biostatistics: Overview

Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least squares (OLS)...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Long-term stability of language with remapping in patients with medically refractory epilepsy.

Journal of neurosurgery·2026
Same author

Comment on Roberts at al: Historical cosmetic talc consumption and incidence of mesothelioma in the United States. International Journal of Environmental Health Research, 35:4, 972-980.

International journal of environmental health research·2026
Same author

Bayesian Posterior Interval Calibration to Improve the Interpretability of Observational Studies.

Statistical analysis and data mining·2025
Same author

Effect of Abandoned Housing Interventions on Gun Violence, Perceptions of Safety, and Substance Use in Black Neighborhoods: A Citywide Cluster Randomized Trial.

JAMA internal medicine·2022
Same author

Evaluating bias control strategies in observational studies using frequentist model averaging.

Journal of biopharmaceutical statistics·2022
Same author

A Multicentered Randomized Controlled Trial Comparing the Effectiveness of Pain Treatment Communication Tools in Emergency Department Patients With Back or Kidney Stone Pain.

American journal of public health·2022
Same journal

Topology only pre-training: towards generalised multi-domain graph models.

Data mining and knowledge discovery·2026
Same journal

Detection and evaluation of clusters within sequential data.

Data mining and knowledge discovery·2025
Same journal

Universal representation learning for multivariate time series using the instance-level and cluster-level supervised contrastive learning.

Data mining and knowledge discovery·2025
Same journal

Missing value replacement in strings and applications.

Data mining and knowledge discovery·2025
Same journal

Robust explainer recommendation for time series classification.

Data mining and knowledge discovery·2024
Same journal

Somtimes: self organizing maps for time series clustering and its application to serious illness conversations.

Data mining and knowledge discovery·2024
See all related articles

Related Experiment Video

Updated: Jun 19, 2026

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

A Sequential Monte Carlo Method for Bayesian Analysis of Massive Datasets.

Greg Ridgeway1, David Madigan

  • 1RAND, PO Box 2138, Santa Monica, CA 90407-2138, gregr@rand.org.

Data Mining and Knowledge Discovery
|October 1, 2009
PubMed
Summary
This summary is machine-generated.

This study introduces a novel algorithm for Bayesian analysis of massive datasets, significantly reducing data access needs. The method enhances computational feasibility for large-scale data mining while maintaining estimation efficiency.

More Related Videos

A User-friendly and Powerful R Analysis of Large-scale Datasets
10:56

A User-friendly and Powerful R Analysis of Large-scale Datasets

Published on: November 4, 2025

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

Related Experiment Videos

Last Updated: Jun 19, 2026

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

A User-friendly and Powerful R Analysis of Large-scale Datasets
10:56

A User-friendly and Powerful R Analysis of Large-scale Datasets

Published on: November 4, 2025

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

Area of Science:

  • Statistical computing
  • Bayesian inference
  • Data mining

Background:

  • Markov chain Monte Carlo (MCMC) methods are computationally intensive for large datasets.
  • Current MCMC techniques require full dataset scans per iteration, limiting their use in data mining.
  • Massive datasets necessitate scalable and statistically sound analytical methods.

Purpose of the Study:

  • To develop a computationally feasible method for Bayesian analysis of massive datasets.
  • To adapt MCMC techniques for large-scale data mining applications.
  • To reduce the computational burden of Bayesian inference on large data.

Main Methods:

  • Simulating posterior distributions from a subset of data.
  • Incorporating remaining data via importance sampling with reweighting.
  • Utilizing a rejuvenation step from particle filters to maintain estimation efficiency.

Main Results:

  • Demonstrated proof-of-concept on mixture transition models and Bayesian logistic regression.
  • Achieved a 99% reduction in data accesses for mixture models without loss of efficiency.
  • Achieved a 98% reduction in data accesses for Bayesian logistic regression.

Conclusions:

  • The proposed method makes Bayesian analysis computationally feasible for massive datasets.
  • The algorithm significantly reduces data access requirements for large-scale statistical modeling.
  • This approach offers a scalable solution for applying Bayesian methods in data mining.