Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...

Mechanistic Models: Compartment Models in Individual and Population Analysis

Mechanistic Models: Compartment Models in Individual and Population Analysis

Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least squares (OLS)...

Detection of Gross Error: The Q Test

Detection of Gross Error: The Q Test

When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

Pharmacokinetic Models: Comparison and Selection Criterion

Pharmacokinetic Models: Comparison and Selection Criterion

Physiological and compartmental models are valuable tools used in studying biological systems. These models rely on differential equations to maintain mass balance within the system, ensuring an accurate representation of the dynamic processes at play.
Physiological models take a detailed approach by considering specific molecular processes. They can predict drug distribution, metabolism, and elimination changes, providing a comprehensive understanding of how drugs interact with the body.

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a survival tree begins...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Statistics and AI - A Fireside Conversation.

Harvard data science review·2026

Same author

Cardiovascular-Kidney-Metabolic Syndrome: Conceptualising an Approach to Health Economic Modelling.

Diabetes, obesity & metabolism·2026

Same author

Artificial Intelligence in Image-Based Cardiovascular Disease Analysis.

Annual review of biomedical data science·2026

Same author

Multi-organ imaging and genetics show the impact of sleep patterns on the human brain and body.

Communications medicine·2026

Same author

Scalable subclonal reconstruction of cancer cells in DNA sequencing data using a penalized likelihood model.

bioRxiv : the preprint server for biology·2026

Same author

Connectome-based spatial statistics enabling large-scale population analyses of human connectome across cohorts.

bioRxiv : the preprint server for biology·2026

Same journal

Instrumental Variable Estimation of Marginal Structural Mean Models for Time-Varying Treatment.

Journal of the American Statistical Association·2026

Same journal

Semiparametric Joint Modeling for Survival Analysis with Longitudinal Covariates.

Journal of the American Statistical Association·2026

Same journal

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Association·2026

Same journal

Facilitating Heterogeneous Effect Estimation via Statistically Efficient Categorical Modifiers.

Journal of the American Statistical Association·2026

Same journal

Nonparametric Density Estimation of a Long-Term Trend from Repeated Semicontinuous Data.

Journal of the American Statistical Association·2026

Same journal

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Clinicogenomic Data.

Journal of the American Statistical Association·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 20, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Model Selection Criteria for Missing-Data Problems Using the EM Algorithm.

Joseph G Ibrahim¹, Hongtu Zhu, Niansheng Tang

¹Joseph G. Ibrahim is Alumni Distinguished Professor (E-mail: ibrahim@bios.unc.edu ), Department of Biostatistics, University of North Carolina, Chapel Hill.

Journal of the American Statistical Association

|August 21, 2009

Summary

This summary is machine-generated.

This study introduces novel information criteria (IC(H)(,)(Q)) for model selection with missing data using the EM algorithm. These criteria, including IC(H̃)((k)(),)(Q) and IC(Q), offer versatile and computationally efficient solutions for incomplete data problems.

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

A Workflow for Lipid Nanoparticle (LNP) Formulation Optimization using Designed Mixture-Process Experiments and Self-Validated Ensemble Models (SVEM)

A Workflow for Lipid Nanoparticle (LNP) Formulation Optimization using Designed Mixture-Process Experiments and Self-Validated Ensemble Models (SVEM)

Published on: August 18, 2023

Related Experiment Videos

Last Updated: Jun 20, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

A Workflow for Lipid Nanoparticle (LNP) Formulation Optimization using Designed Mixture-Process Experiments and Self-Validated Ensemble Models (SVEM)

A Workflow for Lipid Nanoparticle (LNP) Formulation Optimization using Designed Mixture-Process Experiments and Self-Validated Ensemble Models (SVEM)

Published on: August 18, 2023

Area of Science:

Statistics
Computational Statistics
Data Science

Background:

Missing data poses significant challenges in statistical modeling.
The Expectation-Maximization (EM) algorithm is a common approach for handling incomplete datasets.
Existing model selection criteria may not be directly applicable or computationally efficient in missing-data scenarios.

Purpose of the Study:

To develop novel, generalizable methods for computing model selection criteria in the presence of missing data.
To introduce a new class of information criteria, IC(H)(,)(Q), derived from EM algorithm outputs.
To propose computationally simplified approximations, IC(H̃)((k)(),)(Q) and IC(Q), for enhanced usability.

Main Methods:

Utilizing the output of the EM algorithm for maximum likelihood estimation.
Developing an analytic approximation for the H-function to derive IC(H)(,)(Q).
Proposing IC(Q) as a computationally simpler alternative dependent only on the Q-function of the EM algorithm.

Main Results:

The proposed IC(H)(,)(Q) framework encompasses established criteria like AIC and BIC.
Theoretical properties, including consistency, of IC(H̃)((k)(),)(Q) were rigorously investigated.
Simulations demonstrated the methodology's effectiveness and the performance of the proposed criteria in various missing-data settings.

Conclusions:

The developed information criteria provide robust and flexible tools for model selection with incomplete data.
IC(Q) offers a computationally advantageous alternative for practical applications.
The methodology is broadly applicable across diverse regression models and missing data mechanisms.