Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

37
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
37
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

50
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
50
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

361
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
361
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

177
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
177
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.3K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.3K
Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Reconciling fast Hepatitis B evolutionary rates with ancient co-divergence.

bioRxiv : the preprint server for biology·2026
Same author

Global approaches to infectious disease surveillance and modeling.

Nature medicine·2026
Same author

Loss function influence on hyperparameter optimization for observational healthcare prediction models.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

Multimodal prediction of visual improvement in diabetic macular edema using real-world electronic health records and optical coherence tomography images.

medRxiv : the preprint server for health sciences·2026
Same author

Parallel algorithms for phylogenetic inference under a structured coalescent approximation.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Real-world evidence for comparative safety of second-line antihyperglycemic agents in older adults with type 2 diabetes.

Nature communications·2026
Same journal

Digital divide in clinical and operational artificial intelligence adoption and implementation stages: US hospital diffusion patterns and AI deserts.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Extending the fundamental theorem of biomedical informatics: a proposal and illustrative examples.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Human factors methods for designing safe health information technology: what do the experts think?

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Equity-by-design for socially assistive robots as digital health tools.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Orchestrator multi-agent clinical decision support system for secondary headache diagnosis in primary care.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

CUI-Curate: a GraphRAG-based framework for automated clinical concept curation for NLP applications.

Journal of the American Medical Informatics Association : JAMIA·2026
See all related articles

Related Experiment Video

Updated: Jun 25, 2025

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K

Comparing penalization methods for linear models on large observational health data.

Egill A Fridgeirsson1, Ross Williams1, Peter Rijnbeek1

  • 1Department of Medical Informatics, Erasmus University Medical Center, 3015 GD Rotterdam, The Netherlands.

Journal of the American Medical Informatics Association : JAMIA
|May 20, 2024
PubMed
Summary
This summary is machine-generated.

L1 and ElasticNet regularization methods offer superior discrimination for healthcare predictions. L0-based methods like Iterative Hard Thresholding (IHT) and Broken Adaptive Ridge (BAR) provide simpler, more interpretable models with better calibration.

Keywords:
calibrationdiscriminationelectronic health recordslogistic regressionregularization

More Related Videos

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K

Related Experiment Videos

Last Updated: Jun 25, 2025

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K

Area of Science:

  • Machine Learning in Healthcare
  • Statistical Modeling
  • Predictive Analytics

Background:

  • Logistic regression is widely used for healthcare predictions.
  • Regularization techniques are crucial for optimizing model performance and preventing overfitting.
  • Evaluating various regularization methods is essential for selecting the most effective approach.

Purpose of the Study:

  • To compare the discrimination and calibration performance of different logistic regression regularization variants.
  • To assess the internal and external validation of these methods in healthcare prediction models.
  • To guide the selection of regularization techniques for improved predictive accuracy and interpretability.

Main Methods:

  • Utilized data from 5 US claims and electronic health record databases for major depressive disorder patient population.
  • Developed and externally validated logistic regression models using L1, L2, ElasticNet, Adaptive L1, Adaptive ElasticNet, Broken Adaptive Ridge (BAR), and Iterative Hard Thresholding (IHT).
  • Employed a 75%/25% train-test split and evaluated performance using discrimination (AUC) and calibration metrics, with statistical analysis via Friedman's test and critical difference diagrams.

Main Results:

  • L1 and ElasticNet regularization demonstrated superior internal and external discrimination performance.
  • BAR and IHT methods exhibited the best internal calibration, though no single method led in external calibration.
  • While IHT and BAR were slightly less discriminative, they significantly reduced model complexity and feature count compared to L1 and ElasticNet.

Conclusions:

  • L1 and ElasticNet provide the best discriminative performance for logistic regression in healthcare, ensuring robustness across internal and external validations.
  • L0-based methods (IHT, BAR) are advantageous for creating simpler, more interpretable models with enhanced parsimony and calibration.
  • The findings assist in choosing appropriate regularization techniques for healthcare prediction models, balancing predictive performance, model complexity, and interpretability.