Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

359
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
359
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

1.3K
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
1.3K
Applications of Life Tables01:22

Applications of Life Tables

426
Life tables are versatile across various fields, providing a quantitative basis for analyzing mortality and survival rates. Whether used by demographers, actuaries, epidemiologists, or sociologists, life tables offer valuable insights into the dynamics of life and death, facilitating informed decisions in public health, insurance, conservation, and beyond. Their broad applicability highlights the interconnectedness of demographic data with practical outcomes in everyday life and strategic...
426
Probability Laws01:49

Probability Laws

29.7K
Overview
29.7K
Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K
Confounding in Epidemiological Studies01:27

Confounding in Epidemiological Studies

1.1K
Confounding in statistical epidemiology represents a pivotal challenge, referring to the distortion in the perceived relationship between an exposure and an outcome due to the presence of a third variable, known as a confounder. This variable is associated with both the exposure and the outcome but is not a direct link in their causal chain. Its presence can lead to erroneous interpretations of the exposure's effect, either exaggerating or underestimating the true association. This...
1.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Trends of Fentanyl and Oxycodone Use in Australia (2018-2022) in National Prescription and Wastewater Data Sets.

Environmental science & technology·2026
Same author

Impact of bariatric surgery on alcohol-use disorder and alcohol-related liver disease: A retrospective study.

Addiction (Abingdon, England)·2026
Same author

A Clinical Practice Update for the Management of Patients With Disorders of Gut-Brain Interaction and Altered Food Intake Behavior.

Neurogastroenterology and motility·2026
Same author

Patterns and Correlates of Disposable E-Cigarette Use Among U.S. Youth.

American journal of preventive medicine·2026
Same author

Socially Minded Intelligence: How Individuals, Groups, and Artificial Intelligence Can Make Each Other Smarter (or Not).

Personality and social psychology review : an official journal of the Society for Personality and Social Psychology, Inc·2026
Same author

Severity of Polysubstance Use Among Young People Seeking Drug and Alcohol Treatment, and Social and Mental Health Correlates: An Observational Study.

Substance use & misuse·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Apr 28, 2026

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

8.7K

Detecting contaminated birthdates using generalized additive models.

Wei Luo1, Marcus Gallagher, Bill Loveday

  • 1Centre for Pattern Recognition and Data Analytics, Deakin University, Geelong, Australia. wei.luo@deakin.edu.au.

BMC Bioinformatics
|June 14, 2014
PubMed
Summary
This summary is machine-generated.

This study introduces a statistically driven method to detect erroneous patient birthdates in health databases. The approach accurately identifies incorrect birthdates, improving data quality and reducing manual verification needs.

More Related Videos

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data
10:46

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

10.3K
Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

5.8K

Related Experiment Videos

Last Updated: Apr 28, 2026

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

8.7K
A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data
10:46

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

10.3K
Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

5.8K

Area of Science:

  • Health Informatics
  • Data Quality Management
  • Statistical Modeling

Background:

  • Erroneous patient birthdates are a frequent and significant issue in health databases.
  • Manual verification of birthdate errors is resource-intensive and often impractical.
  • Developing automated methods for detecting these errors is crucial for data integrity.

Purpose of the Study:

  • To present a statistically driven procedure for identifying erroneous patient birthdates.
  • To offer a more efficient and accurate alternative to manual error detection.
  • To address the common data quality problem of inaccurate birthdates in healthcare systems.

Main Methods:

  • Utilized Generalized Additive Models (GAM) to incorporate demographic trends and birth patterns.
  • Developed a principled, statistically driven procedure for error detection.
  • Controlled for false positive rates to ensure high accuracy.

Main Results:

  • The GAM-based method identified 51 out of 58 actual incorrect birthdates (86.0% positive predictive value).
  • Achieved a low false negative rate of 12.0% (7 out of 58 errors missed).
  • Outperformed traditional linear time-series models in accuracy.

Conclusions:

  • The GAM-based method is a highly accurate and effective approach for identifying systemic birthdate errors.
  • This method offers a practical solution for improving data quality in clinical and administrative databases.
  • Automated detection of birthdate errors enhances the reliability of health data.