Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

889
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
889
Observational Studies01:11

Observational Studies

10.7K
Observational studies are a type of analytical study where researchers observe events without any interventions. In other words, the researcher does not influence the response variable or the experiment's outcome.
There are three types of observational studies – Prospective, retrospective, and cross-sectional.
Prospective Study
Prospective studies, also known as longitudinal or cohort studies, are carried out by collecting future data from groups sharing similar characteristics. One...
10.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Lower self-efficacy and lower PrEP stigma predict telePrEP preferences among HIV self-testers.

Journal of acquired immune deficiency syndromes (1999)·2026
Same author

Financial Incentives to Address Representativeness in Randomized Clinical Trials: A Secondary Analysis of the RETAIN Randomized Clinical Trial.

JAMA internal medicine·2026
Same author

Impact of empiric potassium supplementation on mortality, sudden cardiac arrest and stroke in furosemide initiators.

British journal of clinical pharmacology·2026
Same author

Clinician Perceptions of Therapeutic Inertia in Type 2 Diabetes Mellitus: A Qualitative Study.

Journal of the American College of Clinical Pharmacy : JACCP·2026
Same author

Risk Factors for Community Colonization With Extended-Spectrum Cephalosporin-Resistant Enterobacterales Among People Living With HIV in Botswana.

Open forum infectious diseases·2026
Same author

Linkage of Emergency Department Patients With Public Benefits Navigators via Text Messages: A Randomized Clinical Trial.

JAMA health forum·2026
Same journal

Where are cigarettes cheaper? Area-level and store-type differences in price in US cities.

Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco·2026
Same journal

"As for stigma? … I don't think I'd be going to my next-door neighbor saying 'I'm going for lung cancer screening'": a qualitative study of the potential impacts of stigma during lung cancer screening in Australia.

Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco·2026
Same journal

Rural E-Cigarette Use Higher than Urban Across Age Groups.

Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco·2026
Same journal

Effects of transcranial direct current stimulation on Nicotine Use Disorder: A systematic review and meta-analysis.

Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco·2026
Same journal

Sex-specific associations between tobacco use and youth mental health: prospective and cross-sectional findings from the LoRDIA Program.

Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco·2026
Same journal

Tobacco Cessation Treatment in Pregnancy: Insights from Florida Medicaid Claims Data.

Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco·2026
See all related articles

Related Experiment Video

Updated: Jan 12, 2026

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.0K

Training a Smoking Status Probabilistic Model Using Cotinine Levels in a Large Claims Database.

Dominique Medaglio1,2, Charles E Leonard1,2,3, Alisa J Stephens Shields1,2,3,4

  • 1Center for Pharmacoepidemiology Research and Training, Center for Clinical Epidemiology and Biostatistics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA.

Nicotine & Tobacco Research : Official Journal of the Society for Research on Nicotine and Tobacco
|October 31, 2025
PubMed
Summary
This summary is machine-generated.

A new probabilistic model accurately predicts smoking status in administrative claims data using cotinine values. This approach enhances epidemiological research by providing reliable smoking data where it was previously lacking.

More Related Videos

Machine Learning-Based Cough Tone Classification: Diagnostic Exploration of Chronic Obstructive Pulmonary Disease and Respiratory Tract Infections
06:22

Machine Learning-Based Cough Tone Classification: Diagnostic Exploration of Chronic Obstructive Pulmonary Disease and Respiratory Tract Infections

Published on: September 19, 2025

420
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.9K

Related Experiment Videos

Last Updated: Jan 12, 2026

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.0K
Machine Learning-Based Cough Tone Classification: Diagnostic Exploration of Chronic Obstructive Pulmonary Disease and Respiratory Tract Infections
06:22

Machine Learning-Based Cough Tone Classification: Diagnostic Exploration of Chronic Obstructive Pulmonary Disease and Respiratory Tract Infections

Published on: September 19, 2025

420
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.9K

Area of Science:

  • Epidemiology
  • Health Informatics
  • Biostatistics

Background:

  • Smoking status is a critical confounder in epidemiological studies.
  • Administrative claims data often lack comprehensive smoking status documentation.
  • Existing probabilistic models for smoking status typically rely on self-reported data.

Purpose of the Study:

  • To develop and validate a probabilistic model for predicting smoking status using cotinine values from a large claims database.
  • To leverage objective cotinine measurements for a more reliable proxy of smoking status in real-world data.

Main Methods:

  • Beneficiaries with cotinine measurements were included.
  • Current smokers were defined by specific cotinine thresholds (serum/plasma ≥5ng/mL, urine ≥30ng/mL).
  • A logistic regression model with stepwise forward selection was employed, using predictors from the year prior to cotinine assessment.

Main Results:

  • The model achieved an area under the receiver operating characteristic curve of 0.77 (95% CI: 0.75-0.78) with good calibration.
  • Key predictors included diagnosis codes for smoking and drug abuse, and medication counts.
  • The model demonstrated high specificity but lower sensitivity at probability cutoffs ≥ 0.2.

Conclusions:

  • A validated probabilistic model for smoking status in claims data was successfully developed using cotinine values.
  • The model, utilizing 26 predictors, offers a simplified approach for application in other claims databases.
  • External validation is recommended for future epidemiological research applications.