Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Steps in Outbreak Investigation01:18

Steps in Outbreak Investigation

185
In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:
185
Classification of Illness01:17

Classification of Illness

7.9K
The meaning of illness is individualized to each person who experiences an alteration in health. In contrast, disease is a medical term indicating a pathological change in the structure and function of the body or mind. It is a condition that has specific symptoms and boundaries.
An illness is a response to a disease in which the person's level of functioning is changed compared with a previous level. The general classification of illness includes acute and chronic.
Acute illness is severe...
7.9K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.8K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Prevalence and Clinical Impact of Pathogenic Variants in Cardiomyopathy Genes Among Individuals with Cardiac Conduction Disorders.

medRxiv : the preprint server for health sciences·2026
Same author

A comparison of Fast Healthcare Interoperability Resources and Observational Medical Mutcomes Partnership electronic health record data within the All of Us Research Program.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

Implementing trust in non-small cell lung cancer diagnosis with a conformalized uncertainty-aware AI framework.

Nature biomedical engineering·2026
Same author

Generating synthetic multi-national longitudinal cohorts for clinically grounded HIV research.

Nature communications·2026
Same author

Critical illness expands a transcriptionally distinct hypometabolic CD8 <sup>+</sup> T effector program associated with respiratory failure and mortality.

bioRxiv : the preprint server for biology·2026
Same author

Evaluating Large Language Models for Translating Multimodal Phenotype Documentations into Executable EHR Phenotyping Algorithms.

medRxiv : the preprint server for health sciences·2026
Same journal

HIV Transmission Dynamics in Greater Mexico City are Shaped by Dense Spatial Mixing.

Research square·2026
Same journal

A UCP1-IRES-Cre Knock-In Mouse Enables Specific Brown Adipocyte Targeting Without CNS Off-Target Expression.

Research square·2026
Same journal

Precision RNAi for Fibrodysplasia Ossificans Progressiva: a combinatorial, unimolecular, allele selective approach.

Research square·2026
Same journal

Perceptions of end-of-life care quality among bereaved closest contacts of community-dwelling older Australians: a cross-sectional survey of the ASPREE cohort.

Research square·2026
Same journal

Heavy-chain immune repertoire sequencing enables language-model prediction of antigen-specific antibodies.

Research square·2026
Same journal

25+ Years of TRPV4: From Discovery to Translational Horizons.

Research square·2026
See all related articles

Related Experiment Video

Updated: Sep 9, 2025

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

603

Multi-scale Data Improves Performance of Machine Learning Model for Long COVID Prediction.

Wei-Qi Wei1, Christopher Guardo1, Xinmeng Zhang1

  • 1Vanderbilt University Medical Center.

Research Square
|September 5, 2025
PubMed
Summary
This summary is machine-generated.

Integrating diverse data improves long COVID prediction. Combining electronic health records with social, behavioral, and genetic factors enhances risk assessment for SARS-CoV-2 survivors.

More Related Videos

Asthma Detection Research Based on Voice Signal Processing and Machine Learning
04:04

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

405
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.8K

Related Experiment Videos

Last Updated: Sep 9, 2025

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

603
Asthma Detection Research Based on Voice Signal Processing and Machine Learning
04:04

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

405
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.8K

Area of Science:

  • Epidemiology
  • Genetics
  • Health Informatics

Background:

  • Long COVID impacts a significant number of SARS-CoV-2 infected individuals globally.
  • Current predictive models for long COVID risk are limited, often relying solely on electronic health record (EHR) data.
  • Social, behavioral, and genetic factors are increasingly recognized as potential contributors to long COVID development.

Purpose of the Study:

  • To investigate if integrating EHR data with survey-based and genomic information improves the predictive performance of long COVID risk models.
  • To identify key social, behavioral, and genetic predictors of long COVID.
  • To enhance risk stratification for personalized long COVID interventions.

Main Methods:

  • Utilized a diverse cohort of over 17,200 SARS-CoV-2 infected individuals from the NIH All of Us Research Program.
  • Employed a multi-scale data integration approach, combining EHR data with survey responses and genomic information.
  • Compared the performance of integrated models against EHR-only models using Area Under the Receiver Operating Characteristic Curve (AUROC).

Main Results:

  • The integrated multi-scale model demonstrated improved predictive performance compared to EHR-only models, with an AUROC of 0.748 (95% CI: 0.741, 0.755) versus 0.736 (95% CI: 0.730, 0.741).
  • Key predictors identified included active-duty service status, self-reported fatigue, and a specific genetic variant (chr19:4719431:G:A_A).
  • These findings underscore the value of multi-modal data in predicting long COVID.

Conclusions:

  • Integrating electronic health records with social, behavioral, and genetic data significantly enhances the prediction of long COVID risk.
  • Factors such as military service, fatigue, and specific genetic markers are important predictors.
  • This multi-scale approach offers a pathway for improved risk stratification and personalized interventions for long COVID management.