Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Prediction Intervals01:03

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
The...
Polygenic Traits01:18

Polygenic Traits

When more than one gene is responsible for a given phenotype, the trait is considered polygenic. Human height is a polygenic trait. Studies have uncovered hundreds of loci that influence height, and there are believed to be many more. Due to the high number of genes involved, as well as environmental and nutritional factors, height varies significantly within a given population. The distribution of height forms a bell-shaped curve, with relatively few individuals in the population at the...
Polygenic Traits01:18

Polygenic Traits

When more than one gene is responsible for a given phenotype, the trait is considered polygenic. Human height is a polygenic trait. Studies have uncovered hundreds of loci that influence height, and there are believed to be many more. Due to the high number of genes involved, as well as environmental and nutritional factors, height varies significantly within a given population. The distribution of height forms a bell-shaped curve, with relatively few individuals in the population at the...
Calibration Curves: Correlation Coefficient01:10

Calibration Curves: Correlation Coefficient

In a linear calibration curve, there is a value called the calibration coefficient, denoted by 'r,' which measures the strength and the direction of association between two variables. The correlation coefficient value ranges from −1 to +1. A value of +1 indicates a perfect positive linear correlation, −1 denotes a perfect negative correlation, and 0 implies no correlation between the two variables. A positive correlation value establishes that as one variable increases, the other increases, and...
Calibration Curves: Linear Least Squares01:20

Calibration Curves: Linear Least Squares

A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...
Confidence Intervals01:21

Confidence Intervals

An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a sample proportion. However, unlike the point estimate which is a single value, the confidence interval contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A confidence...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A unified framework enables accessible deployment and comprehensive benchmarking of single-cell foundation models.

bioRxiv : the preprint server for biology·2026
Same author

Decoding multilevel relationships with the human tissue-cell-molecule network.

Briefings in bioinformatics·2022
Same author

Inhalation airflow and ventilation efficiency in subject-specific human upper airways.

Respiratory physiology & neurobiology·2020
Same author

Adverse Effects of Low-Dose Methotrexate in a Randomized Double-Blind Placebo-Controlled Trial: Adjudicated Hematologic and Skin Cancer Outcomes in the Cardiovascular Inflammation Reduction Trial.

ACR open rheumatology·2020
Same author

Preparation and <i>in Vitro</i> Antitumor Study of Two-Dimensional Muscovite Nanosheets.

Langmuir : the ACS journal of surfaces and colloids·2020
Same author

Identification and Bioinformatic Assessment of circRNA Expression After <i>RMI1</i> Knockdown and Ionizing Radiation Exposure.

DNA and cell biology·2020
Same journal

Comparative Evaluation of Pretrained Large Language Models for Suicide Risk Prediction from Clinical Notes in U.S. Veterans.

medRxiv : the preprint server for health sciences·2026
Same journal

Nocturnal Respiratory Rate and Variability Predict Long-term Mortality in Stable Outpatients with Cardiovascular Disease.

medRxiv : the preprint server for health sciences·2026
Same journal

MOSAIC: Methylation-Oriented Site Analysis and Information Classifier for Robust Epigenomic Classification of Acute Leukemia in Clinical Cohorts with Variable Tumor Purity.

medRxiv : the preprint server for health sciences·2026
Same journal

Risk beliefs, intensive digital information and demand for a new preventative health product in public clinics: Evidence from an experiment in Zimbabwe.

medRxiv : the preprint server for health sciences·2026
Same journal

Development of an automated, imaging-based preoperative screening model for early identification of malnutrition in an abdominal surgery cohort.

medRxiv : the preprint server for health sciences·2026
Same journal

A Pilot Project Leveraging Large Language Models for Automated Screening and Variable Extraction in Observational Studies.

medRxiv : the preprint server for health sciences·2026
See all related articles

Related Experiment Video

Updated: Jun 5, 2026

Large-Scale Multi-Omics Genome-Wide Association Studies (Mo-GWAS): Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies (Mo-GWAS): Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

Calibrated Prediction Intervals for Polygenic Scores: Updated Comparisons, Contextual Calibration, and Data

Chang Xu, Siyu Hou, Xiang Zhou

    Medrxiv : the Preprint Server for Health Sciences
    |June 4, 2026
    PubMed
    Summary
    This summary is machine-generated.

    Calibrated prediction intervals for polygenic scores (PGS) are crucial for genomic medicine. Phenotype normalization and data preprocessing are key for accurate uncertainty quantification and improved polygenic score accuracy.

    More Related Videos

    An R-Based Landscape Validation of a Competing Risk Model
    05:37

    An R-Based Landscape Validation of a Competing Risk Model

    Published on: September 16, 2022

    Related Experiment Videos

    Last Updated: Jun 5, 2026

    Large-Scale Multi-Omics Genome-Wide Association Studies (Mo-GWAS): Guidelines for Sample Preparation and Normalization
    08:27

    Large-Scale Multi-Omics Genome-Wide Association Studies (Mo-GWAS): Guidelines for Sample Preparation and Normalization

    Published on: July 27, 2021

    An R-Based Landscape Validation of a Competing Risk Model
    05:37

    An R-Based Landscape Validation of a Competing Risk Model

    Published on: September 16, 2022

    Area of Science:

    • Genomics
    • Biostatistics
    • Precision Medicine

    Background:

    • Calibrated prediction intervals for polygenic scores (PGS) are vital for communicating individual-level uncertainty in genomic medicine.
    • Comparing parametric (CalPred) and non-parametric (PredInterval) methods for constructing these intervals is essential.

    Purpose of the Study:

    • To compare the performance of CalPred and PredInterval for constructing calibrated prediction intervals for PGS.
    • To investigate the role of phenotype normalization and data preprocessing in achieving contextual calibration.
    • To assess the impact of dataset size and risk identification trade-offs between the two methods.

    Main Methods:

    • Comparison of CalPred (parametric) and PredInterval (non-parametric) methods.
    • Evaluation of contextual calibration strategies, including phenotype normalization and covariate inclusion.
    • Analysis of UK Biobank data and extreme simulation scenarios.

    Main Results:

    • Both CalPred and PredInterval can achieve calibrated coverage, with CalPred requiring a larger calibration set.
    • Phenotype normalization and data preprocessing are critical for achieving contextual calibration.
    • Standard GWAS normalization procedures in UK Biobank were sufficient for contextual calibration.
    • Covariate inclusion can restore calibration for PredInterval without normalization, while normalization improves upstream GWAS tasks.

    Conclusions:

    • Phenotype normalization and data preprocessing play a central role in GWAS, including reliable uncertainty quantification for PGS.
    • Appropriate data preprocessing ensures contextual calibration and enhances polygenic score accuracy and association power.
    • Understanding these methods is crucial for advancing genomic medicine and personalized risk prediction.