Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.
The...

Polygenic Traits

Polygenic Traits

When more than one gene is responsible for a given phenotype, the trait is considered polygenic. Human height is a polygenic trait. Studies have uncovered hundreds of loci that influence height, and there are believed to be many more. Due to the high number of genes involved, as well as environmental and nutritional factors, height varies significantly within a given population. The distribution of height forms a bell-shaped curve, with relatively few individuals in the population at the...

Polygenic Traits

Polygenic Traits

When more than one gene is responsible for a given phenotype, the trait is considered polygenic. Human height is a polygenic trait. Studies have uncovered hundreds of loci that influence height, and there are believed to be many more. Due to the high number of genes involved, as well as environmental and nutritional factors, height varies significantly within a given population. The distribution of height forms a bell-shaped curve, with relatively few individuals in the population at the...

Calibration Curves: Correlation Coefficient

Calibration Curves: Correlation Coefficient

In a linear calibration curve, there is a value called the calibration coefficient, denoted by 'r,' which measures the strength and the direction of association between two variables. The correlation coefficient value ranges from −1 to +1. A value of +1 indicates a perfect positive linear correlation, −1 denotes a perfect negative correlation, and 0 implies no correlation between the two variables. A positive correlation value establishes that as one variable increases, the other increases, and...

Calibration Curves: Linear Least Squares

Calibration Curves: Linear Least Squares

A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...

Confidence Intervals

Confidence Intervals

An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a sample proportion. However, unlike the point estimate which is a single value, the confidence interval contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A confidence...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A unified framework enables accessible deployment and comprehensive benchmarking of single-cell foundation models.

bioRxiv : the preprint server for biology·2026

Same author

Decoding multilevel relationships with the human tissue-cell-molecule network.

Briefings in bioinformatics·2022

Same author

Inhalation airflow and ventilation efficiency in subject-specific human upper airways.

Respiratory physiology & neurobiology·2020

Same author

Adverse Effects of Low-Dose Methotrexate in a Randomized Double-Blind Placebo-Controlled Trial: Adjudicated Hematologic and Skin Cancer Outcomes in the Cardiovascular Inflammation Reduction Trial.

ACR open rheumatology·2020

Same author

Preparation and <i>in Vitro</i> Antitumor Study of Two-Dimensional Muscovite Nanosheets.

Langmuir : the ACS journal of surfaces and colloids·2020

Same author

Identification and Bioinformatic Assessment of circRNA Expression After <i>RMI1</i> Knockdown and Ionizing Radiation Exposure.

DNA and cell biology·2020

Same journal

Comparative Evaluation of Pretrained Large Language Models for Suicide Risk Prediction from Clinical Notes in U.S. Veterans.

medRxiv : the preprint server for health sciences·2026

Same journal

Nocturnal Respiratory Rate and Variability Predict Long-term Mortality in Stable Outpatients with Cardiovascular Disease.

medRxiv : the preprint server for health sciences·2026

Same journal

MOSAIC: Methylation-Oriented Site Analysis and Information Classifier for Robust Epigenomic Classification of Acute Leukemia in Clinical Cohorts with Variable Tumor Purity.

medRxiv : the preprint server for health sciences·2026

Same journal

Risk beliefs, intensive digital information and demand for a new preventative health product in public clinics: Evidence from an experiment in Zimbabwe.

medRxiv : the preprint server for health sciences·2026

Same journal

Development of an automated, imaging-based preoperative screening model for early identification of malnutrition in an abdominal surgery cohort.

medRxiv : the preprint server for health sciences·2026

Same journal

A Pilot Project Leveraging Large Language Models for Automated Screening and Variable Extraction in Observational Studies.

medRxiv : the preprint server for health sciences·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 5, 2026

Large-Scale Multi-Omics Genome-Wide Association Studies (Mo-GWAS): Guidelines for Sample Preparation and Normalization

Large-Scale Multi-Omics Genome-Wide Association Studies (Mo-GWAS): Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

Calibrated Prediction Intervals for Polygenic Scores: Updated Comparisons, Contextual Calibration, and Data

Chang Xu, Siyu Hou, Xiang Zhou

Medrxiv : the Preprint Server for Health Sciences

|June 4, 2026

Summary

This summary is machine-generated.

Calibrated prediction intervals for polygenic scores (PGS) are crucial for genomic medicine. Phenotype normalization and data preprocessing are key for accurate uncertainty quantification and improved polygenic score accuracy.

More Related Videos

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Related Experiment Videos

Last Updated: Jun 5, 2026

Large-Scale Multi-Omics Genome-Wide Association Studies (Mo-GWAS): Guidelines for Sample Preparation and Normalization

Large-Scale Multi-Omics Genome-Wide Association Studies (Mo-GWAS): Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Area of Science:

Genomics
Biostatistics
Precision Medicine

Background:

Calibrated prediction intervals for polygenic scores (PGS) are vital for communicating individual-level uncertainty in genomic medicine.
Comparing parametric (CalPred) and non-parametric (PredInterval) methods for constructing these intervals is essential.

Purpose of the Study:

To compare the performance of CalPred and PredInterval for constructing calibrated prediction intervals for PGS.
To investigate the role of phenotype normalization and data preprocessing in achieving contextual calibration.
To assess the impact of dataset size and risk identification trade-offs between the two methods.

Main Methods:

Comparison of CalPred (parametric) and PredInterval (non-parametric) methods.
Evaluation of contextual calibration strategies, including phenotype normalization and covariate inclusion.
Analysis of UK Biobank data and extreme simulation scenarios.

Main Results:

Both CalPred and PredInterval can achieve calibrated coverage, with CalPred requiring a larger calibration set.
Phenotype normalization and data preprocessing are critical for achieving contextual calibration.
Standard GWAS normalization procedures in UK Biobank were sufficient for contextual calibration.
Covariate inclusion can restore calibration for PredInterval without normalization, while normalization improves upstream GWAS tasks.

Conclusions:

Phenotype normalization and data preprocessing play a central role in GWAS, including reliable uncertainty quantification for PGS.
Appropriate data preprocessing ensures contextual calibration and enhances polygenic score accuracy and association power.
Understanding these methods is crucial for advancing genomic medicine and personalized risk prediction.