Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Polygenic Traits

Polygenic Traits

When more than one gene is responsible for a given phenotype, the trait is considered polygenic. Human height is a polygenic trait. Studies have uncovered hundreds of loci that influence height, and there are believed to be many more. Due to the high number of genes involved, as well as environmental and nutritional factors, height varies significantly within a given population. The distribution of height forms a bell-shaped curve, with relatively few individuals in the population at the...

Genome-wide Association Studies-GWAS

Genome-wide Association Studies-GWAS

Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...

Single Nucleotide Polymorphisms-SNPs

Single Nucleotide Polymorphisms-SNPs

A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

Pleiotropy

Pleiotropy

Pleiotropy is the phenomenon in which a single gene impacts multiple, seemingly unrelated phenotypic traits. For example, defects in the SOX10 gene cause Waardenburg Syndrome Type 4, or WS4, which can cause defects in pigmentation, hearing impairments, and an absence of intestinal contractions necessary for elimination. This diversity of phenotypes results from the expression pattern of SOX10 in early embryonic and fetal development. SOX10 is found in neural crest cells that form melanocytes,...

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Improved heritability partitioning and enrichment analyses using summary statistics with graphREML.

Nature genetics·2026

Same author

STELLAR: A flexible ensemble learning framework integrating rare variants to enhance polygenic risk prediction.

medRxiv : the preprint server for health sciences·2026

Same author

MetaSTAARlite: an all-in-one tool for biobank-scale whole-genome sequencing meta-analysis.

Nature computational science·2026

Same author

Multi-ancestry transcriptome-wide association studies uncover insights into breast cancer genetics and biology.

Nature communications·2026

Same author

A high-penetrance intergenic variant at 9p21 confers melanoma susceptibility.

Research square·2026

Same author

Integrating common and rare variants improves polygenic risk prediction across diverse populations.

Nature communications·2026

Same journal

The TaMYB55-TaSnRK1α1-TabZIP9 module confers heat stress tolerance in wheat.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Superstatistics approach to turbulent circulation fluctuations.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

A molecular timescale for evolution of cobamide biosynthesis.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Pierre Chambon, a pioneer of molecular biology and gene regulation in eukaryotes.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Granulosa cell glycogen fuels the avascular corpus luteum.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Synthetic essentiality of TRAIL/TNFSF10 in VHL-deficient renal cell carcinoma.

Proceedings of the National Academy of Sciences of the United States of America·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 17, 2025

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

Fast and scalable ensemble learning method for versatile polygenic risk prediction.

Tony Chen¹, Haoyu Zhang², Rahul Mazumder³

¹Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02215.

Proceedings of the National Academy of Sciences of the United States of America

|August 7, 2024

Summary

This summary is machine-generated.

A new method, Aggregated L0Learn using Summary-level data (ALL-Sum), significantly improves polygenic risk score (PRS) calculation. It offers higher accuracy, faster computation, and lower memory use for personalized medicine applications.

Keywords:

L0Learn ensemble learning penalized regression polygenic risk scores

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Infinium Assay for Large-scale SNP Genotyping Applications

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

Related Experiment Videos

Last Updated: Jun 17, 2025

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Infinium Assay for Large-scale SNP Genotyping Applications

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

Area of Science:

Genetics
Biostatistics
Computational Biology

Background:

Polygenic risk scores (PRS) are crucial for risk stratification and personalized medicine.
Existing PRS methods struggle with computational efficiency, accuracy, and diverse genetic architectures.

Purpose of the Study:

To introduce Aggregated L0Learn using Summary-level data (ALL-Sum), an efficient and scalable method for computing PRS.
To address limitations of current PRS methods in terms of speed, accuracy, and adaptability.

Main Methods:

Developed ALL-Sum, an ensemble learning method utilizing L0L2 penalized regression on genome-wide association study (GWAS) summary statistics.
Employed ensemble learning across tuning parameters to model various genetic architectures.
Validated using large-scale simulations and real-world data for 11 complex traits.

Main Results:

ALL-Sum outperformed existing methods in simulations by 10% in accuracy, 20-fold in speed, and threefold in memory efficiency.
Real-world data analysis showed ALL-Sum achieved 25% higher PRS accuracy, 15x faster computation, and 50% less memory usage.
Demonstrated robustness across diverse genetic architectures and stability with varying linkage disequilibrium data.

Conclusions:

ALL-Sum provides a fast, scalable, and accurate solution for PRS computation, advancing personalized medicine.
The method is robust across various genetic architectures and data sources.
ALL-Sum is available as an R package for accessible use in genetic research.