Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Polygenic Traits01:18

Polygenic Traits

65.6K
When more than one gene is responsible for a given phenotype, the trait is considered polygenic. Human height is a polygenic trait. Studies have uncovered hundreds of loci that influence height, and there are believed to be many more. Due to the high number of genes involved, as well as environmental and nutritional factors, height varies significantly within a given population. The distribution of height forms a bell-shaped curve, with relatively few individuals in the population at the...
65.6K
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

13.3K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
13.3K
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

14.9K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
14.9K
Prediction Intervals01:03

Prediction Intervals

2.2K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.2K
Pleiotropy01:33

Pleiotropy

40.3K
Pleiotropy is the phenomenon in which a single gene impacts multiple, seemingly unrelated phenotypic traits. For example, defects in the SOX10 gene cause Waardenburg Syndrome Type 4, or WS4, which can cause defects in pigmentation, hearing impairments, and an absence of intestinal contractions necessary for elimination. This diversity of phenotypes results from the expression pattern of SOX10 in early embryonic and fetal development. SOX10 is found in neural crest cells that form melanocytes,...
40.3K
Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Improved heritability partitioning and enrichment analyses using summary statistics with graphREML.

Nature genetics·2026
Same author

STELLAR: A flexible ensemble learning framework integrating rare variants to enhance polygenic risk prediction.

medRxiv : the preprint server for health sciences·2026
Same author

MetaSTAARlite: an all-in-one tool for biobank-scale whole-genome sequencing meta-analysis.

Nature computational science·2026
Same author

Multi-ancestry transcriptome-wide association studies uncover insights into breast cancer genetics and biology.

Nature communications·2026
Same author

A high-penetrance intergenic variant at 9p21 confers melanoma susceptibility.

Research square·2026
Same author

Integrating common and rare variants improves polygenic risk prediction across diverse populations.

Nature communications·2026
Same journal

The TaMYB55-TaSnRK1α1-TabZIP9 module confers heat stress tolerance in wheat.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Superstatistics approach to turbulent circulation fluctuations.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

A molecular timescale for evolution of cobamide biosynthesis.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Pierre Chambon, a pioneer of molecular biology and gene regulation in eukaryotes.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Granulosa cell glycogen fuels the avascular corpus luteum.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Synthetic essentiality of TRAIL/TNFSF10 in VHL-deficient renal cell carcinoma.

Proceedings of the National Academy of Sciences of the United States of America·2026
See all related articles

Related Experiment Video

Updated: Jun 17, 2025

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.1K

Fast and scalable ensemble learning method for versatile polygenic risk prediction.

Tony Chen1, Haoyu Zhang2, Rahul Mazumder3

  • 1Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02215.

Proceedings of the National Academy of Sciences of the United States of America
|August 7, 2024
PubMed
Summary
This summary is machine-generated.

A new method, Aggregated L0Learn using Summary-level data (ALL-Sum), significantly improves polygenic risk score (PRS) calculation. It offers higher accuracy, faster computation, and lower memory use for personalized medicine applications.

Keywords:
L0Learnensemble learningpenalized regressionpolygenic risk scores

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K
Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

38.9K

Related Experiment Videos

Last Updated: Jun 17, 2025

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.1K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K
Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

38.9K

Area of Science:

  • Genetics
  • Biostatistics
  • Computational Biology

Background:

  • Polygenic risk scores (PRS) are crucial for risk stratification and personalized medicine.
  • Existing PRS methods struggle with computational efficiency, accuracy, and diverse genetic architectures.

Purpose of the Study:

  • To introduce Aggregated L0Learn using Summary-level data (ALL-Sum), an efficient and scalable method for computing PRS.
  • To address limitations of current PRS methods in terms of speed, accuracy, and adaptability.

Main Methods:

  • Developed ALL-Sum, an ensemble learning method utilizing L0L2 penalized regression on genome-wide association study (GWAS) summary statistics.
  • Employed ensemble learning across tuning parameters to model various genetic architectures.
  • Validated using large-scale simulations and real-world data for 11 complex traits.

Main Results:

  • ALL-Sum outperformed existing methods in simulations by 10% in accuracy, 20-fold in speed, and threefold in memory efficiency.
  • Real-world data analysis showed ALL-Sum achieved 25% higher PRS accuracy, 15x faster computation, and 50% less memory usage.
  • Demonstrated robustness across diverse genetic architectures and stability with varying linkage disequilibrium data.

Conclusions:

  • ALL-Sum provides a fast, scalable, and accurate solution for PRS computation, advancing personalized medicine.
  • The method is robust across various genetic architectures and data sources.
  • ALL-Sum is available as an R package for accessible use in genetic research.