Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

12.3K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
12.3K
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

17.0K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
17.0K
Pleiotropy01:33

Pleiotropy

39.4K
Pleiotropy is the phenomenon in which a single gene impacts multiple, seemingly unrelated phenotypic traits. For example, defects in the SOX10 gene cause Waardenburg Syndrome Type 4, or WS4, which can cause defects in pigmentation, hearing impairments, and an absence of intestinal contractions necessary for elimination. This diversity of phenotypes results from the expression pattern of SOX10 in early embryonic and fetal development. SOX10 is found in neural crest cells that form melanocytes,...
39.4K
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

13.8K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
13.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Genetic Risk Factors for Kidney Function in Individuals with Type 1 Diabetes.

Clinical journal of the American Society of Nephrology : CJASN·2026
Same author

Response to the Letter to the Editor Entitled "A Missing Genomic Dimension: The Small but Central Mitochondrial Genome in Diabetic Kidney Disease Genetics".

Kidney international reports·2026
Same author

Multistage Testing for Cognitive Diagnosis Based on Skill-Space Partitioning.

Applied psychological measurement·2026
Same author

Development of polyphenotypic scores to prioritize detection of G6PD rs1050828 carriers in African and African American populations.

Genetics in medicine : official journal of the American College of Medical Genetics·2026
Same author

Genetic Linkage for Bipolar Disorder to the Distal Region of Chromosome 19q: A Large Family Whole Genome Sequencing Study.

Bipolar disorders·2026
Same author

Accommodating the Analysis Model in Multiple Imputation for the Weibull Mixture Cure Model: Performance Under Penalized Likelihood.

Statistics in medicine·2026
Same journal

Applying Bayesian Multivariable Mendelian Randomisation to Prioritise Candidate Causal Traits From High-Dimensional Data: Illustration From Estimation of the Effect of Maternal Metabolites on Offspring Birthweight.

Genetic epidemiology·2026
Same journal

Individualized Bayesian Inference Identifies Novel Genetic Variants for Parkinson's Disease.

Genetic epidemiology·2026
Same journal

DRIVE v3: Command Line Application for Identity-by-Descent Haplotype Clustering in Large Biobank Scale Data.

Genetic epidemiology·2026
Same journal

Deep Unsupervised Domain Adaptation for Translating Cancer Dependency Maps From Cell Lines to Breast Cancer Tumor Genomics.

Genetic epidemiology·2026
Same journal

Polygenic Risk Scores for Incident Dementia in the Multi-Ethnic Study of Atherosclerosis.

Genetic epidemiology·2026
Same journal

Outcome and Exposure Polygenic Risk Scores Can Help Reduce Information Bias and Selection Bias in Regression Estimates From Biobank Data.

Genetic epidemiology·2026
See all related articles

Related Experiment Video

Updated: May 24, 2025

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

3.5K

Dimension Reduction Using Local Principal Components for Regression-Based Multi-SNP Analysis in 1000 Genomes and the

Fatemeh Yavartanoo1, Myriam Brossard2, Shelley B Bull2,3

  • 1Department of Mathematics Education, Seoul National University, Seoul, South Korea.

Genetic Epidemiology
|March 1, 2025
PubMed
Summary
This summary is machine-generated.

Dimension Reduction using Local Principal Components (DRLPC) effectively resolves multi-collinearity in genetic association studies. This method improves regression model stability and enhances the statistical power of genetic tests.

Keywords:
1000 Genomes Project (phase 3)Canadian Longitudinal on Aging (CLSA)dimension reductionmulti‐SNP statisticsmulti‐collinearityprincipal component analysis (PCA)variance inflation factor (VIF)

More Related Videos

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.6K
Screening for Functional Non-coding Genetic Variants Using Electrophoretic Mobility Shift Assay EMSA and DNA-affinity Precipitation Assay DAPA
11:35

Screening for Functional Non-coding Genetic Variants Using Electrophoretic Mobility Shift Assay EMSA and DNA-affinity Precipitation Assay DAPA

Published on: August 21, 2016

12.9K

Related Experiment Videos

Last Updated: May 24, 2025

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

3.5K
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.6K
Screening for Functional Non-coding Genetic Variants Using Electrophoretic Mobility Shift Assay EMSA and DNA-affinity Precipitation Assay DAPA
11:35

Screening for Functional Non-coding Genetic Variants Using Electrophoretic Mobility Shift Assay EMSA and DNA-affinity Precipitation Assay DAPA

Published on: August 21, 2016

12.9K

Area of Science:

  • Genetics
  • Statistical Genetics
  • Bioinformatics

Background:

  • Multi-collinearity poses a significant challenge in genetic association studies using multiple Single Nucleotide Polymorphisms (SNPs).
  • This issue can lead to regression model instability and failure in genetic association analyses.
  • Existing methods struggle to adequately address severe multi-collinearity in dense genotype data.

Purpose of the Study:

  • To propose and evaluate a novel dimension reduction method, Dimension Reduction using Local Principal Components (DRLPC), for addressing multi-collinearity.
  • To improve the power and applicability of regression-based statistical tests in genetic association studies.
  • To assess the effectiveness of DRLPC in reducing variable numbers while preserving essential genetic information.

Main Methods:

  • DRLPC removes SNPs exhibiting high linear dependency, assuming remaining SNPs capture their effects.
  • Variance Inflation Factor (VIF) is used to quantify collinearity, with SNPs above a VIF threshold (e.g., 20) being excluded.
  • The method was applied to chromosome 22 SNPs from the 1000 Genomes Project and the Canadian Longitudinal Study on Aging (CLSA).

Main Results:

  • DRLPC significantly reduces the number of SNPs for regression analysis, particularly for larger genes (average reduction to ~20%).
  • For smaller genes, the reduction is less pronounced (average ~48%), indicating tailored effectiveness.
  • Application of DRLPC improved the power of the multiple regression Wald test from 60% to approximately 80% in simulation studies.

Conclusions:

  • DRLPC is an effective strategy for mitigating multi-collinearity in genetic association studies.
  • The method enhances the power of statistical tests, leading to more robust genetic association findings.
  • DRLPC offers improved applicability for subsequent regression analyses, especially with large genetic datasets.