Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
Multiple Comparison Tests01:13

Multiple Comparison Tests

Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
Significance Testing: Overview01:04

Significance Testing: Overview

Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...
Sign Test for Matched Pairs01:17

Sign Test for Matched Pairs

The sign test for matched pairs offers a robust method for comparing two paired samples, often for the effects of an intervention in one of them. This method is very useful in situations where the underlying distribution of the data is unknown. The test compares two related samples—often pre- and post-treatment measurements on the same subjects—to determine if there are significant differences in their median values.
To conduct the sign test, we first calculate the differences in value between...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Testing for gene-environment interaction under exposure misspecification.

Biometrics·2017
Same author

Novel genetic variants in the P38MAPK pathway gene ZAK and susceptibility to lung cancer.

Molecular carcinogenesis·2017
Same author

Updating the "Risk Index": A systematic review and meta-analysis of occupational injuries and work schedule characteristics.

Chronobiology international·2017
Same author

Genome-wide interaction study of smoking behavior and non-small cell lung cancer risk in Caucasian population.

Carcinogenesis·2017
Same author

Epigenomic study identifies a novel mesenchyme homeobox2-GLI1 transcription axis involved in cancer drug resistance, overall survival and therapy prognosis in lung cancer patients.

Oncotarget·2017
Same author

Common <i>TDP1</i> Polymorphisms in Relation to Survival among Small Cell Lung Cancer Patients: A Multicenter Study from the International Lung Cancer Consortium.

Clinical cancer research : an official journal of the American Association for Cancer Research·2017
Same journal

Thymidylate synthase inhibitory drugs induce p53-dependent pathways differently.

PloS one·2026
Same journal

Top-down and bottom-up attention for joint pattern classification and reconstruction.

PloS one·2026
Same journal

Short- and long-term scaling behavior of blood pressure and pulse arrival time during sleep in healthy controls and patients with obstructive sleep apnea.

PloS one·2026
Same journal

Double DQN-based secrecy energy efficiency and fairness performance in IRS-assisted NOMA systems with friendly jamming.

PloS one·2026
Same journal

10 recommendations for strengthening citizen science for improved societal and ecological outcomes: A co-produced analysis of challenges and opportunities in the 21st century.

PloS one·2026
Same journal

Paying in public: Peer effects, impression management, and willingness to pay on digital payment platforms.

PloS one·2026
See all related articles

Related Experiment Video

Updated: May 18, 2026

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

Association test based on SNP set: logistic kernel machine based test vs. principal component analysis.

Yang Zhao1, Feng Chen, Rihong Zhai

  • 1Department of Environmental Health, Harvard School of Public Health, Harvard University, Boston, Massachusetts, United States of America.

Plos One
|October 3, 2012
PubMed
Summary
This summary is machine-generated.

Linear kernel machine (LKM) and principal component analysis (PCA) effectively control type I errors in GWAS. Both methods increase power for complex disease risk SNP discovery, especially with multiple causal SNPs.

More Related Videos

Machine Learning-Based Cough Tone Classification: Diagnostic Exploration of Chronic Obstructive Pulmonary Disease and Respiratory Tract Infections
06:22

Machine Learning-Based Cough Tone Classification: Diagnostic Exploration of Chronic Obstructive Pulmonary Disease and Respiratory Tract Infections

Published on: September 19, 2025

Related Experiment Videos

Last Updated: May 18, 2026

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

Machine Learning-Based Cough Tone Classification: Diagnostic Exploration of Chronic Obstructive Pulmonary Disease and Respiratory Tract Infections
06:22

Machine Learning-Based Cough Tone Classification: Diagnostic Exploration of Chronic Obstructive Pulmonary Disease and Respiratory Tract Infections

Published on: September 19, 2025

Area of Science:

  • Genetics
  • Statistical genetics
  • Bioinformatics

Background:

  • Genome-Wide Association Studies (GWAS) identify risk single nucleotide polymorphisms (SNPs) for complex diseases.
  • Individual SNP analysis in GWAS suffers from low power and reproducibility due to multiple comparison corrections.
  • SNP set-based methods leverage biological knowledge and genomic features to enhance discovery power.

Purpose of the Study:

  • To compare the performance of linear kernel machine (LKM) and principal component analysis (PCA) for SNP set-based association testing.
  • To evaluate these methods under various scenarios, including different numbers of causal SNPs and linkage disequilibrium (LD) structures.
  • To assess the power of LKM and PCA in detecting multiple causal SNPs and compare them to individual SNP analysis.

Main Methods:

  • Utilized simulated datasets with 0 to 3 causal SNPs and simple/complex LD structures.
  • Applied linear kernel machine (LKM) tests.
  • Employed principal component analysis (PCA) based approaches, varying the number of principal components (PCs).

Main Results:

  • Both LKM and PCA maintained type I error control at a 0.05 significance level across simulations.
  • PCA with few PCs and LKM are valid for strong LD scenarios.
  • Complex LD structures necessitate more PCs for PCA to capture causal SNP information.
  • LKM and PCA demonstrated the ability to combine information from multiple causal SNPs, increasing power over single SNP analysis.

Conclusions:

  • LKM and PCA are robust methods for SNP set-based association analysis in GWAS.
  • The choice of parameters (e.g., number of PCs) for PCA depends on the complexity of the LD structure.
  • These methods offer improved power for identifying genetic risk factors in complex diseases compared to traditional single SNP approaches.
  • Applied LKM and PCA to a non-small cell lung cancer GWAS dataset, demonstrating real-world applicability.