Benchmarking Mendelian randomization methods for causal inference using genome-wide association study summary statistics

Affiliations
  • 1School of Mathematical Sciences, Institute of Statistical Sciences, Shenzhen University, Shenzhen 518060, China; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China.
  • 2Department of Biostatistics, City University of Hong Kong, Hong Kong, China.
  • 3Shenzhen Research Institute of Big Data, Shenzhen 518172, China.
  • 4Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China.
  • 5Department of Biostatistics, Yale School of Public Health, New Haven, CT 06520, USA. Electronic address: hongyu.zhao@yale.edu.
  • 6Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China; Big Data Bio-Intelligence Lab, The Hong Kong University of Science and Technology, Hong Kong SAR, China. Electronic address: macyang@ust.hk.

Abstract

Mendelian randomization (MR), which utilizes genetic variants as instrumental variables (IVs), has gained popularity as a method for causal inference between phenotypes using genetic data. While efforts have been made to relax IV assumptions and develop new methods for causal inference in the presence of invalid IVs due to confounding, the reliability of MR methods in real-world applications remains uncertain. Instead of using simulated datasets, we conducted a benchmark study evaluating 16 two-sample summary-level MR methods using real-world genetic datasets to provide guidelines for the best practices. Our study focused on the following crucial aspects: type I error control in the presence of various confounding scenarios (e.g., population stratification, pleiotropy, and family-level confounders like assortative mating), the accuracy of causal effect estimates, replicability, and power. By comprehensively evaluating the performance of compared methods over one thousand exposure-outcome trait pairs, our study not only provides valuable insights into the performance and limitations of the compared methods but also offers practical guidance for researchers to choose appropriate MR methods for causal inference.

Related Concept Videos

JoVE Research Video for Genome-wide Association Studies-GWAS 01:11

10.3K

Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in…

JoVE Research Video for Genetic Screens 02:46

4.7K

Genetic screens are tools used to identify genes and mutations responsible for phenotypes of interest. Genetic screens help identify individuals or a group of people at risk of developing  genetic diseases and help them with early intervention, targeted therapy, and reproductive options.
Forward genetic screens
Forward or “classical” genetic screens involve creating random mutations in an organism’s DNA using radiation, mutagens, or insertion of additional bases, which…

JoVE Research Video for Hardy-Weinberg Principle 01:49

69.4K

Diploid organisms have two alleles of each gene, one from each parent, in their somatic cells. Therefore, each individual contributes two alleles to the gene pool of the population. The gene pool of a population is the sum of every allele of all genes within that population and has some degree of variation. Genetic variation is typically expressed as a relative frequency, which is the percentage of the total population that has a given allele, genotype or phenotype.

In the early 20th century,…

JoVE Research Video for Pleiotropy 01:33

35.7K

Pleiotropy is the phenomenon in which a single gene impacts multiple, seemingly unrelated phenotypic traits. For example, defects in the SOX10 gene cause Waardenburg Syndrome Type 4, or WS4, which can cause defects in pigmentation, hearing impairments, and an absence of intestinal contractions necessary for elimination. This diversity of phenotypes results from the expression pattern of SOX10 in early embryonic and fetal development. SOX10 is found in neural crest cells that form melanocytes,…

JoVE Research Video for Comparing Copy Number Variations and SNPs 02:26

13.4K

Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%…

JoVE Research Video for Wilcoxon Signed-Ranks Test for Median of Single Population 01:14

25

The Wilcoxon signed-rank test for the median of a single population is a nonparametric test used to evaluate whether the median of a population differs from a specified value. Unlike parametric tests, it does not require data to follow a normal distribution, making it suitable for non-normal or small samples. The test begins by calculating the difference (d) between each observation and the hypothesized median. The absolute values of these differences are ranked in ascending order, with ties…