Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

13.8K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
13.8K
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

15.5K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
15.5K
Variability: Analysis01:11

Variability: Analysis

167
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
167
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

17.8K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
17.8K
Confounding in Epidemiological Studies01:27

Confounding in Epidemiological Studies

214
Confounding in statistical epidemiology represents a pivotal challenge, referring to the distortion in the perceived relationship between an exposure and an outcome due to the presence of a third variable, known as a confounder. This variable is associated with both the exposure and the outcome but is not a direct link in their causal chain. Its presence can lead to erroneous interpretations of the exposure's effect, either exaggerating or underestimating the true association. This...
214
Multiple Allele Traits01:49

Multiple Allele Traits

34.5K
The Concept of Multiple Allelism
34.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Fanconi Anemia as a Window into Premalignant Field Cancerization of the Oral Mucosa.

medRxiv : the preprint server for health sciences·2026
Same author

Tobacco product use among middle and high school students in the United States: National Youth Tobacco Survey, 2025.

Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco·2026
Same author

Incidence patterns and trends of chordoma in the United States: a population-based analysis of over 6,000 cases.

Journal of bone oncology·2026
Same author

Disruption of CTCF binding by germline non-coding variants in <i>CDKN2B</i> suppress <i>CDKN2A</i> expression and predispose to melanoma.

medRxiv : the preprint server for health sciences·2026
Same author

Developing the multidisciplinary and multispecialty workforce and the systems needed to sustain high-quality care from diagnosis through long-term survivorship.

Supportive care in cancer : official journal of the Multinational Association of Supportive Care in Cancer·2026
Same author

Prevalence of Familial Melanoma Genes and Cancer Risk Among Genomically Ascertained Individuals.

JAMA dermatology·2026
Same journal

Thymidylate synthase inhibitory drugs induce p53-dependent pathways differently.

PloS one·2026
Same journal

Top-down and bottom-up attention for joint pattern classification and reconstruction.

PloS one·2026
Same journal

Short- and long-term scaling behavior of blood pressure and pulse arrival time during sleep in healthy controls and patients with obstructive sleep apnea.

PloS one·2026
Same journal

Double DQN-based secrecy energy efficiency and fairness performance in IRS-assisted NOMA systems with friendly jamming.

PloS one·2026
Same journal

10 recommendations for strengthening citizen science for improved societal and ecological outcomes: A co-produced analysis of challenges and opportunities in the 21st century.

PloS one·2026
Same journal

Paying in public: Peer effects, impression management, and willingness to pay on digital payment platforms.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Aug 12, 2025

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation
07:15

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation

Published on: January 16, 2019

11.1K

Inflated expectations: Rare-variant association analysis using public controls.

Jung Kim1, Danielle M Karyadi1, Stephen W Hartley1

  • 1Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, United States of America.

Plos One
|January 25, 2023
PubMed
Summary
This summary is machine-generated.

Using public controls in rare variant association studies can lead to false positives. Employing consistent variant-calling pipelines significantly reduces inflated test statistics, mitigating risks associated with public data.

More Related Videos

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

33.9K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.3K

Related Experiment Videos

Last Updated: Aug 12, 2025

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation
07:15

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation

Published on: January 16, 2019

11.1K
Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

33.9K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.3K

Area of Science:

  • Genetics
  • Bioinformatics
  • Statistical Genomics

Background:

  • Publicly available sequencing datasets offer potential as controls in rare variant disease association studies.
  • However, their use can increase the risk of false-positive discoveries due to unexamined factors contributing to inflated test statistics.

Purpose of the Study:

  • To systematically investigate factors contributing to false-positive discoveries when using public controls in rare variant association studies.
  • To quantify the inflation of statistical significance using lambda-delta-95 (λΔ95).

Main Methods:

  • Leveraged public control datasets (gnomAD v2.1) and in-house sequenced datasets.
  • Systematically analyzed factors including variant caller/filtering pipelines, library preparation kits, sequencers, and joint vs. separate variant-calling.
  • Quantified false-positive inflation using λΔ95.

Main Results:

  • Using the same variant caller and filtering pipelines for cases and controls substantially decreased test statistic inflation.
  • Differences in library preparation kits and sequencers did not significantly impact the false-positive discovery rate.
  • Joint versus separate variant-calling of cases and controls did not contribute to the inflation of test statistics.

Conclusions:

  • Current methods inadequately adjust for high false-positive rates when using public controls.
  • Risks are emphasized for rare-variant association tests using public controls when individual-level data and unified computational pipelines are inaccessible.
  • Cloud-based computing offers a solution by enabling containerized pipelines to be brought to the data, minimizing these issues.