Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
Genetic Variation01:25

Genetic Variation

Genetic variation is the diversity in DNA sequences found among individuals of the same species. This diversity is crucial for a species' survival because it helps organisms adapt to environmental changes. Genetic variation begins with fertilization, where an egg and sperm cell merge. Each of these cells carries 23 chromosomes, up to 46 in the fertilized egg. Chromosomes are long DNA strands that contain genes, the basic units of heredity.
Genes exist in different versions called alleles, which...
Principles of Pharmacogenetics: Types of Genetic Variants01:27

Principles of Pharmacogenetics: Types of Genetic Variants

The human genome is over 99.9% identical between individuals, yet genetic differences exist at millions of bases. The human genome contains approximately 3 million variant positions per individual, many of which are heterozygous, contributing to genetic diversity and individual traits. Genetic variations include single-nucleotide polymorphisms (SNPs), insertions, deletions, and copy number variations (CNVs).SNPs, the most common variation, involve single-base changes in DNA. These can be...
Genomics02:02

Genomics

Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Linker-Enabled Self-Nitridation and Multistep Pyrolysis Pathways of Cu-Triazolate MOFs Toward Cu<sub>3</sub>N/Carbon Fibers.

Angewandte Chemie (International ed. in English)·2026
Same author

DeepICER: A deep learning framework for predicting compound-induced gene expression profiles.

Acta pharmaceutica Sinica. B·2026
Same author

A near-complete, haplotype-resolved telomere-to-telomere genome assembly of Cannabis sativa reveals complex higher-order repetitive structures.

Plant communications·2026
Same author

Corrigendum to "Ozone-coupled photocatalytic H<sub>2</sub>O<sub>2</sub> synthesis via Mn/Ce bimetallic metal organic framework: synergistic catalysis and sacrificial agent-free efficiency" [J. Colloid and Interface Sci. 702 (2026) 138801].

Journal of colloid and interface science·2026
Same author

Targeting oncogenic NTSR1 with liensinine reprograms Gq-mediated signaling to suppress lung adenocarcinoma.

Chinese medicine·2026
Same author

In vitro bioactive metabolite production and plant regeneration of medicinal plant Ligusticum chuanxiong Hort.

BMC plant biology·2026
Same journal

Applying Bayesian Multivariable Mendelian Randomisation to Prioritise Candidate Causal Traits From High-Dimensional Data: Illustration From Estimation of the Effect of Maternal Metabolites on Offspring Birthweight.

Genetic epidemiology·2026
Same journal

Individualized Bayesian Inference Identifies Novel Genetic Variants for Parkinson's Disease.

Genetic epidemiology·2026
Same journal

DRIVE v3: Command Line Application for Identity-by-Descent Haplotype Clustering in Large Biobank Scale Data.

Genetic epidemiology·2026
Same journal

Deep Unsupervised Domain Adaptation for Translating Cancer Dependency Maps From Cell Lines to Breast Cancer Tumor Genomics.

Genetic epidemiology·2026
Same journal

Polygenic Risk Scores for Incident Dementia in the Multi-Ethnic Study of Atherosclerosis.

Genetic epidemiology·2026
Same journal

Outcome and Exposure Polygenic Risk Scores Can Help Reduce Information Bias and Selection Bias in Regression Estimates From Biobank Data.

Genetic epidemiology·2026
See all related articles

Related Experiment Video

Updated: May 16, 2026

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

Simulating realistic genomic data with rare variants.

Yaji Xu1, Yinghua Wu, Chi Song

  • 1Department of Biostatistics, Yale University, New Haven, CT 06511, USA.

Genetic Epidemiology
|November 20, 2012
PubMed
Summary
This summary is machine-generated.

Simulating genetic data with rare variants is crucial for disease risk research. This study introduces a novel imputation-based algorithm that effectively generates realistic genomic data, preserving key properties like linkage disequilibrium.

More Related Videos

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila
06:41

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila

Published on: August 20, 2019

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

Related Experiment Videos

Last Updated: May 16, 2026

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila
06:41

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila

Published on: August 20, 2019

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

Area of Science:

  • Genetics and Genomics
  • Statistical Bioinformatics
  • Computational Biology

Background:

  • Rare genetic variants significantly influence disease risk for both Mendelian and common diseases.
  • Identifying rare variants and evaluating statistical methods for their analysis is computationally challenging.
  • Existing genomic data is insufficient for method evaluation due to unknown disease mechanisms.

Purpose of the Study:

  • To develop and evaluate a novel algorithm for simulating genomic data that accurately reflects the presence of rare variants.
  • To create a simulation method that preserves essential population genetic properties like linkage disequilibrium (LD) and allele frequencies.
  • To enable robust evaluation of statistical methods for rare variant association studies by providing realistic simulated datasets.

Main Methods:

  • Developed a hybrid algorithm combining regression-based imputation with resampling techniques.
  • Utilized logistic regression to model the relationship between rare and common variants using 1000 Genomes Project data.
  • Imputed rare variants based on common variants and simulated individuals using the modified real data.

Main Results:

  • The proposed algorithm successfully simulates genetic data incorporating both rare and common variants.
  • Qualitative comparison demonstrated that the method effectively retains real sample properties, including linkage disequilibrium (LD) and minor allele frequency.
  • The simulation approach addresses limitations of existing resampling methods.

Conclusions:

  • The developed imputation-based resampling algorithm provides a valuable tool for generating realistic genetic data with rare variants.
  • This method facilitates the rigorous evaluation of statistical and bioinformatic tools for rare variant association studies.
  • Accurate simulation of rare variants is essential for advancing our understanding of genetic contributions to complex diseases.