Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

18.2K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
18.2K
Genome Copying Errors02:46

Genome Copying Errors

4.6K
DNA replication is a well-evolved process that copies millions of base pairs with high fidelity during each cell division. Occasionally a wrong base or a long stretch of wrong bases may get added to the daughter strands. If the errors are left unchecked, cells might accumulate several mutations that might endanger their  survival. Therefore, the copying errors are checked and repaired at three levels.
4.6K
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

17.1K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
17.1K
Exon Recombination02:32

Exon Recombination

3.8K
The evolution of new genes is critical for speciation. Exon recombination, also known as exon shuffling or domain shuffling, is an important means of new gene formation. It is observed across vertebrates, invertebrates, and in some plants such as potatoes and sunflowers. During exon recombination, exons from the same or different genes recombine and produce new exon-intron combinations, which might evolve into new genes. 
Exon shuffling follows “splice frame rules.” Each exon...
3.8K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.5K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.5K
Genetic Variation01:25

Genetic Variation

944
Genetic variation is the diversity in DNA sequences found among individuals of the same species. This diversity is crucial for a species' survival because it helps organisms adapt to environmental changes. Genetic variation begins with fertilization, where an egg and sperm cell merge. Each of these cells carries 23 chromosomes, up to 46 in the fertilized egg. Chromosomes are long DNA strands that contain genes, the basic units of heredity.
Genes exist in different versions called alleles,...
944

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

An integrated cardiometabolic genetic testing program in a predominantly Hispanic population within a community setting.

Genetics in medicine : official journal of the American College of Medical Genetics·2026
Same author

Double Mosaicism in Xia-Gibbs Syndrome.

American journal of medical genetics. Part A·2026
Same author

Mapping the causal chain from genetic risk variants to lipid dysmetabolism in Parkinson's disease.

Brain : a journal of neurology·2026
Same author

An evaluation of genetic predisposition to congenital anomalies and pediatric cancer supports <i>KAT6B</i> as a novel neuroblastoma susceptibility gene.

Genetics in medicine open·2026
Same author

Evaluating the Genetic Overlap Between Congenital Heart Disease and Neuroblastoma Risk.

Pediatric blood & cancer·2025
Same author

Human genetic variation determines 24-hour rhythmic gene expression and disease risk.

Nature communications·2025
Same journal

Linkage disequilibrium and allelic heterogeneity explain variation in coronary artery disease risk at 9p21 across populations and reduced effect in Africans.

American journal of human genetics·2026
Same journal

Genome-wide association study and predictors of neonatal blood cell traits in Hispanic newborns.

American journal of human genetics·2026
Same journal

Comparison of methods for assessing effects of risk factors on disease progression in Mendelian randomization under index event bias.

American journal of human genetics·2026
Same journal

Deciding "what" to screen for and "when": The importance of natural history information.

American journal of human genetics·2026
Same journal

Homologous recombination deficiency-driven genomic instability in ovarian cancer as an indicator of BRCA1 and BRCA2 variant pathogenicity.

American journal of human genetics·2026
Same journal

Individuals who deviate from polygenic expectation are enriched for damaging variants in genes linked to rare disease.

American journal of human genetics·2026
See all related articles

Related Experiment Video

Updated: Nov 2, 2025

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.0K

Exome variant discrepancies due to reference-genome differences.

He Li1, Moez Dawood2, Michael M Khayat1

  • 1Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA.

American Journal of Human Genetics
|June 15, 2021
PubMed
Summary
This summary is machine-generated.

Choosing the right human reference genome (GRCh37 vs. GRCh38) significantly impacts variant identification in exome sequencing data, affecting disease gene discovery. Discordant variants cluster in specific genomic regions, influencing genetic interpretations for Mendelian and common diseases.

Keywords:
GRCh37GRCh38Human Genome Referenceclinical genome sequencingexome sequencinghg19

More Related Videos

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

34.2K
Author Spotlight: Genetic Profiling for Fluorouracil Response in Gastric Cancer
06:21

Author Spotlight: Genetic Profiling for Fluorouracil Response in Gastric Cancer

Published on: May 10, 2024

919

Related Experiment Videos

Last Updated: Nov 2, 2025

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.0K
Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

34.2K
Author Spotlight: Genetic Profiling for Fluorouracil Response in Gastric Cancer
06:21

Author Spotlight: Genetic Profiling for Fluorouracil Response in Gastric Cancer

Published on: May 10, 2024

919

Area of Science:

  • Genomics and Bioinformatics
  • Human Genetics
  • Medical Genomics

Background:

  • The GRCh37 human reference genome is still widely used despite the release of GRCh38 over seven years ago.
  • The impact of using different human reference genome assemblies on variant identification in large-scale exome sequencing data for disease association studies has not been quantified.

Purpose of the Study:

  • To quantify the impact of utilizing GRCh37 versus GRCh38 human reference genomes on identifying single-nucleotide variants (SNVs) and insertion-deletions (indels).
  • To assess how reference assembly choice affects variant interpretation for Mendelian and common diseases using exome sequencing data.

Main Methods:

  • Variant calling of SNVs and indels was performed on 1,572 exomes using both GRCh37 and GRCh38 human reference genomes.
  • Discordant variants and their genomic locations were analyzed, identifying discrete discordant reference patches (DISCREPs).
  • Genes enriched for discordant variants were identified and cross-referenced with known Mendelian diseases and common phenotypes from genome-wide association studies.

Main Results:

  • A total of 1.5% of SNVs and 2.0% of indels were discordant between GRCh37 and GRCh38 reference assemblies.
  • 76.6% of discordant variants were located in discrete discordant reference patches (DISCREPs), which constitute only 0.9% of exome sequencing loci.
  • 206 genes showed significant enrichment for discordant variants, with 8 implicated in Mendelian diseases and 53 associated with common phenotypes; reference choice also impacted variant interpretation after lift-over.

Conclusions:

  • The choice of human reference genome assembly (GRCh37 vs. GRCh38) significantly affects variant identification and interpretation in exome sequencing data.
  • Discordant variants are concentrated in specific genomic regions (DISCREPs), highlighting potential challenges in variant calling accuracy.
  • Careful evaluation of genes and loci affected by reference assembly choice is crucial for both research and clinical applications in genetic disease studies.