Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

17.8K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
17.8K
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

18.5K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
18.5K
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

15.2K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
15.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Association between obesity, sex, medical comorbidities, and survival in cancer patients treated with immune checkpoint inhibitors.

Journal of Cancer·2026
Same author

Effects of unventilated versus ventilated filtered cigarettes on smoking behaviour and biomarkers of smoke exposure in a randomised clinical trial.

Tobacco control·2026
Same author

Lung microbiome predictors of epigenetic aging and potential associations with smoking and electronic cigarette use.

Clinical epigenetics·2026
Same author

Overcoming vascular niche-mediated TKI resistance in acute myeloid leukemia through miR-126 inhibition.

NPJ systems biology and applications·2026
Same author

Role of the anti-Shine-Dalgarno sequence of 16S rRNA in Flavobacterium johnsoniae.

Nucleic acids research·2026
Same author

Associations of breast DNA methylation-based measures of biological ageing with obesity-related breast cancer risk markers.

Epigenetics·2026
Same journal

Mild oxidative stress and dietary epigenetic modulators direct DNA methylation remodeling toward stress-resilience pathways.

BMC genomics·2026
Same journal

Integrative ATAC-Seq and RNA-Seq analysis identifies key genes for intramuscular fat content in Laiwu pigs.

BMC genomics·2026
Same journal

A comprehensive long RNA landscape of multi-regional porcine lung-derived small extracellular vesicles.

BMC genomics·2026
Same journal

pGWAS-Portal: a comprehensive online platform for integrative post-genome-wide association study analysis.

BMC genomics·2026
Same journal

Physiological and transcriptomic analyses of Rosa persica in response to drought stress and functional validation of the transcription factor RpERF113-like.

BMC genomics·2026
Same journal

Integrated analysis of chromatin accessibility and transcriptome profiles in granulosa cells of sheep with different FecB genotypes.

BMC genomics·2026
See all related articles

Related Experiment Video

Updated: Dec 31, 2025

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.5K

SMaSH: Sample matching using SNPs in humans.

Maximillian Westphal1, David Frankhouser2,3, Carmine Sonzone4

  • 1Interdisciplinary Biophysics Graduate Program, The Ohio State University, 484 W. 12th Avenue, Columbus, 43210, OH, USA.

BMC Genomics
|January 1, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces SMASH, a Bayesian tool to identify sample swaps in omics data using thousands of single nucleotide polymorphisms (SNPs). SMASH ensures data integrity across diverse sequencing types, even with low-quality samples.

Keywords:
Identity matchingNext generation sequencing dataSample swap

More Related Videos

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.6K
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
05:51

A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia

Published on: June 15, 2011

26.3K

Related Experiment Videos

Last Updated: Dec 31, 2025

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.5K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.6K
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
05:51

A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia

Published on: June 15, 2011

26.3K

Area of Science:

  • Genomics
  • Bioinformatics
  • Data Quality Assurance

Background:

  • Inadvertent sample swaps compromise data integrity in large-scale omics studies.
  • Identifying sample matches typically requires numerous single nucleotide polymorphisms (SNPs) due to variable omics data coverage.

Purpose of the Study:

  • To develop a robust method for identifying sample identity in next-generation sequencing (NGS) data.
  • To address the challenge of sample mix-ups in omics research.

Main Methods:

  • Selection of approximately six thousand SNPs across the human genome.
  • Development of a Bayesian framework for robust sample matching in sequencing datasets.

Main Results:

  • Validation of the approach across diverse datasets, including Exome, RNA-Seq, and MethylCap-Seq.
  • Demonstration that identity detection is reliable even with low-quality RNA-Seq data (20 million reads).
  • Assessment of how sample quality and read coverage impact identity detection accuracy.

Conclusions:

  • The SMASH tool accurately identifies sample mismatches in NGS data.
  • SMASH is effective across different sequencing modalities and for low-quality data.
  • Ensures reliable sample tracking in omics studies.