Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

SNPpy--database management for SNP data from genome wide association studies.

PloS one·2011
Same author

The Multiscale Systems Immunology project: software for cell-based immunological simulation.

Source code for biology and medicine·2008
Same journal

Mapping the 3D Chromosome Organization of a Biosynthetic Gene Cluster by Capture Hi-C (CHi-C).

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

Mapping the 3D Chromosome Organization of Streptomyces by Hi-C.

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

CUT&Tag Epigenomic Profiling of Biosynthetic Gene Clusters in Arabidopsis thaliana.

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

Rhizobium rhizogenes-Mediated Hairy Root Transformation Protocol for Lotus japonicus and Other Legumes.

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

Characterization of Bioactive Saponins from Sea Cucumbers.

Methods in molecular biology (Clifton, N.J.)·2026
Same journal

Methods for Functional Validation of Terpenoid Metabolic Clusters in Nicotiana benthamiana and Aspergillus oryzae.

Methods in molecular biology (Clifton, N.J.)·2026
See all related articles

Related Experiment Video

Updated: May 10, 2026

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

Managing large SNP datasets with SNPpy.

Faheem Mitha1

  • 1Duke University, Durham, NC, USA.

Methods in Molecular Biology (Clifton, N.J.)
|June 13, 2013
PubMed
Summary
This summary is machine-generated.

Managing single nucleotide polymorphism (SNP) datasets with relational databases offers advantages in data validation and export. SNPpy, a Python program, automates SNP data management using PostgreSQL and SQLAlchemy for efficient storage and retrieval.

More Related Videos

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

Related Experiment Videos

Last Updated: May 10, 2026

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Managing large single nucleotide polymorphism (SNP) datasets is crucial for genetic research.
  • Traditional methods for SNP data management present challenges in data validation and efficient querying.
  • Relational databases offer robust solutions for handling complex biological datasets.

Purpose of the Study:

  • To introduce SNPpy, a Python program designed for automated SNP dataset management.
  • To demonstrate the advantages of using relational databases for storing and managing large SNP datasets.
  • To illustrate the practical application of SNPpy with PostgreSQL and SQLAlchemy.

Main Methods:

  • Development of SNPpy, a Python-based software tool.
  • Integration of the PostgreSQL relational database management system.
  • Utilization of the SQLAlchemy Python library for database interaction.
  • Application of SQL for data validation and export functionalities.

Main Results:

  • SNPpy enables efficient storage and management of large SNP datasets.
  • The system leverages relational database capabilities for automated data validation.
  • SQL query language facilitates powerful data export from SNP datasets.
  • SNPpy automates the process of SNP data management, reducing manual effort.

Conclusions:

  • Relational databases provide a powerful and advantageous framework for SNP data management.
  • SNPpy offers a practical and automated solution for handling large-scale SNP datasets.
  • The integration of PostgreSQL and SQLAlchemy in SNPpy enhances data integrity and accessibility.