Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while microarray-based...
RACE - Rapid Amplification of cDNA Ends02:35

RACE - Rapid Amplification of cDNA Ends

Rapid Amplification of cDNA Ends, or RACE, is one of the most effective methods to obtain a full-length cDNA from an mRNA sequence between a known internal region to the unknown sequence at the 5’ or 3’ end. The unknown region is cloned in the cDNA by a gene-specific primer that binds the known end, and a hybrid primer that attaches a predefined anchor sequence to the unknown end of the cDNA. The sequence in between is amplified by PCR with an anchor primer and a gene-specific primer.
Since the...
Sanger Sequencing01:57

Sanger Sequencing

DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
Next-generation Sequencing03:00

Next-generation Sequencing

The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

T-cell receptors that are <i>k</i>-binding have defined sequence features.

Frontiers in immunology·2025
Same author

Identifying lupus Patient Subsets Through Immune Cell Deconvolution of Gene Expression Data in Two Atacicept Phase II Studies.

ACR open rheumatology·2023
Same author

Major β cell-specific functions of NKX2.2 are mediated via the NK2-specific domain.

Genes & development·2023
Same author

Systematic elucidation of genetic mechanisms underlying cholesterol uptake.

Cell genomics·2023
Same author

Revealing the immune cell subtype reconstitution profile in patients from the CLARITY study using deconvolution algorithms after cladribine tablets treatment.

Scientific reports·2023
Same author

A pan-variant mRNA-LNP T cell vaccine protects HLA transgenic mice from mortality after infection with SARS-CoV-2 Beta.

Frontiers in immunology·2023
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: May 31, 2026

Introductory Analysis and Validation of CUT&#38;RUN Sequencing Data
04:58

Introductory Analysis and Validation of CUT&RUN Sequencing Data

Published on: December 13, 2024

ReadDB provides efficient storage for mapped short reads.

P Alexander Rolfe1, David K Gifford

  • 1Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA. dkg@mit.edu

BMC Bioinformatics
|July 9, 2011
PubMed
Summary
This summary is machine-generated.

ReadDB is a novel database system designed for storing and retrieving large collections of high-throughput sequencing data. It offers efficient access for ChIP-Seq and RNA-Seq analysis, improving upon existing network-based methods.

More Related Videos

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

Transcription Start Site Mapping Using Super-low Input Carrier-CAGE
06:59

Transcription Start Site Mapping Using Super-low Input Carrier-CAGE

Published on: June 26, 2019

Related Experiment Videos

Last Updated: May 31, 2026

Introductory Analysis and Validation of CUT&#38;RUN Sequencing Data
04:58

Introductory Analysis and Validation of CUT&RUN Sequencing Data

Published on: December 13, 2024

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

Transcription Start Site Mapping Using Super-low Input Carrier-CAGE
06:59

Transcription Start Site Mapping Using Super-low Input Carrier-CAGE

Published on: June 26, 2019

Area of Science:

  • Bioinformatics
  • Genomics
  • Computational Biology

Background:

  • High-throughput sequencing generates massive datasets (over 10^8 reads per experiment).
  • Existing tools inadequately address storage and retrieval challenges for large aligned sequencing datasets.
  • Efficient data management is crucial for analyzing cellular functions through sequencing.

Purpose of the Study:

  • To introduce ReadDB, a network-accessible column store database system.
  • To provide a solution for storing and retrieving large collections of aligned high-throughput sequencing data.
  • To facilitate visualization and analysis of genomic interval data.

Main Methods:

  • ReadDB is implemented as a network server.
  • It stores aligned read positions and responds to queries on genomic intervals.
  • Provides either contained reads or histogram-based interval summaries.

Main Results:

  • ReadDB demonstrates high performance on datasets ranging from 10^5 to 10^8 reads.
  • Performance is within a factor of two of local-storage methods.
  • Outperforms other network-based methods by three to five times.

Conclusions:

  • ReadDB serves as a high-performance foundation for ChIP-Seq and RNA-Seq analysis.
  • The client-server model enables convenient access without shared network file systems or large local storage.
  • Offers a new method for storing genome-aligned reads, optimized for applications not requiring read sequence or mismatch data.