ReadDB provides efficient storage for mapped short reads.

P Alexander Rolfe¹, David K Gifford

¹Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA. dkg@mit.edu

BMC Bioinformatics

July 9, 2011

Summary

This summary is machine-generated.

ReadDB is a network-accessible database system for storing and retrieving large collections of aligned high-throughput sequencing reads. It gives ChIP-Seq and RNA-Seq analyses efficient access to the data and outperforms other network-based methods by a factor of three to five.

Area of Science:

Bioinformatics
Genomics
Computational Biology

Background:

High-throughput sequencing generates massive datasets (over 10^8 reads per experiment).
Existing tools inadequately address storage and retrieval challenges for large aligned sequencing datasets.
Efficient data management is crucial for analyzing cellular functions through sequencing.

Purpose of the Study:

To introduce ReadDB, a network-accessible column store database system.
To provide a solution for storing and retrieving large collections of aligned high-throughput sequencing data.
To facilitate visualization and analysis of genomic interval data.

Main Methods:

ReadDB is implemented as a network server.
It stores aligned read positions and responds to queries on genomic intervals.
For each query, it returns either the reads contained in the interval or a histogram-based summary of the interval.
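
The two query types described above can be illustrated with a minimal in-memory sketch. This is not ReadDB's implementation or API: the `ReadStore` class, its method names, and the half-open interval convention are assumptions made for illustration, and the network layer is omitted entirely. The sketch keeps one sorted list of read start positions per chromosome and uses binary search to answer interval queries.

```python
from bisect import bisect_left
from collections import defaultdict


class ReadStore:
    """Toy store of aligned read start positions (hypothetical, not ReadDB's API)."""

    def __init__(self):
        # chromosome name -> sorted list of read start positions
        self._positions = defaultdict(list)

    def load(self, chrom, positions):
        # Keep each chromosome's positions sorted so lookups can use binary search.
        self._positions[chrom] = sorted(positions)

    def reads_in(self, chrom, start, end):
        """Return read positions p with start <= p < end (half-open interval)."""
        col = self._positions.get(chrom, [])
        return col[bisect_left(col, start):bisect_left(col, end)]

    def histogram(self, chrom, start, end, bin_width):
        """Return {bin_start: count} for the non-empty bins within [start, end)."""
        counts = {}
        for p in self.reads_in(chrom, start, end):
            bin_start = start + ((p - start) // bin_width) * bin_width
            counts[bin_start] = counts.get(bin_start, 0) + 1
        return counts


store = ReadStore()
store.load("chr1", [105, 100, 250, 130, 999, 260])
hits = store.reads_in("chr1", 100, 300)        # reads contained in the interval
hist = store.histogram("chr1", 100, 300, 100)  # per-bin read counts
```

A histogram query returns one count per bin rather than one entry per read, which is presumably why a summary mode suits visualization over wide intervals.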

Main Results:

ReadDB demonstrates high performance on datasets ranging from 10^5 to 10^8 reads.
Performance is within a factor of two of local-storage methods.
Outperforms other network-based methods by a factor of three to five.

Conclusions:

ReadDB serves as a high-performance foundation for ChIP-Seq and RNA-Seq analysis.
The client-server model enables convenient access without shared network file systems or large local storage.
Offers a new method for storing genome-aligned reads, optimized for applications not requiring read sequence or mismatch data.
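
As a rough illustration of what a column layout without read sequences might look like, the sketch below holds each per-read attribute in its own typed array. The choice of columns (`positions`, `strands`, `weights`) and their types are assumptions for illustration, not ReadDB's actual on-disk format.

```python
from array import array

# Hypothetical columns: one typed array per attribute, aligned by read index.
positions = array("i", [100, 105, 130])  # read start coordinates
strands = array("b", [1, -1, 1])         # +1 forward strand, -1 reverse strand
weights = array("f", [1.0, 0.5, 1.0])    # per-read weights

# Storage cost per read is the sum of the column item sizes; no sequence
# or mismatch data is kept.
bytes_per_read = positions.itemsize + strands.itemsize + weights.itemsize

# A query that needs only one attribute scans only that column.
forward_count = sum(1 for s in strands if s > 0)
total_weight = sum(weights)
```

Because each attribute is stored separately, a query that needs only positions never touches the other columns, and each read costs a few bytes rather than the full length of its sequence.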