Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

12.4K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
12.4K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

21.4K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
21.4K
Next-generation Sequencing03:00

Next-generation Sequencing

100.6K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
100.6K
Maxam-Gilbert Sequencing01:05

Maxam-Gilbert Sequencing

13.5K
In the same year as the discovery of the Sanger sequencing method, another group of scientists, Allan Maxam and Walter Gilbert, demonstrated their chemical-cleavage method for DNA sequencing. The Maxam-Gilbert method relies on using different chemicals that can cleave the DNA sequence at specific sites, the separation of resulting DNA fragments of variable size using electrophoresis, and deciphering the DNA sequence from the resulting gel bands.
Challenges of the Maxam-Gilbert Method
The...
13.5K
Sanger Sequencing01:57

Sanger Sequencing

777.4K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
777.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Towards precision oncology: a multi-level cancer classification system integrating liquid biopsy and machine learning.

BioData mining·2025
Same author

A Multi-model Deep Learning Architecture for Diagnosing Multi-class Skin Diseases.

Journal of imaging informatics in medicine·2024
Same author

Precision cancer classification using liquid biopsy and advanced machine learning techniques.

Scientific reports·2024
Same author

MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads.

BMC bioinformatics·2024
Same author

Unboxing machine learning models for concrete strength prediction using XAI.

Scientific reports·2023
Same author

Blink-To-Live eye-based communication system for users with speech impairments.

Scientific reports·2023
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Mar 17, 2026

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.2K

LightAssembler: fast and memory-efficient assembly algorithm for high-throughput sequencing reads.

Sara El-Metwally1, Magdi Zakaria2, Taher Hamza2

  • 1Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA Department of Computer Science, Faculty of Computers and Information, Mansoura University, Mansoura 35516, Egypt.

Bioinformatics (Oxford, England)
|July 15, 2016
PubMed
Summary
This summary is machine-generated.

Next-generation sequencing (NGS) generates vast data, overwhelming computational resources. LightAssembler offers a memory-efficient genome assembly solution, reducing memory usage significantly for accurate results.

More Related Videos

3' End Sequencing Library Preparation with A-seq2
12:01

3' End Sequencing Library Preparation with A-seq2

Published on: October 10, 2017

11.2K
Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms

Published on: May 9, 2017

9.7K

Related Experiment Videos

Last Updated: Mar 17, 2026

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.2K
3' End Sequencing Library Preparation with A-seq2
12:01

3' End Sequencing Library Preparation with A-seq2

Published on: October 10, 2017

11.2K
Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms

Published on: May 9, 2017

9.7K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • The rapid advancement of next-generation sequencing (NGS) technologies has led to an exponential increase in genomic data generation, surpassing Moore's Law.
  • Storing, processing, and analyzing this massive data deluge presents significant computational challenges, particularly for complex genomes.
  • Existing genome assembly pipelines often require large amounts of memory to handle the complex assembly graphs generated from high-throughput NGS reads and genomic repeats, making them intractable for large-scale analyses.

Purpose of the Study:

  • To develop a resource-efficient genome assembler that can operate on standard desktop machines.
  • To reduce the memory footprint of genome assembly processes without compromising accuracy or contiguity.
  • To provide a practical solution for analyzing large-scale genomic datasets.

Main Methods:

  • LightAssembler employs a lightweight assembly algorithm utilizing a pair of cache-oblivious Bloom filters.
  • One Bloom filter stores a uniform sample of k-mers, while the other stores k-mers identified as likely correct via a statistical test.
  • The algorithm features a streamlined implementation of graph traversal and simplification modules.

Main Results:

  • LightAssembler achieves comparable assembly accuracy and contiguity to existing competing tools.
  • The method demonstrates a significant reduction in memory usage (up to 10x) compared to other resource-efficient assemblers on benchmark datasets.
  • Assembly size and genome coverage remain relatively constant across different gap size parameters.

Conclusions:

  • LightAssembler provides an effective and memory-efficient solution for genome assembly, suitable for desktop environments.
  • The algorithm's design addresses the computational bottleneck associated with processing large-scale NGS data.
  • This tool facilitates more accessible and scalable genomic data analysis.