Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Downsampling01:20

Downsampling

194
When considering a sampled sequence with zero values between sampling instants, one can replace it by taking every N-th value of the sequence. At these integer multiples of N, the original and sampled sequences coincide. This process, known as decimation, involves extracting every N-th sample from a sequence, thereby creating a more efficient sequence.
The Fourier transform of the decimated sequence reveals a combination of scaled and shifted versions of the original spectrum. This...
194
Upsampling01:22

Upsampling

266
Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...
266
Next-generation Sequencing03:00

Next-generation Sequencing

91.6K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
91.6K
RNA-seq03:21

RNA-seq

10.1K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
10.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multimodal Information-Driven Heterogeneous Graph Neural Networks for Protein-Ligand Binding Affinity Prediction.

Journal of chemical information and modeling·2026
Same author

SCMBench: benchmarking domain-specific and foundation models for single-cell multi-omics data integration.

Nature communications·2026
Same author

HorusEye: a self-supervised foundation model for generalizable X-ray tomography restoration.

Nature computational science·2026
Same author

Microsecond Dynamics of the Pyrophosphate Ion Release in SARS-CoV‑2 RNA Polymerase.

JACS Au·2026
Same author

Incorporating valuable prior knowledge to improve deep learning prediction of genetic perturbation responses.

Genome research·2026
Same author

Pushing the boundaries of autonomous biological discovery.

Nature methods·2026
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026
Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Jul 25, 2025

Author Spotlight: Cost-Effective Transcriptomic Drug Screening - Unlocking New Targets
06:40

Author Spotlight: Cost-Effective Transcriptomic Drug Screening - Unlocking New Targets

Published on: February 23, 2024

1.3K

S-leaping: an efficient downsampling method for large high-throughput sequencing data.

Hiroyuki Kuwahara1, Xin Gao1

  • 1Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.

Bioinformatics (Oxford, England)
|June 24, 2023
PubMed
Summary
This summary is machine-generated.

Downsampling large omics datasets is computationally challenging. We developed s-leaping, an efficient method, and fadso, a tool for FASTQ files, improving speed and reducing memory usage for omics data analysis.

More Related Videos

An Ultrahigh-throughput Microfluidic Platform for Single-cell Genome Sequencing
10:00

An Ultrahigh-throughput Microfluidic Platform for Single-cell Genome Sequencing

Published on: May 23, 2018

17.7K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.1K

Related Experiment Videos

Last Updated: Jul 25, 2025

Author Spotlight: Cost-Effective Transcriptomic Drug Screening - Unlocking New Targets
06:40

Author Spotlight: Cost-Effective Transcriptomic Drug Screening - Unlocking New Targets

Published on: February 23, 2024

1.3K
An Ultrahigh-throughput Microfluidic Platform for Single-cell Genome Sequencing
10:00

An Ultrahigh-throughput Microfluidic Platform for Single-cell Genome Sequencing

Published on: May 23, 2018

17.7K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.1K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Sequencing coverage is crucial for omics study design.
  • Downsampling is used to estimate cost-effective sequencing coverage.
  • Large datasets make traditional downsampling computationally intensive.

Purpose of the Study:

  • To develop an efficient and accurate downsampling method for large omics data.
  • To create a user-friendly tool for applying the method to FASTQ files.

Main Methods:

  • Developed an approximate downsampling algorithm named s-leaping.
  • Created a C-based tool called fadso for processing FASTQ data.
  • Compared s-leaping and fadso against existing downsampling methods.

Main Results:

  • s-leaping demonstrated up to 39% faster performance than existing methods with comparable accuracy.
  • fadso showed up to 12% increased speed and 21% lower memory usage on large datasets.
  • fadso achieved up to 40% higher throughput in parallel computing settings.

Conclusions:

  • s-leaping offers an efficient and accurate solution for downsampling large omics datasets.
  • fadso provides a practical and high-throughput tool for FASTQ file downsampling.
  • The developed methods facilitate cost-effective design and analysis of omics studies.