Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

System of Memory

System of Memory

Memory is categorized into three major systems: sensory memory, short-term memory (STM), and long-term memory (LTM). These systems differ in their capacity and the duration for which they can hold information. Sensory memory captures raw sensory input from the environment, holding it for just a few seconds or less. For example, on hearing a brief, loud sound, like a car horn honking, the sound seems to linger in the mind for a moment even after it stops. This is an instance of sensory memory...

Understanding Memory

Understanding Memory

Memory is the retention of information or experiences over time, facilitated through three main processes: encoding, storage, and retrieval. Encoding is the process of inputting information into the memory system. For instance, when listening to a lecture, watching a play, reading a book, or having a conversation, the brain is actively encoding information. This initial stage involves transforming sensory input into a form that can be processed and stored by the brain. Various factors, such as...

Buffers: Buffer Capacity

Buffers: Buffer Capacity

Buffer capacity is the quantitative measure of a buffer to resist the change in pH. As shown in the following equation, the buffer capacity, denoted by 'beta', is expressed as the number of moles of acid or base needed to change the pH of a one-liter buffer solution by 1 unit. Here, Ca and Cb indicate the number of moles of acid and base, respectively. Note that dpH represents the change in pH.
In the graph, pH is plotted as a function of the number of moles of base (Cb) added to a weak acid...

Maximum Size of Aggregate

Maximum Size of Aggregate

The maximum size of aggregate is defined as the aperture of the sieve retaining 15 percent or more of the particles present in the aggregate sample. The aggregate's maximum size impacts the concrete's water requirement, workability, and strength. Larger aggregates reduce the surface area needing cement paste coverage, which can lower water needs, thereby allowing a decrease in the water-to-cement ratio when the desired workability and richness of the mix are to be maintained, which can result...

Multimachine Stability

Multimachine Stability

Multimachine stability analysis is crucial for understanding the dynamics and stability of power systems with multiple synchronous machines. The objective is to solve the swing equations for a network of M machines connected to an N-bus power system.
In analyzing the system, the nodal equations represent the relationship between bus voltages, machine voltages, and machine currents. The nodal equation is given by:

Storage

Storage

A schema is a mental framework that helps individuals organize and interpret information. Schemata, formed from previous experiences, influence how we process new information: how we encode it, the inferences we make, and how we retrieve it. For instance, a schema for what a typical classroom looks like might include desks, a teacher's desk, a whiteboard, and students in such an environment. This expectation helps us quickly understand and navigate new classrooms without needing to analyze each...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

SNooPy: a statistical framework for long-read metagenomic variant calling.

Nucleic acids research·2026

Same author

A versatile multi-components mixed model for bacterial-Genome Wide association studies.

Nature communications·2026

Same author

Data Structures to Represent a Set of <math><mi>k</mi></math> -long DNA Sequences.

ACM computing surveys·2026

Same author

High-quality metagenome assembly from nanopore reads with nanoMDBG.

Nature communications·2026

Same author

De-Bruijn graph partitioning for scalable and accurate DNA storage processing.

Bioinformatics (Oxford, England)·2025

Same author

Comprehensive Annotation of Olfactory and Gustatory Receptor Genes and Transposable Elements Revealed Their Evolutionary Dynamics in Aphids.

Molecular biology and evolution·2025

Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026

Same journal

KASSPer: Kinase Active Site Structure Prediction using Protein and Ligand Language Models and Its Application to Virtual Screening.

Bioinformatics (Oxford, England)·2026

Same journal

IDR searcher: a search engine solution for public image resources.

Bioinformatics (Oxford, England)·2026

Same journal

KCFtools: Rapid alignment-free method for introgression screening and GWAS using k-mer profiles.

Bioinformatics (Oxford, England)·2026

Same journal

Meta2DB: Curated shotgun metagenomic feature sets and metadata for health state prediction.

Bioinformatics (Oxford, England)·2026

Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 15, 2026

Fast Colony Forming Unit Counting in 96-Well Plate Format Applied to the Drosophila Microbiome

Fast Colony Forming Unit Counting in 96-Well Plate Format Applied to the Drosophila Microbiome

Published on: January 13, 2023

DSK: k-mer counting with very low memory usage.

Guillaume Rizk¹, Dominique Lavenier, Rayan Chikhi

¹Algorizk, 75013 Paris, France.

Bioinformatics (Oxford, England)

|January 18, 2013

Summary

This summary is machine-generated.

A new streaming algorithm, DSK (disk streaming of k-mers), efficiently counts DNA/RNA k-mers using fixed memory and disk space. This method offers a memory, time, and disk trade-off, making it suitable for servers with limited memory.

More Related Videos

Automated Quantification and Analysis of Cell Counting Procedures Using ImageJ Plugins

Automated Quantification and Analysis of Cell Counting Procedures Using ImageJ Plugins

Published on: November 17, 2016

Micro-drive Array for Chronic in vivo Recording: Drive Fabrication

Micro-drive Array for Chronic in vivo Recording: Drive Fabrication

Published on: April 20, 2009

Related Experiment Videos

Last Updated: May 15, 2026

Fast Colony Forming Unit Counting in 96-Well Plate Format Applied to the Drosophila Microbiome

Fast Colony Forming Unit Counting in 96-Well Plate Format Applied to the Drosophila Microbiome

Published on: January 13, 2023

Automated Quantification and Analysis of Cell Counting Procedures Using ImageJ Plugins

Automated Quantification and Analysis of Cell Counting Procedures Using ImageJ Plugins

Published on: November 17, 2016

Micro-drive Array for Chronic in vivo Recording: Drive Fabrication

Micro-drive Array for Chronic in vivo Recording: Drive Fabrication

Published on: April 20, 2009

Area of Science:

Bioinformatics
Computational Biology
Genomics

Background:

K-mer counting is essential for DNA/RNA sequencing analysis.
Existing methods demand substantial in-memory data structures.
Data structure size scales with the number of distinct k-mers.

Purpose of the Study:

Introduce DSK (disk streaming of k-mers), a novel streaming algorithm for k-mer counting.
Address memory limitations of current state-of-the-art k-mer counting techniques.
Provide an efficient alternative for k-mer analysis on resource-constrained servers.

Main Methods:

DSK employs a streaming approach with a fixed memory and disk footprint.
Partitions the multi-set of k-mers and saves them to disk.
Loads partitions into temporary hash tables for counting.
Optionally filters low-abundance k-mers.

Main Results:

DSK successfully counted all 27-mers in a human genome dataset using only 4.0 GB RAM and 160 GB disk space.
The computation time for the human genome dataset was 17.9 hours.
DSK demonstrates a viable memory, time, and disk trade-off.

Conclusions:

DSK is the first algorithm capable of counting all k-mers in large datasets with limited memory.
DSK can serve as a replacement for existing tools like Jellyfish on servers with restricted memory.
The algorithm offers efficient k-mer counting for various bioinformatics applications.