Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: May 11, 2026

Detection of Copy Number Alterations Using Single Cell Sequencing
09:45

Detection of Copy Number Alterations Using Single Cell Sequencing

Published on: February 17, 2017

Disk-based k-mer counting on a PC.

Sebastian Deorowicz1, Agnieszka Debudaj-Grabysz, Szymon Grabowski

  • 1Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland. sebastian.deorowicz@polsl.pl

BMC Bioinformatics
|May 18, 2013
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Additive-driven microwave crystallization of tyramine polymorphs and salts: a quantum crystallography perspective. Corrigendum.

IUCrJ·2026
Same author

Reference-free discovery with barcoded single-cell sequencing.

Nature biotechnology·2026
Same author

FunctionaL Assigning Sequence Homing (FLASH) maps phenotype to sequence with deep and machine learning.

bioRxiv : the preprint server for biology·2026
Same author

Fast and accurate multiple-protein-sequence alignment at scale with FAMSA2.

Nature biotechnology·2026
Same author

A Reference-Free Algorithm Discovers Regulation in the Plant Transcriptome.

Plant direct·2026
Same author

MDCompress: better, faster compression of molecular dynamics simulation trajectories.

Bioinformatics (Oxford, England)·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

We developed KMC, an efficient parallel algorithm for k-mer counting. This method rapidly processes large genomic datasets on standard computers, utilizing disk space and parallelism for speed and memory efficiency.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • K-mer counting is crucial for bioinformatics tasks like genome assembly and repeat detection.
  • Efficiently analyzing large biological sequences requires robust k-mer counting methods.

Purpose of the Study:

  • To develop a parallel, disk-based algorithm for efficient k-mer counting.
  • To address memory and computational limitations in analyzing massive genomic datasets.

Main Methods:

  • A parallel, disk-based algorithm named KMC (k-mer counting) was implemented.
  • The algorithm leverages CPU and I/O parallelism for efficient data processing.
  • Resource management strategies were employed to optimize performance on commodity hardware.

More Related Videos

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens
09:14

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens

Published on: June 28, 2018

Counting Proteins in Single Cells with Addressable Droplet Microarrays
12:25

Counting Proteins in Single Cells with Addressable Droplet Microarrays

Published on: July 6, 2018

Related Experiment Videos

Last Updated: May 11, 2026

Detection of Copy Number Alterations Using Single Cell Sequencing
09:45

Detection of Copy Number Alterations Using Single Cell Sequencing

Published on: February 17, 2017

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens
09:14

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens

Published on: June 28, 2018

Counting Proteins in Single Cells with Addressable Droplet Microarrays
12:25

Counting Proteins in Single Cells with Addressable Droplet Microarrays

Published on: July 6, 2018

Main Results:

  • KMC provides a fast solution for k-mer counting, outperforming other algorithms in many settings.
  • It successfully processed short-read human genome data in under 40 minutes on a 16 GB RAM PC.
  • The algorithm is memory-frugal, capable of handling massive datasets on standard personal computers.

Conclusions:

  • KMC is a competitive k-mer counting procedure that utilizes disk space and parallelism effectively.
  • Judicious resource management enables solving large-scale bioinformatics problems on commodity PCs.
  • The findings highlight the potential of efficient algorithms for democratizing access to high-performance computing in bioinformatics.