Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
Comparing Mitochondrial, Chloroplast, and Prokaryotic Genomes02:16

Comparing Mitochondrial, Chloroplast, and Prokaryotic Genomes

The present-day mitochondrial and chloroplast genomes have retained some of the characteristics of their ancestral prokaryotes and also have acquired new attributes during their evolution within eukaryotic cells. Like prokaryotic genomes, mitochondrial and chloroplast genomes neither bind with histone-like proteins nor show complex packaging into chromosome-like structures, as observed in eukaryotes. Unlike mitotic cell divisions observed in eukaryotic cells, mitochondria and chloroplasts...
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
Genomics02:02

Genomics

Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
Evolution of Microbial Genome01:08

Evolution of Microbial Genome

Microbial genome evolution is a highly dynamic process shaped by continual gene gain and loss across species and strains. This genomic flexibility allows microorganisms to adapt rapidly to environmental pressures and interactions with other organisms. Central to understanding this diversity is the distinction between the core and pan genomes.The core genome comprises the genes shared by all sampled strains of a species, representing essential functions needed for fundamental cellular processes.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Kaminari: a frugal colored index for approximate <i>k</i>-mer queries.

Bioinformatics advances·2026
Same author

SNooPy: a statistical framework for long-read metagenomic variant calling.

Nucleic acids research·2026
Same author

A versatile multi-components mixed model for bacterial-Genome Wide association studies.

Nature communications·2026
Same author

Data Structures to Represent a Set of <math><mi>k</mi></math> -long DNA Sequences.

ACM computing surveys·2026
Same author

High-quality metagenome assembly from nanopore reads with nanoMDBG.

Nature communications·2026
Same author

De-Bruijn graph partitioning for scalable and accurate DNA storage processing.

Bioinformatics (Oxford, England)·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: May 15, 2026

Large-Scale Screens of Metagenomic Libraries
16:05

Large-Scale Screens of Metagenomic Libraries

Published on: May 28, 2007

Compareads: comparing huge metagenomic experiments.

Nicolas Maillet1, Claire Lemaitre, Rayan Chikhi

  • 1INRIA Rennes - Bretagne Atlantique/IRISA, EPI GenScale, Rennes, France. nicolas.maillet@inria.fr

BMC Bioinformatics
|January 4, 2013
PubMed
Summary
This summary is machine-generated.

Compareads enables de novo comparison of huge metagenomic datasets, identifying similar DNA reads without prior knowledge. This approach efficiently handles millions of reads on personal computers, revealing novel biological insights.

More Related Videos

Metagenomic Analysis of Silage
08:43

Metagenomic Analysis of Silage

Published on: January 13, 2017

Purifying the Impure: Sequencing Metagenomes and Metatranscriptomes from Complex Animal-associated Samples
11:23

Purifying the Impure: Sequencing Metagenomes and Metatranscriptomes from Complex Animal-associated Samples

Published on: December 22, 2014

Related Experiment Videos

Last Updated: May 15, 2026

Large-Scale Screens of Metagenomic Libraries
16:05

Large-Scale Screens of Metagenomic Libraries

Published on: May 28, 2007

Metagenomic Analysis of Silage
08:43

Metagenomic Analysis of Silage

Published on: January 13, 2017

Purifying the Impure: Sequencing Metagenomes and Metatranscriptomes from Complex Animal-associated Samples
11:23

Purifying the Impure: Sequencing Metagenomes and Metatranscriptomes from Complex Animal-associated Samples

Published on: December 22, 2014

Area of Science:

  • Bioinformatics
  • Genomics
  • Computational Biology

Background:

  • Metagenomic analysis traditionally relies on comparing sequences to existing databases.
  • This limits the discovery of unknown or unculturable microbial species, estimated at 99% for Bacteria.

Purpose of the Study:

  • Introduce Compareads, a novel de novo comparative metagenomic approach.
  • Enable the comparison of metagenomic datasets without relying on a priori knowledge.

Main Methods:

  • Compareads identifies similar DNA reads between two metagenomic datasets.
  • Utilizes a probabilistic data structure based on Bloom filters for efficient indexing of millions of reads.
  • Designed to handle extremely large datasets with limited memory and controlled error rates.

Main Results:

  • Compareads successfully retrieves biological information from massive datasets.
  • Scalable to over 100 million Illumina reads within hours using approximately 4 GB of memory.
  • Demonstrates practical usability on standard personal computers.

Conclusions:

  • Compareads offers a practical solution for de novo comparison of large metagenomic samples.
  • The developed data structure enables efficient analysis of previously inaccessible biological information.
  • The software is freely available under the CeCILL license.