Querying large read collections in main memory: a versatile data structure.

Nicolas Philippe¹, Mikaël Salson, Thierry Lecroq

¹LIRMM, UMR 5506, CNRS and Université de Montpellier 2, CC 477, 161 rue Ada, 34095 Montpellier, France.

BMC Bioinformatics | June 21, 2011

Summary

This summary is machine-generated.

We developed Gk arrays, a novel data structure for efficiently indexing and querying large collections of sequencing reads. This solution significantly reduces memory usage and speeds up analysis for various genomics applications.

Area of Science:

Bioinformatics
Computational Biology
Genomics

Background:

High-throughput sequencing (HTS) generates massive datasets that require efficient bioinformatic analysis.
Current work focuses on indexing genomes for read mapping; indexing the reads themselves remains largely unexplored.
As sequencing throughput grows, efficient querying of large read collections becomes crucial.

Purpose of the Study:

To introduce Gk arrays, a new data structure for indexing large read collections.
To present an algorithm for building and querying the Gk arrays structure.
To demonstrate the efficiency and memory advantages of Gk arrays compared to existing methods.

Main Methods:

Developed the Gk arrays data structure for read indexing.
Created an algorithm for constructing the Gk arrays.
Implemented procedures for querying the structure to retrieve reads containing specific k-mers.
Compared the performance of Gk arrays with that of uncompressed indexing structures adapted to the same task.
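
The kind of query described above (retrieving the reads that contain a given k-mer) can be illustrated with a minimal Python sketch. This is not the Gk arrays structure or the authors' library: it assumes a plain hash-based inverted index, and the function names (`build_kmer_index`, `occurrences`, `reads_containing`, `count_reads_containing`) are hypothetical. It shows only what the queries return, not the memory behavior or speed reported for Gk arrays.

```python
from collections import defaultdict


def build_kmer_index(reads, k):
    """Map each k-mer to the (read_id, offset) pairs where it starts.

    Naive illustration only: a hash-based inverted index, not Gk arrays.
    """
    index = defaultdict(list)
    for read_id, read in enumerate(reads):
        # A read of length n has n - k + 1 k-mers (none if n < k).
        for offset in range(len(read) - k + 1):
            index[read[offset:offset + k]].append((read_id, offset))
    return index


def occurrences(index, kmer):
    """All (read_id, offset) occurrences of kmer, in read order."""
    # .get avoids creating an empty entry for an absent k-mer.
    return index.get(kmer, [])


def reads_containing(index, kmer):
    """Sorted ids of the distinct reads containing kmer at least once."""
    return sorted({read_id for read_id, _ in occurrences(index, kmer)})


def count_reads_containing(index, kmer):
    """Number of distinct reads containing kmer."""
    return len(reads_containing(index, kmer))


if __name__ == "__main__":
    example_reads = ["ACGTAC", "CGTACG", "TTTTTT"]  # made-up toy reads
    idx = build_kmer_index(example_reads, k=3)
    print(reads_containing(idx, "ACG"))
    print(count_reads_containing(idx, "TTT"))
    print(occurrences(idx, "TTT"))
```

A dictionary of Python lists stores every k-mer string and every position as separate objects, so its memory footprint grows quickly with the size of the collection; that cost is the motivation for a compact array-based index.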

Main Results:

Gk arrays enable fast querying of large read collections.
The structure requires significantly less memory than the other solutions tested.
Gk arrays can handle larger read collections efficiently.
Demonstrated applications in SNP detection, assembly, and RNA-Seq analysis.

Conclusions:

Gk arrays offer a versatile data structure for fast and accurate read analysis.
The structure facilitates efficient mining of genomics, epigenomics, metagenomics, and transcriptomics data.
The Gk arrays library is publicly available under a GPL-compliant license.