Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Long-patch Base Excision Repair01:02

Long-patch Base Excision Repair

8.1K
Since the discovery of the two BER pathways, there has been a debate about how a cell chooses one pathway over the other and the factors determining this selection. Numerous in vitro experiments have pointed out multiple determinants for the sub-pathway selection. These are:
8.1K
Understanding Memory01:19

Understanding Memory

1.6K
Memory is the retention of information or experiences over time, facilitated through three main processes: encoding, storage, and retrieval. Encoding is the process of inputting information into the memory system. For instance, when listening to a lecture, watching a play, reading a book, or having a conversation, the brain is actively encoding information. This initial stage involves transforming sensory input into a form that can be processed and stored by the brain. Various factors, such as...
1.6K
System of Memory01:23

System of Memory

7.5K
Memory is categorized into three major systems: sensory memory, short-term memory (STM), and long-term memory (LTM). These systems differ in their capacity and the duration for which they can hold information. Sensory memory captures raw sensory input from the environment, holding it for just a few seconds or less. For example, on hearing a brief, loud sound, like a car horn honking, the sound seems to linger in the mind for a moment even after it stops. This is an instance of sensory memory...
7.5K
Buffers: Buffer Capacity01:09

Buffers: Buffer Capacity

2.5K
Buffer capacity is the quantitative measure of a buffer to resist the change in pH. As shown in the following equation, the buffer capacity, denoted by 'beta', is expressed as the number of moles of acid or base needed to change the pH of a one-liter buffer solution by 1 unit. Here, Ca and Cb indicate the number of moles of acid and base, respectively. Note that dpH represents the change in pH.
In the graph, pH is plotted as a function of the number of moles of base (Cb) added to a weak...
2.5K
Mnemonic Devices01:23

Mnemonic Devices

469
Mnemonic devices are cognitive tools that facilitate memory retention by linking new information to familiar patterns or organizational strategies. These techniques are beneficial for remembering complex or lengthy sets of information by simplifying and structuring them in easily retrievable ways.
Acronyms
Acronyms are created by using the initial letters of a series of words to form a new word or phrase. This approach condenses complex information into a single, memorable entity. For example,...
469
Assembly of Cytoskeletal Filaments01:18

Assembly of Cytoskeletal Filaments

28.0K
Cytoskeletal filaments are polymeric forms of smaller protein subunits. However, individual cytoskeletal filaments may easily disassemble or associate with other similar filaments to form rigid structures. Microfilaments, made of actin monomers, rely on actin-binding proteins to form bundles and create networks of individual actin filaments. Microtubules rely on microtubule-associated proteins (MAPs) to form sturdy cylindrical structures. However, the proteins involved in forming complex...
28.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Distinct effects of aging and klotho deletion on the choroid plexus.

GeroScience·2026
Same author

Transcriptional interference revisited.

Nature genetics·2026
Same author

Erratum: Placental epigenetic clocks derived from crowdsourcing: Implications for the study of accelerated aging in obstetrics.

iScience·2026
Same author

The Biomarkers Consortium: 20 years of advancing precision medicine.

Nature reviews. Drug discovery·2026
Same author

Labeled dataset of X-ray protein ligand images in 3D point cloud and validated deep learning models.

Scientific data·2025
Same author

Pipeline Olympics: continuable benchmarking of computational workflows for DNA methylation sequencing data against an experimental gold standard.

Nucleic acids research·2025
Same journal

Haplotype-aware long-read error correction.

Algorithms for molecular biology : AMB·2026
Same journal

Extension of partial atom-to-atom maps: uniqueness and algorithms.

Algorithms for molecular biology : AMB·2026
Same journal

Lossless pangenome indexing using tag arrays.

Algorithms for molecular biology : AMB·2026
Same journal

Dolphyin: a combinatorial algorithm for identifying 1-Dollo phylogenies in cancer.

Algorithms for molecular biology : AMB·2026
Same journal

Probing transcription factor subsets in gene regulatory networks.

Algorithms for molecular biology : AMB·2026
Same journal

Comparing the ability of embedding methods on metabolic hypergraphs for capturing taxonomy-based features.

Algorithms for molecular biology : AMB·2026
See all related articles

Related Experiment Video

Updated: Feb 17, 2026

Assembly of Nucleosomal Arrays from Recombinant Core Histones and Nucleosome Positioning DNA
10:40

Assembly of Nucleosomal Arrays from Recombinant Core Histones and Nucleosome Positioning DNA

Published on: September 10, 2013

23.1K

Generalized enhanced suffix array construction in external memory.

Felipe A Louza1, Guilherme P Telles2, Steve Hoffmann3

  • 1Department of Computing and Mathematics, University of São Paulo, Av. Bandeirantes, 3900, Ribeirão Preto, 14040-901 Brazil.

Algorithms for Molecular Biology : AMB
|December 14, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces a novel external memory algorithm for constructing generalized suffix arrays and longest common prefix arrays. The algorithm demonstrates competitive performance on large datasets, enabling efficient string processing for collections exceeding internal memory capacity.

Keywords:
Burrows–Wheeler transformExternal memory algorithmsLCP arrayString collectionsSuffix array

More Related Videos

Formation of Biomembrane Microarrays with a Squeegee-based Assembly Method
07:56

Formation of Biomembrane Microarrays with a Squeegee-based Assembly Method

Published on: May 8, 2014

14.2K
Micro-drive Array for Chronic in vivo Recording: Tetrode Assembly
14:19

Micro-drive Array for Chronic in vivo Recording: Tetrode Assembly

Published on: April 22, 2009

34.2K

Related Experiment Videos

Last Updated: Feb 17, 2026

Assembly of Nucleosomal Arrays from Recombinant Core Histones and Nucleosome Positioning DNA
10:40

Assembly of Nucleosomal Arrays from Recombinant Core Histones and Nucleosome Positioning DNA

Published on: September 10, 2013

23.1K
Formation of Biomembrane Microarrays with a Squeegee-based Assembly Method
07:56

Formation of Biomembrane Microarrays with a Squeegee-based Assembly Method

Published on: May 8, 2014

14.2K
Micro-drive Array for Chronic in vivo Recording: Tetrode Assembly
14:19

Micro-drive Array for Chronic in vivo Recording: Tetrode Assembly

Published on: April 22, 2009

34.2K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Data Structures

Background:

  • Suffix arrays and augmented data structures are crucial for efficient string processing.
  • External memory construction of generalized suffix arrays is vital for large datasets exceeding internal memory.

Purpose of the Study:

  • To present and analyze a new external memory algorithm for constructing generalized suffix arrays augmented with the longest common prefix array.
  • To evaluate the algorithm's performance on large-scale string collections.

Main Methods:

  • The algorithm combines buffers, induced sorting, and a heap to avoid direct string comparisons.
  • External memory approach designed for datasets larger than available RAM.

Main Results:

  • The algorithm, [Formula: see text], shows competitive performance against existing methods on real datasets up to 24 GB.
  • Evaluated running time, efficiency, memory access, and optimization strategies.
  • Demonstrated the impact of operating system disk caching on performance.

Conclusions:

  • The algorithm is validated on diverse real-world datasets, proving its competitive performance.
  • The algorithm efficiently constructs generalized suffix arrays and longest common prefix arrays.
  • It can also construct the generalized Burrows-Wheeler transform with minimal overhead.