Lossless Approximate Pattern Matching: Automated Design of Efficient Search Schemes.

Luca Renders¹, Lore Depuydt¹, Sven Rahmann²

¹Internet Technology and Data Science Lab, Ghent University, Ghent, Belgium.

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology

September 30, 2024

Summary

This summary is machine-generated.

This study automates the design of search schemes for lossless approximate pattern matching, making searches that allow up to k=7 errors markedly more efficient. The accompanying read-mapper, Columba, maps reads faster than existing search strategies and, being lossless, more comprehensively than a lossy tool.

Keywords:

approximate pattern matching; integer linear program; search schemes; sequence alignment

Area of Science:

Bioinformatics
Computational Biology
Algorithm Design

Background:

Approximate pattern matching is crucial for sequence analysis, but designing efficient search schemes for higher error tolerances (k > 4) is computationally intensive.
Existing methods struggle with scalability and efficiency when handling increased error rates in pattern matching.

Purpose of the Study:

To develop an automated and efficient method for generating search schemes for lossless approximate pattern matching up to k=7 errors.
To introduce a novel software tool, Columba, that implements these advanced search schemes for high-performance read mapping.

Main Methods:

Integration of a greedy algorithm and a novel Integer Linear Programming (ILP) formulation for automated search scheme design.
Development of Hato, an open-source tool for generating search schemes, and Columba 1.2, an open-source lossless read-mapper.
Dynamic scheme selection technique to further optimize efficiency based on specific search patterns.
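
The summary does not show how a search scheme is represented or why it is lossless, so the following is only a minimal sketch under a commonly used formalization, which is an assumption here and not taken from this page: the pattern is split into parts, and each search is a triple (pi, lower, upper), where pi is the order in which parts are processed and lower[i] and upper[i] bound the cumulative number of errors after the i-th processed part. Under that assumption, a scheme is lossless for k errors if every way of distributing at most k errors over the parts is covered by at least one search. The function names and the two example schemes (a two-part scheme for k=1 and the classical pigeonhole scheme) are illustrative and are not the schemes generated by Hato.

```python
from itertools import product


def covers(search, errors_per_part):
    """Return True if one search (pi, lower, upper) covers an error distribution."""
    pi, lower, upper = search
    total = 0
    for step, part in enumerate(pi):
        total += errors_per_part[part]  # cumulative errors after this step
        if not (lower[step] <= total <= upper[step]):
            return False
    return True


def error_distributions(num_parts, k):
    """Yield every way to place at most k errors over num_parts parts."""
    for dist in product(range(k + 1), repeat=num_parts):
        if sum(dist) <= k:
            yield dist


def uncovered(scheme, num_parts, k):
    """List the error distributions that no search in the scheme covers."""
    return [d for d in error_distributions(num_parts, k)
            if not any(covers(s, d) for s in scheme)]


def is_lossless(scheme, num_parts, k):
    """A scheme is lossless for k errors if nothing is left uncovered."""
    return not uncovered(scheme, num_parts, k)


def pigeonhole_scheme(k):
    """Build k + 1 searches over k + 1 parts (illustrative baseline).

    Search i starts at part i, which must match exactly, then extends
    right to the last part and finally left to the first part.
    """
    p = k + 1
    scheme = []
    for i in range(p):
        pi = list(range(i, p)) + list(range(i - 1, -1, -1))
        lower = [0] * p
        upper = [0] + [k] * (p - 1)
        scheme.append((pi, lower, upper))
    return scheme


# Hypothetical two-part scheme for k = 1 (parts are 0-indexed).
scheme_k1 = [
    ([0, 1], [0, 0], [0, 1]),  # part 0 exact, then at most 1 error in part 1
    ([1, 0], [0, 1], [0, 1]),  # part 1 exact, then exactly 1 error in part 0
]

print(is_lossless(scheme_k1, num_parts=2, k=1))
print(is_lossless(pigeonhole_scheme(3), num_parts=4, k=3))
print(uncovered(scheme_k1[:1], num_parts=2, k=1))
```

A check of this kind only establishes losslessness. It says nothing about efficiency, which is what the greedy and ILP-based design described above aims to optimize.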

Main Results:

Achieved efficient search schemes for up to k=7 errors, outperforming existing strategies in theoretical and practical analyses.
Columba 1.2 demonstrates superior performance, mapping 100,000 Illumina reads (150 bp) with k=6 in 75 seconds and k=7 in 2.25 hours.
Runtime reductions of up to 53% for higher k values and a four-fold higher mapping rate compared to a lossy tool.

Conclusions:

The proposed ILP-based approach and dynamic scheme selection significantly enhance the efficiency of approximate pattern matching.
Columba 1.2 represents a state-of-the-art lossless read-mapper, offering unprecedented speed and accuracy for high-throughput sequencing data analysis.