Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

45
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
45
Frequency-dependent Selection01:21

Frequency-dependent Selection

21.9K
When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.
21.9K
Upsampling01:22

Upsampling

209
Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...
209

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Integrated flexible DNA methylation-chromatin segmentation modeling enhances epigenomic state annotation.

Nucleic acids research·2026
Same author

KeBaB: <i>k</i>-mer based breaking for finding long MEMs.

International Symposium on String Processing and Information Retrieval : SPIRE ... : proceedings. SPIRE (Symposium)·2025
Same author

Cleanifier: contamination removal from microbial sequences using spaced seeds of a human pangenome index.

Bioinformatics (Oxford, England)·2025
Same author

b-move: faster lossless approximate pattern matching in a run-length compressed index.

Algorithms for molecular biology : AMB·2025
Same author

A comprehensive review and evaluation of species richness estimation.

Briefings in bioinformatics·2025
Same author

Run-length compressed metagenomic read classification with SMEM-finding and tagging.

bioRxiv : the preprint server for biology·2025
Same journal

GMSA: A Graph Matching and Point Cloud Registration-Based Method for Spatial Transcriptomics Data Alignment.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Investigations on Multiple Protein Scaffold Filling.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Cell Type Prediction for Single-Cell RNA Sequencing Utilizing Unsupervised Domain Adaptation and Semi-Supervised Learning.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

PPIGAN: Prediction of Protein-Protein Interactions Using Generative Adversarial Networks.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Deep Structure-Enhanced Cell Clustering Model for Single-Cell RNA Sequencing Data.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Asymmetric Drug-Drug Interaction Prediction Based on Generative Adversarial Networks and Knowledge Graph.

Journal of computational biology : a journal of computational molecular cell biology·2026
See all related articles

Related Experiment Video

Updated: Jun 11, 2025

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

17.0K

Lossless Approximate Pattern Matching: Automated Design of Efficient Search Schemes.

Luca Renders1, Lore Depuydt1, Sven Rahmann2

  • 1Internet Technology and Data Science Lab, Ghent University, Ghent, Belgium.

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology
|September 30, 2024
PubMed
Summary
This summary is machine-generated.

This study automates search scheme creation for approximate pattern matching, significantly improving efficiency for higher error rates (k=7). The new tool, Columba, offers faster and more comprehensive read mapping than existing methods.

Keywords:
approximate pattern matchinginteger linear programsearch schemessequence alignment

More Related Videos

Pattern-based Search of Epigenomic Data Using GeNemo
06:38

Pattern-based Search of Epigenomic Data Using GeNemo

Published on: October 8, 2017

5.1K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.0K

Related Experiment Videos

Last Updated: Jun 11, 2025

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

17.0K
Pattern-based Search of Epigenomic Data Using GeNemo
06:38

Pattern-based Search of Epigenomic Data Using GeNemo

Published on: October 8, 2017

5.1K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.0K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Algorithm Design

Background:

  • Approximate pattern matching is crucial for sequence analysis, but designing efficient search schemes for higher error tolerances (k > 4) is computationally intensive.
  • Existing methods struggle with scalability and efficiency when handling increased error rates in pattern matching.

Purpose of the Study:

  • To develop an automated and efficient method for generating search schemes for lossless approximate pattern matching up to k=7 errors.
  • To introduce a novel software tool, Columba, that implements these advanced search schemes for high-performance read mapping.

Main Methods:

  • Integration of a greedy algorithm and a novel Integer Linear Programming (ILP) formulation for automated search scheme design.
  • Development of Hato, an open-source tool for generating search schemes, and Columba 1.2, an open-source lossless read-mapper.
  • Dynamic scheme selection technique to further optimize efficiency based on specific search patterns.

Main Results:

  • Achieved efficient search schemes for up to k=7 errors, outperforming existing strategies in theoretical and practical analyses.
  • Columba 1.2 demonstrates superior performance, mapping 100,000 Illumina reads (150 bp) with k=6 in 75 seconds and k=7 in 2.25 hours.
  • Runtime reductions of up to 53% for higher k values and a four-fold higher mapping rate compared to a lossy tool.

Conclusions:

  • The proposed ILP-based approach and dynamic scheme selection significantly enhance the efficiency of approximate pattern matching.
  • Columba 1.2 represents a state-of-the-art lossless read-mapper, offering unprecedented speed and accuracy for high-throughput sequencing data analysis.