Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Mining, compressing and classifying with extensible motifs.

Alberto Apostolico1, Matteo Comin, Laxmi Parida

  • 1Dipartimento di Ingegneria dell'lnformazione, Università di Padova, Padova, Italy. axa@dei.unipd.it

Algorithms for Molecular Biology : AMB
|May 26, 2006
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Publisher Correction: Tumor transcriptional state predicts survival in immune-checkpoint-blockade-treated glioblastoma.

Nature cancer·2026
Same author

Predicting VNN resistance in European sea bass using machine learning on high dimensional low sample size data.

Frontiers in bioinformatics·2026
Same author

Quantum ensembling methods for healthcare and life science.

Briefings in bioinformatics·2026
Same author

Tumor transcriptional state predicts survival in immune-checkpoint-blockade-treated glioblastoma.

Nature cancer·2026
Same author

Fast Hashing of Spaced Seeds with DuoHash.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same author

Remics: a redescription-based framework for multi-omics analysis.

Frontiers in cell and developmental biology·2026
Same journal

Haplotype-aware long-read error correction.

Algorithms for molecular biology : AMB·2026
Same journal

Extension of partial atom-to-atom maps: uniqueness and algorithms.

Algorithms for molecular biology : AMB·2026
Same journal

Lossless pangenome indexing using tag arrays.

Algorithms for molecular biology : AMB·2026
Same journal

Dolphyin: a combinatorial algorithm for identifying 1-Dollo phylogenies in cancer.

Algorithms for molecular biology : AMB·2026
Same journal

Probing transcription factor subsets in gene regulatory networks.

Algorithms for molecular biology : AMB·2026
Same journal

Comparing the ability of embedding methods on metabolic hypergraphs for capturing taxonomy-based features.

Algorithms for molecular biology : AMB·2026
See all related articles

Extensible motifs, a novel pattern class, enhance data compression and biological sequence classification. These flexible patterns improve codebook size and support unsupervised learning for structural inference.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Data Compression

Background:

  • Motif patterns of maximal saturation are crucial for biomolecular sequence discovery and data compression.
  • Traditional motif discovery can be computationally intensive.
  • Specialized
  • rigid
  • motifs offer efficient discovery in low polynomial time.

Purpose of the Study:

  • To introduce and explore the properties of "extensible" motifs.
  • To demonstrate the utility of extensible motifs in data compression and biological sequence analysis.

Main Methods:

  • Consideration of "extensible" motifs with elastic gap sequences.
  • Application of extensible motifs in data compression via textual substitution.

Related Experiment Videos

  • Utilizing extensible motifs for structural inference and classification.
  • Main Results:

    • Extensible motifs improve compression by reducing codebook size.
    • These motifs facilitate unsupervised classification and phylogeny reconstruction.
    • The elasticity of extensible motifs allows fitting variable-length sequence segments.

    Conclusions:

    • Off-line compression using extensible motifs offers advantages for biological sequence compression.
    • Extensible motifs provide a powerful tool for classifying biological sequences.
    • This approach supports structural inference and phylogenetic analysis.