Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

The context-tree kernel for strings.

Marco Cuturi1, Jean-Philippe Vert

  • 1Computational Biology Group, Ecole des Mines de Paris, 35 rue Saint Honoré, 77300 Fontainebleau, France. marco.cuturi@ensmp.fr

Neural Networks : the Official Journal of the International Neural Network Society
|October 4, 2005
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Causal Inference Methods for Combining Randomized Trials and Observational Studies: A Review.

Statistical science : a review journal of the Institute of Mathematical Statistics·2025
Same author

Multicondition and multimodal temporal profile inference during mouse embryonic development.

Genome research·2025
Same author

Sceptic: pseudotime analysis for time-series single-cell sequencing and imaging data.

Genome biology·2025
Same author

Mapping cells through time and space with moscot.

Nature·2025
Same author

Multi-condition and multi-modal temporal profile inference during mouse embryonic development.

bioRxiv : the preprint server for biology·2024
Same author

LSMMD-MA: scaling multimodal data integration for single-cell genomics data analysis.

Bioinformatics (Oxford, England)·2023
Same journal

Dynamic analysis and reliable mechanical optimization application of ring HNN effected with a memristive neuron.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

DAFF-Net: A detection and search method for small-scale low surface brightness galaxies.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Quasi-synchronization for complex networks with hybrid pinning intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Physics-encoded convolutional neural operators for parametric PDEs: A convergence-guaranteed framework via pre-computed kernel fields.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

We developed a novel string kernel using information theory and data compression. This method efficiently classifies protein sequences, outperforming existing techniques without requiring biological expertise.

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Machine Learning

Background:

  • String kernels are crucial for machine learning tasks like sequence classification.
  • Existing methods may require extensive biological prior knowledge or computational resources.

Purpose of the Study:

  • To introduce a new string kernel leveraging information theory and data compression.
  • To enable efficient and accurate string classification, particularly in proteomics.
  • To demonstrate the kernel's effectiveness without relying on biological domain expertise.

Main Methods:

  • Developed a novel kernel for strings inspired by information theory and data compression.
  • Utilized a Bayesian averaging framework with conjugate priors on probabilistic suffix trees (context-trees).

Related Experiment Videos

  • Adapted the context-tree weighting algorithm for efficient kernel computation in linear time and space.
  • Main Results:

    • The proposed kernel was computed in linear time and space.
    • Demonstrated strong performance on a standard protein homology detection experiment.
    • Achieved competitive results compared to state-of-the-art methods.

    Conclusions:

    • The context-tree kernel offers an efficient and effective approach for string classification.
    • This method shows promise for applications in proteomics and other sequence analysis domains.
    • The kernel's ability to perform well without biological priors is a significant advantage.