Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Are grammatical representations useful for learning from biological sequence data?--a case study.

S H Muggleton1, C H Bryant, A Srinivasan

  • 1Department of Computer Science, University of York, York YO10 5DD, United Kingdom.

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology
|November 6, 2001
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Integrating multimodal prehabilitation into enhanced recovery after surgery programs (MPhERAS) for elderly patients: a systematic review and meta-analysis of randomized controlled trials and cohort studies.

Techniques in coloproctology·2025
Same author

<i>Meso</i>-Modified Porphyrinoids: Stimuli-Driven Conformational Plasticity, Modulation of π-Conjugation, and Bis-Rh(I) Complexation.

Inorganic chemistry·2025
Same author

Nonalternant Acenaphthylene-Fused Carbaporphyrins: Topological Modulation of π-Delocalization, (Anti)aromaticity, and B<sup>III</sup> Coordination.

Organic letters·2025
Same author

Siamese Twin Homocarbaporphyrin Dimer: Nonplanar Architecture, Localized Conjugation and Bis-B<sup>III</sup> Complex.

Chemistry, an Asian journal·2025
Same author

<i>E</i>-Stilbene-Imprinted Cyclic Diphyrin and Its B<sup>III</sup> Complex.

Organic letters·2025
Same author

Ectopic expression of the cation-chloride cotransporter KCC2 in blood exosomes as a biomarker for functional rehabilitation.

Frontiers in molecular neuroscience·2025
Same journal

GMSA: A Graph Matching and Point Cloud Registration-Based Method for Spatial Transcriptomics Data Alignment.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Investigations on Multiple Protein Scaffold Filling.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Cell Type Prediction for Single-Cell RNA Sequencing Utilizing Unsupervised Domain Adaptation and Semi-Supervised Learning.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

PPIGAN: Prediction of Protein-Protein Interactions Using Generative Adversarial Networks.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Deep Structure-Enhanced Cell Clustering Model for Single-Cell RNA Sequencing Data.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Asymmetric Drug-Drug Interaction Prediction Based on Generative Adversarial Networks and Knowledge Graph.

Journal of computational biology : a journal of computational molecular cell biology·2026
See all related articles

This study demonstrates that Chomsky-like grammar representations, learned using Inductive Logic Programming (ILP), significantly improve the efficiency of identifying human neuropeptide precursors (NPPs). This bioinformatics approach offers over 100x greater efficiency than random methods.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Machine Learning

Background:

  • Existing bioinformatics methods lack cost-efficiency for identifying novel human neuropeptide precursors (NPPs).
  • Hand-coding a grammar for NPP recognition by specialists proved unsuccessful.
  • A need exists for improved, cost-effective methods in biological sequence family analysis.

Purpose of the Study:

  • To investigate the utility of Chomsky-like grammar representations for learning cost-effective biological sequence predictors.
  • To develop a novel method for recognizing human neuropeptide precursors (NPPs) using Inductive Logic Programming (ILP).
  • To enhance the efficiency of discovering new NPPs.

Main Methods:

  • Utilized the Inductive Logic Programming (ILP) Bayesian approach for learning from positive examples to generate a biological grammar.

Related Experiment Videos

  • Derived features from the ILP-generated grammar and other learning strategies.
  • Developed recognition models using C4.5 and C4.5rules, evaluating performance with predictive accuracy and a novel cost function, Relative Advantage (RA).
  • Main Results:

    • The best predictor, incorporating grammar-derived features, achieved over 100 times greater efficiency in searching for novel NPPs compared to random selection.
    • Models including grammar-derived features demonstrated significantly higher Relative Advantage (RA) than those without.
    • Predictive accuracy was found to be an inadequate performance measure for this specific domain due to class imbalance.

    Conclusions:

    • Chomsky-like grammar representations are effective for learning cost-effective predictors in biological sequence families.
    • The ILP Bayesian approach represents a novel and successful application for biological grammar induction.
    • Grammar-derived features significantly enhance the performance and cost-effectiveness of NPP recognition models.