Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Mining protein sequences for motifs.

Giri Narasimhan1, Changsong Bu, Yuan Gao

  • 1Department of Mathematical Sciences, The University of Memphis, Memphis, TN 38152, USA. giri@cs.fiu.edu

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology
|December 19, 2002
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

EMoMiS: A pipeline for epitope-based molecular mimicry search in protein structures with potential applications to SARS-CoV-2.

Computational and structural biotechnology journal·2026
Same author

MEditome: Computational Detection of RNA Edit Sites Using <i>de Novo</i> Assembly in Microbiomes.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same author

A comprehensive survey of scoring functions for protein docking models.

BMC bioinformatics·2025
Same author

Unfolding and de-confounding: biologically meaningful causal inference from longitudinal multi-omic networks using METALICA.

mSystems·2024
Same author

A molecular index for biological age identified from the metabolome and senescence-associated secretome in humans.

Aging cell·2024
Same author

Differences in gut microbiota between Dutch and South-Asian Surinamese: potential implications for type 2 diabetes mellitus.

Scientific reports·2024
Same journal

GMSA: A Graph Matching and Point Cloud Registration-Based Method for Spatial Transcriptomics Data Alignment.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Investigations on Multiple Protein Scaffold Filling.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Cell Type Prediction for Single-Cell RNA Sequencing Utilizing Unsupervised Domain Adaptation and Semi-Supervised Learning.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

PPIGAN: Prediction of Protein-Protein Interactions Using Generative Adversarial Networks.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Deep Structure-Enhanced Cell Clustering Model for Single-Cell RNA Sequencing Data.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Asymmetric Drug-Drug Interaction Prediction Based on Generative Adversarial Networks and Knowledge Graph.

Journal of computational biology : a journal of computational molecular cell biology·2026
See all related articles

We developed GYM, a novel algorithm using data mining to detect protein motifs. It builds a pattern dictionary from training sequences to identify motifs like Helix-Turn-Helix and Homeodomain in new proteins.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Data Mining

Background:

  • Protein motifs are crucial for understanding protein function and evolution.
  • Existing methods for motif detection can be limited in scope and accuracy.

Purpose of the Study:

  • To design and implement a novel algorithm for detecting protein motifs using data mining techniques.
  • To create a program, GYM, capable of identifying specific protein motifs and providing sequence information.

Main Methods:

  • Developed an algorithm based on Data Mining and Knowledge Discovery principles.
  • Compiled a "pattern dictionary" of residue combinations from aligned training protein sequences.
  • Utilized statistical methods to determine algorithm parameters and ensure detection significance.

Related Experiment Videos

Main Results:

  • Implemented the GYM program for motif detection.
  • Successfully tested GYM on Helix-Turn-Helix and Homeodomain motifs.
  • GYM's detection results favorably compare with existing motif detection programs.
  • GYM provides additional useful information about protein sequences.

Conclusions:

  • The GYM algorithm effectively detects protein motifs by learning patterns from training data.
  • GYM offers a robust and informative tool for protein sequence analysis.
  • The data mining approach provides a statistically sound basis for motif discovery.