Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

A graph-based clustering method for a large set of sequences using a graph partitioning algorithm.

H Kawaji1, Y Yamaguchi, H Matsuda

  • 1Department of Informatics and Mathematical Science, Graduate School of Engineering Science, Osaka University, 1-3 Machikaneyama, Toyonaka, Osaka 560-8531, Japan. kawaji@ics.es.osaka-u.ac.jp

Genome Informatics. International Conference on Genome Informatics
|January 16, 2002
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

[Study on the "signal" constituents for the evaluation of animal crude drugs. VI. The identification method of cervi parvum cornu in medicines using DNA analysis].

Yakugaku zasshi : Journal of the Pharmaceutical Society of Japan·1999
Same author

PCR detection of DNA specific for Trichosporon species in serum of patients with disseminated trichosporonosis.

Journal of clinical microbiology·1999
Same author

16O excesses in olivine inclusions in Yamato-86009 and Murchison chondrites and their relation to CAIs.

Science (New York, N.Y.)·1999
Same author

4-[3,5-Bis(trimethylsilyl)benzamido] benzoic acid (TAC-101) inhibits the intrahepatic spread of hepatocellular carcinoma and prolongs the life-span of tumor-bearing animals.

Clinical & experimental metastasis·1999
Same author

Endoscopic ultrasonography of the pancreas in the dog.

Veterinary radiology & ultrasound : the official journal of the American College of Veterinary Radiology and the International Veterinary Radiology Association·1998
Same author

Endoscopic ultrasonographic findings of the pancreas after pancreatic duct ligation in the dog.

Veterinary radiology & ultrasound : the official journal of the American College of Veterinary Radiology and the International Veterinary Radiology Association·1998
Same journal

Linear regression models predicting strength of transcriptional activity of promoters.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

Sign: large-scale gene network estimation environment for high performance computing.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

Docking-calculation-based method for predicting protein-RNA interactions.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

Mechanism of cell cycle disruption by multiple p53 pulses.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

Database for crude drugs and Kampo medicine.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

A dynamic programming algorithm to predict synthesis processes of tree-structured compounds with graph grammar.

Genome informatics. International Conference on Genome Informatics·2011
See all related articles

This study introduces a novel graph-based method for protein sequence clustering, outperforming traditional single linkage clustering. The new approach enhances protein family classification accuracy, improving upon existing methods.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Protein sequence clustering is crucial for understanding protein function and evolution.
  • Conventional methods like single linkage clustering have limitations in accuracy and efficiency.
  • Existing protein family databases require robust computational methods for accurate classification.

Purpose of the Study:

  • To develop an improved graph-based clustering method for protein sequences.
  • To enhance the accuracy of protein family classification compared to single linkage clustering.
  • To validate the method's effectiveness using mouse proteomes and InterPro families.

Main Methods:

  • Formulating protein sequence clustering as a graph partitioning problem.
  • Constructing a weighted linkage graph where vertices are sequences and edges represent high similarities.

Related Experiment Videos

  • Assigning edge weights based on sequence similarity scores.
  • Comparing clustering results with established InterPro families.
  • Main Results:

    • The proposed graph-based method significantly improves cluster quality over single linkage clustering.
    • The method achieved a high concordance with InterPro protein families.
    • Specifically, 77% of proteins within InterPro families were accurately classified into appropriate clusters.

    Conclusions:

    • The graph-based clustering approach offers a more effective strategy for protein family identification.
    • This method provides a valuable tool for bioinformatics research and protein family database curation.
    • The enhanced accuracy has implications for functional genomics and evolutionary studies.