Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

BAG: a graph theoretic sequence clustering algorithm.

Sun Kim1, Jason Lee

  • 1School of Informatics, Center for Genomics and Bioinformatics, Indiana University, Bloomington, IN 47408, USA. sunkim2@indiana.edu

International Journal of Data Mining and Bioinformatics
|April 11, 2008
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Ultrasound-guided hydrodissection of the subclavius muscle region in neurogenic thoracic outlet syndrome: a case report.

Frontiers in medicine·2026
Same author

Assessing the Durability of Selective Median Lobe Holmium Laser Enucleation of the Prostate: A Retrospective Cohort Study.

Urology·2026
Same author

Policies for the Inclusion of Pregnant Women in Research: Results from an International and a US Study.

Ethics & human research·2025
Same author

Risk of metachronous upper tract urothelial carcinoma following non-muscle-invasive bladder cancer.

BJU international·2025
Same author

Femoral Nerve Palsy After Anterior Total Hip Arthroplasty: A Systematic Review.

Arthroplasty today·2025
Same author

Identifying barriers and facilitators to reporting and disclosing sexual violence (SV) for undergraduate students in Canadian and American universities: a scoping review protocol.

Systematic reviews·2025
Same journal

DiffGRN: differential gene regulatory network analysis.

International journal of data mining and bioinformatics·2019
Same journal

Integration of multi-omics data for integrative gene regulatory network inference.

International journal of data mining and bioinformatics·2018
Same journal

The development of non-coding RNA ontology.

International journal of data mining and bioinformatics·2016
Same journal

Learning multiple distributed prototypes of semantic categories for named entity recognition.

International journal of data mining and bioinformatics·2015
Same journal

Weighted fusion regularisation and predicting microbial interactions with vector autoregressive model.

International journal of data mining and bioinformatics·2015
Same journal

Application of consensus string matching in the diagnosis of allelic heterogeneity involving transposition mutation.

International journal of data mining and bioinformatics·2015
See all related articles

This study introduces BAG, a novel algorithm for clustering biological sequences. BAG accurately clusters large datasets while significantly reducing fragmentation using graph properties and a new cluster utility metric.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Graph Theory

Background:

  • Clustering biological sequences is crucial for understanding genomic data.
  • Existing methods face challenges with fragmentation and scalability.
  • Graph properties offer potential for improved sequence clustering.

Purpose of the Study:

  • To design a novel sequence clustering algorithm, BAG, addressing fragmentation issues.
  • To leverage graph properties for accurate and efficient biological sequence clustering.
  • To introduce a new metric, cluster utility, for guiding cluster splitting.

Main Methods:

  • Developed the BAG (Biological sequence clustering Algorithm using Graph properties) algorithm.
  • Utilized graph properties: biconnectedness, articulation points, and pquasi-completeness.

Related Experiment Videos

  • Implemented a novel 'cluster utility' metric for guided cluster splitting and merging.
  • Main Results:

    • BAG demonstrated accurate clustering of large sequence datasets (e.g., COG database).
    • The algorithm significantly reduced the number of fragmented clusters.
    • Effective merging strategies were employed after initial splitting.

    Conclusions:

    • BAG provides an accurate and efficient solution for biological sequence clustering.
    • The use of graph properties and cluster utility metric mitigates fragmentation.
    • The algorithm shows promise for large-scale genomic data analysis.