Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Jun 6, 2026

Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods
05:34

Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods

Published on: June 6, 2025

The Text-mining based PubChem Bioassay neighboring analysis.

Lianyi Han1, Tugba O Suzek, Yanli Wang

  • 1National Center for Biotechnology Information, US National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894, USA.

BMC Bioinformatics
|November 10, 2010
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multi-omics characterization of the skin microbiota reveals the anti-aging roles of Stenotrophomonas maltophilia.

Microbiome·2026
Same author

MoleculeFormer is a GCN-transformer architecture for molecular property prediction.

Communications biology·2025
Same author

PersADE: a database of personalized adverse drug events and their underlying molecular mechanisms.

Nucleic acids research·2025
Same author

Dynamic clinical trial success rates for drugs in the 21st century.

Nature communications·2025
Same author

Microllm: a structured information extraction tool using large language models and named entity recognition in microbiology.

Briefings in bioinformatics·2025
Same author

A comprehensive reference catalog of human skin DNA virome reveals novel viral diversity and microenvironmental influences.

Microbiology spectrum·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

This study introduces a text-mining method to link related bioassays in PubChem, improving the discovery of biological activity relationships. The approach effectively identifies conceptual relevance among assays, complementing existing methods.

Area of Science:

  • Bioinformatics
  • Cheminformatics
  • Computational Biology

Background:

  • PubChem BioAssay database has rapidly grown, leading to challenges in integrating and interpreting vast amounts of structured and unstructured experimental data.
  • Efficiently assembling bioactivity data and understanding relationships across diverse bioassays is increasingly demanding.

Purpose of the Study:

  • To develop and evaluate a text-mining based approach for bioassay neighboring analysis using unstructured text descriptions from PubChem.
  • To identify and interpret relationships among bioassay experiments by mining textual information.

Main Methods:

  • Utilized text-mining techniques to analyze unstructured descriptions within the PubChem BioAssay database.
  • Employed cosine scores and overlap fractions to evaluate bioassay pairs and their curated neighbors.

More Related Videos

Biosensor-based High Throughput Biopanning and Bioinformatics Analysis Strategy for the Global Validation of Drug-protein Interactions
08:31

Biosensor-based High Throughput Biopanning and Bioinformatics Analysis Strategy for the Global Validation of Drug-protein Interactions

Published on: December 1, 2020

Related Experiment Videos

Last Updated: Jun 6, 2026

Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods
05:34

Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods

Published on: June 6, 2025

Biosensor-based High Throughput Biopanning and Bioinformatics Analysis Strategy for the Global Validation of Drug-protein Interactions
08:31

Biosensor-based High Throughput Biopanning and Bioinformatics Analysis Strategy for the Global Validation of Drug-protein Interactions

Published on: December 1, 2020

  • Performed cosine score distribution and assay neighbor clustering analyses.
  • Main Results:

    • Identified strong correlations among bioassays based on conceptual relevance derived from text mining.
    • Demonstrated that the text-mining approach provides meaningful linkages and identifies additional relationships compared to existing methods.
    • Clustering analysis revealed conceptual groupings of related bioassays.

    Conclusions:

    • Text-mining based bioassay neighboring analysis is efficient for correlating bioassays and studying biological processes.
    • This method complements existing procedures, especially when structured information or specific annotations are lacking.
    • The approach can be used independently or alongside other methods to integrate assay results and generate hypotheses for bioactivity discovery.