Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genetic Screens02:46

Genetic Screens

Genetic screens are tools used to identify genes and mutations responsible for phenotypes of interest. Genetic screens help identify individuals or a group of people at risk of developing  genetic diseases and help them with early intervention, targeted therapy, and reproductive options.
Forward genetic screens
Forward or “classical” genetic screens involve creating random mutations in an organism’s DNA using radiation, mutagens, or insertion of additional bases, which result in visible changes...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Distinctive Behaviors of Druggable Proteins in Cellular Networks.

PLoS computational biology·2015
Same author

canSAR: an updated cancer research and drug discovery knowledgebase.

Nucleic acids research·2015
Same author

Therapeutic opportunities within the DNA damage response.

Nature reviews. Cancer·2015
Same author

canSAR: updated cancer research and drug discovery knowledgebase.

Nucleic acids research·2013
Same author

Hybrid genetic algorithm-neural network: feature extraction for unpreprocessed microarray data.

Artificial intelligence in medicine·2011
Same author

Overhauling the PDB.

Nature biotechnology·2007
Same journal

OpenStats: how to combine statistics and research data management (RDM) to leverage efficient scientific data analysis by guided statistics.

Journal of cheminformatics·2026
Same journal

Unified heterogeneity-aware benchmark of drug synergy prediction: a cross-study analysis of traditional machine learning and graph deep learning models.

Journal of cheminformatics·2026
Same journal

Count your bits: fingerprint benchmarking to assess broad chemical space representation.

Journal of cheminformatics·2026
Same journal

Sampling out-of-distribution chemical spaces via Bayesian flow.

Journal of cheminformatics·2026
Same journal

Hold on tight: the kinetic profiling of opioid receptor ligands using the CORAL-MD.

Journal of cheminformatics·2026
Same journal

Transformer-accelerated discovery of inhibitors targeting the RpsA<sub>Δ438</sub> deletion in PZA-resistant tuberculosis.

Journal of cheminformatics·2026
See all related articles

Related Experiment Video

Updated: Jun 16, 2026

Screening for Endocrine Activity in Water Using Commercially-available In Vitro Transactivation Bioassays
08:00

Screening for Endocrine Activity in Water Using Commercially-available In Vitro Transactivation Bioassays

Published on: December 4, 2016

Virtual screening of bioassay data.

Amanda C Schierz1

  • 1Smart Technology Research Centre, Bournemouth University, Poole House, Talbot Campus, Poole, Dorset, BH12 5BB, UK.

Journal of Cheminformatics
|February 13, 2010
PubMed
Summary
This summary is machine-generated.

Virtual screening of bioassay data faces challenges with data access, high false positive rates, and imbalanced datasets. Cost-sensitive classifiers like SVM and C4.5 show promise, but careful parameter tuning is essential.

More Related Videos

High-throughput Screening for Chemical Modulators of Post-transcriptionally Regulated Genes
09:44

High-throughput Screening for Chemical Modulators of Post-transcriptionally Regulated Genes

Published on: March 3, 2015

A Microscopic Phenotypic Assay for the Quantification of Intracellular Mycobacteria Adapted for High-throughput/High-content Screening
15:28

A Microscopic Phenotypic Assay for the Quantification of Intracellular Mycobacteria Adapted for High-throughput/High-content Screening

Published on: January 17, 2014

Related Experiment Videos

Last Updated: Jun 16, 2026

Screening for Endocrine Activity in Water Using Commercially-available In Vitro Transactivation Bioassays
08:00

Screening for Endocrine Activity in Water Using Commercially-available In Vitro Transactivation Bioassays

Published on: December 4, 2016

High-throughput Screening for Chemical Modulators of Post-transcriptionally Regulated Genes
09:44

High-throughput Screening for Chemical Modulators of Post-transcriptionally Regulated Genes

Published on: March 3, 2015

A Microscopic Phenotypic Assay for the Quantification of Intracellular Mycobacteria Adapted for High-throughput/High-content Screening
15:28

A Microscopic Phenotypic Assay for the Quantification of Intracellular Mycobacteria Adapted for High-throughput/High-content Screening

Published on: January 17, 2014

Area of Science:

  • Computational chemistry
  • Cheminformatics
  • Machine learning in drug discovery

Background:

  • Challenges in virtual screening of bioassay data include limited access to curated datasets, a high rate of false positives in primary screening, and significant class imbalance (few active compounds vs. many inactive ones).
  • Pharmaceutical data accessibility is restricted, with PubChem data lacking curation and cross-referencing between primary and confirmatory assays.
  • Analysis of false positives in primary screening is hampered by poor data cross-referencing, though identified cases show an average of 64% false positives in High-Throughput Primary screening.

Purpose of the Study:

  • To discuss the key challenges in virtual screening of bioassay data.
  • To evaluate the performance of various Weka cost-sensitive classifiers (Naive Bayes, SVM, C4.5, Random Forest) on diverse bioassay datasets.
  • To investigate the impact of classifier choice and cost matrix settings on classification performance.

Main Methods:

  • Application of cost-sensitive classification algorithms available in the Weka software suite.
  • Utilizing Naive Bayes, Support Vector Machine (SVM), C4.5 decision tree, and Random Forest classifiers.
  • Testing classifiers on multiple bioassay datasets with varying characteristics.

Main Results:

  • Weka's Support Vector Machine (SVM) and C4.5 decision tree implementations demonstrated relatively strong performance in cost-sensitive classification tasks.
  • The optimal configuration of the Weka cost matrix is dependent on the specific base classifier being used, rather than solely on the ratio of class imbalance.
  • The high percentage of false positives in primary screening raises questions about the suitability of such data for virtual screening.

Conclusions:

  • Enhanced accessibility of curated pharmaceutical screening data is crucial for both industry and academia.
  • Virtual screening can benefit drug discovery by reducing the search space and improving primary screening processes through analysis of false positives.
  • Care must be taken when applying Weka's cost-sensitive classifiers; using generic misclassification costs based on class ratios is not recommended for comparing different classifiers on the same dataset.