Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

15.3K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
15.3K
Parallel Processing01:20

Parallel Processing

839
The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...
839

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cell painting and thermal proteome profiling for inference of drug targets and mechanism of action.

Molecular systems biology·2026
Same author

Benign-by-design chemistry: Reinventing ligand-based drug design at the edge of AI.

Drug discovery today·2026
Same author

AI agents in drug discovery: applications and case studies.

Drug discovery today·2026
Same author

Counting cells can accurately predict small-molecule bioactivity benchmarks.

Nature communications·2026
Same author

Co-exposure to PFAS and hydroxylated PCBs is associated with increased odds of multiple sclerosis.

Environment international·2025
Same author

PKSmart: an open-source computational model to predict intravenous pharmacokinetics of small molecules.

Journal of cheminformatics·2025
Same journal

OpenStats: how to combine statistics and research data management (RDM) to leverage efficient scientific data analysis by guided statistics.

Journal of cheminformatics·2026
Same journal

Unified heterogeneity-aware benchmark of drug synergy prediction: a cross-study analysis of traditional machine learning and graph deep learning models.

Journal of cheminformatics·2026
Same journal

Count your bits: fingerprint benchmarking to assess broad chemical space representation.

Journal of cheminformatics·2026
Same journal

Sampling out-of-distribution chemical spaces via Bayesian flow.

Journal of cheminformatics·2026
Same journal

Hold on tight: the kinetic profiling of opioid receptor ligands using the CORAL-MD.

Journal of cheminformatics·2026
Same journal

Transformer-accelerated discovery of inhibitors targeting the RpsA<sub>Δ438</sub> deletion in PZA-resistant tuberculosis.

Journal of cheminformatics·2026
See all related articles

Related Experiment Video

Updated: Mar 6, 2026

Author Spotlight: Enhancing Cryo-Electron Microscopy by Automated Data Collection and Analysis Techniques
07:52

Author Spotlight: Enhancing Cryo-Electron Microscopy by Automated Data Collection and Analysis Techniques

Published on: December 1, 2023

1.6K

Large-scale virtual screening on public cloud resources with Apache Spark.

Marco Capuccini1,2, Laeeq Ahmed3, Wesley Schaal2

  • 1Department of Information Technology, Uppsala University, Box 337, 75105 Uppsala, Sweden.

Journal of Cheminformatics
|March 21, 2017
PubMed
Summary
This summary is machine-generated.

We developed a scalable method for structure-based virtual screening using Apache Spark. This approach efficiently screens large molecular libraries on cloud resources, achieving 87% parallel efficiency.

Keywords:
Apache SparkCloud computingDockingVirtual screening

More Related Videos

Large-Scale Screens of Metagenomic Libraries
16:05

Large-Scale Screens of Metagenomic Libraries

Published on: May 28, 2007

9.3K
High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE
13:28

High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE

Published on: May 16, 2017

51.0K

Related Experiment Videos

Last Updated: Mar 6, 2026

Author Spotlight: Enhancing Cryo-Electron Microscopy by Automated Data Collection and Analysis Techniques
07:52

Author Spotlight: Enhancing Cryo-Electron Microscopy by Automated Data Collection and Analysis Techniques

Published on: December 1, 2023

1.6K
Large-Scale Screens of Metagenomic Libraries
16:05

Large-Scale Screens of Metagenomic Libraries

Published on: May 28, 2007

9.3K
High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE
13:28

High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE

Published on: May 16, 2017

51.0K

Area of Science:

  • Computational chemistry
  • Bioinformatics

Background:

  • Structure-based virtual screening (SBVS) is an in-silico method for screening molecular libraries against target receptors.
  • Traditional docking-based screening of large libraries is computationally intensive.
  • MapReduce frameworks like Apache Spark offer scalable and fault-tolerant solutions for large-scale data analysis.

Purpose of the Study:

  • To develop and implement a MapReduce-based approach for parallelizing structure-based virtual screening.
  • To leverage distributed cloud resources for efficient and scalable molecular screening.

Main Methods:

  • Developed a method to run existing docking software on distributed cloud resources using Apache Spark.
  • Implemented the method, named Spark-VS, for parallel SBVS.
  • Benchmarked the method by docking a target receptor against 2.2 million compounds.

Main Results:

  • Achieved good parallel efficiency (87%) when running in a public cloud environment.
  • Demonstrated the scalability of the method for screening large molecular libraries.
  • Successfully docked a target receptor against a large compound library.

Conclusions:

  • The developed method enables parallel structure-based virtual screening on cloud or commodity clusters.
  • The scalability allows for initial testing on smaller libraries before scaling to larger ones.
  • Spark-VS is available as open-source software.