Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

7.2K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
7.2K
Globular and Fibrous Proteins02:21

Globular and Fibrous Proteins

48.2K
Many proteins can be classified into two distinct subtypes - globular or fibrous. These two types differ in their shapes and solubilities.
Globular proteins are also known as spheroproteins and typically are approximately round in shape. They contain a mix of amino acid types and contain differing sequences in their primary structures. Globular proteins have many different functions, such as enzymes, cellular messengers, and molecular transporters. These roles often require the proteins to be...
48.2K
Causes of Similarity-Dissimilarity Effect01:26

Causes of Similarity-Dissimilarity Effect

315
The similarity-dissimilarity effect, a fundamental concept in social psychology, explains how interpersonal similarities and differences influence attraction and social interactions. This effect is supported by three key psychological perspectives: balance theory, social comparison theory, and consensual validation.Balance Theory and Cognitive ConsistencyBalance theory, developed by Fritz Heider, posits that individuals seek cognitive consistency in their relationships. When two people share...
315
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

819
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
819
Gene Families01:57

Gene Families

10.2K
Gene families consist of groups of genes proposed to have originated from a common ancestor. Typically these arise through events in which a gene or genes are mistakenly duplicated during cell division. Unlike their parent genes (which are subject to selection pressure to maintain function), these gene copies do not need to preserve their sequences and may evolve at a relatively faster rate.
Occasionally these regions can be adapted to take on new roles within the organism, becoming novel genes...
10.2K
IR Frequency Region: Fingerprint Region01:03

IR Frequency Region: Fingerprint Region

2.2K
IR spectra are divided into two main regions: the diagnostic region and the fingerprint region. The diagnostic region of the spectrum lies above 1500 cm−1. The absorptions resulting from single-bond vibrations of the N–H, C–H, and O–H stretch at higher wavenumbers and appear on the left side of the spectrum. The stretching absorptions of the C≡C and C≡N occur between 2100–2300 cm−1. In contrast, those arising from stretching absorptions of the...
2.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Defining peptides in ChEBI.

Journal of cheminformatics·2026
Same author

ANNalog: generation of MedChem-similar molecules.

Journal of cheminformatics·2026
Same author

Correction: Integrating artificial intelligence and manual curation to enhance bioassay annotations in ChEMBL.

Journal of cheminformatics·2026
Same author

Integrating artificial intelligence and manual curation to enhance bioassay annotations in ChEMBL.

Journal of cheminformatics·2026
Same author

ChEBI: re-engineered for a sustainable future.

Nucleic acids research·2025
Same author

EMBL's European Bioinformatics Institute (EMBL-EBI) in 2025.

Nucleic acids research·2025
Same journal

Multimodal feature fusion for molecular property classification.

Journal of cheminformatics·2026
Same journal

P2MAT: A machine learning (ML) driven software for Property Prediction of MATerial.

Journal of cheminformatics·2026
Same journal

Computational design of low-volatility lubricants for space using interpretable machine learning.

Journal of cheminformatics·2026
Same journal

OpenStats: how to combine statistics and research data management (RDM) to leverage efficient scientific data analysis by guided statistics.

Journal of cheminformatics·2026
Same journal

Unified heterogeneity-aware benchmark of drug synergy prediction: a cross-study analysis of traditional machine learning and graph deep learning models.

Journal of cheminformatics·2026
Same journal

Count your bits: fingerprint benchmarking to assess broad chemical space representation.

Journal of cheminformatics·2026
See all related articles

Related Experiment Video

Updated: Mar 18, 2026

Chromatographic Fingerprinting by Template Matching for Data Collected by Comprehensive Two-Dimensional Gas Chromatography
10:14

Chromatographic Fingerprinting by Template Matching for Data Collected by Comprehensive Two-Dimensional Gas Chromatography

Published on: September 2, 2020

5.5K

Comparing structural fingerprints using a literature-based similarity benchmark.

Noel M O'Boyle1, Roger A Sayle1

  • 1NextMove Software, Innovation Centre, Cambridge Science Park, Milton Road, Cambridge, CB4 0EY UK.

Journal of Cheminformatics
|July 7, 2016
PubMed
Summary
This summary is machine-generated.

This study defines molecular similarity for drug discovery, creating benchmark datasets from medicinal chemistry literature. These datasets evaluate molecular fingerprints, identifying top performers for ranking diverse structures and close analogues.

Keywords:
Molecular fingerprintsSimilarity benchmarkSimilarity searchingStructural similarity

More Related Videos

Comparing Bibliometric Analysis Using PubMed, Scopus, and Web of Science Databases
05:02

Comparing Bibliometric Analysis Using PubMed, Scopus, and Web of Science Databases

Published on: October 24, 2019

34.2K
An Analytical Tool-box for Comprehensive Biochemical, Structural and Transcriptome Evaluation of Oral Biofilms Mediated by Mutans Streptococci
11:09

An Analytical Tool-box for Comprehensive Biochemical, Structural and Transcriptome Evaluation of Oral Biofilms Mediated by Mutans Streptococci

Published on: January 25, 2011

18.2K

Related Experiment Videos

Last Updated: Mar 18, 2026

Chromatographic Fingerprinting by Template Matching for Data Collected by Comprehensive Two-Dimensional Gas Chromatography
10:14

Chromatographic Fingerprinting by Template Matching for Data Collected by Comprehensive Two-Dimensional Gas Chromatography

Published on: September 2, 2020

5.5K
Comparing Bibliometric Analysis Using PubMed, Scopus, and Web of Science Databases
05:02

Comparing Bibliometric Analysis Using PubMed, Scopus, and Web of Science Databases

Published on: October 24, 2019

34.2K
An Analytical Tool-box for Comprehensive Biochemical, Structural and Transcriptome Evaluation of Oral Biofilms Mediated by Mutans Streptococci
11:09

An Analytical Tool-box for Comprehensive Biochemical, Structural and Transcriptome Evaluation of Oral Biofilms Mediated by Mutans Streptococci

Published on: January 25, 2011

18.2K

Area of Science:

  • Cheminformatics
  • Medicinal Chemistry
  • Drug Discovery

Background:

  • Molecular similarity is a core concept in cheminformatics but lacks objective assessment.
  • A practical definition is proposed: molecules are similar if a medicinal chemist would test them concurrently in a drug discovery program.
  • This definition leverages medicinal chemistry literature, assuming molecules in the same activity table are considered similar.

Purpose of the Study:

  • To develop a practical definition of molecular similarity for drug discovery.
  • To create benchmark datasets for evaluating structural similarity measures.
  • To assess the performance of various molecular fingerprints.

Main Methods:

  • Defined molecular similarity based on medicinal chemists' likely testing strategies.
  • Constructed benchmark datasets from ChEMBL activity data and medicinal chemistry literature.
  • Evaluated 28 different molecular fingerprints using these benchmark datasets.

Main Results:

  • Created two sensitive benchmark datasets for structural similarity, outperforming virtual screening results.
  • Identified Extended-connectivity fingerprints (ECFP) with diameter 4 and 6, and topological torsion fingerprints as top performers for diverse structures.
  • Atom pair fingerprints excelled at ranking very close analogues.

Conclusions:

  • ECFP fingerprints show significantly improved performance with increased bit-vector length (1024 to 16,384) for ranking diverse structures and virtual screening.
  • The developed benchmark datasets are valuable for guiding the development of improved molecular similarity measures.
  • Specific fingerprints are recommended based on whether ranking diverse structures or close analogues.