An algorithm to classify homologous series within compound datasets.

Adelene Lai¹,², Jonas Schaub³, Christoph Steinbeck³

  • ¹Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367, Belvaux, Luxembourg. adelene.lai@uni.lu.

Journal of Cheminformatics | December 12, 2022
Summary
This summary is machine-generated.

An algorithm was developed to identify homologous series in chemical datasets, supporting both the analysis of chemical diversity and compound identification. The method classifies compounds by their repeating structural units, improving data annotation for environmental and natural product chemistry.

Keywords:
Algorithm, Environmental chemistry, Exposomics, Fragmentation, Homologous series, Natural products, Pattern recognition, Polymers, RDKit, Scaffolds

Area of Science:

  • Computational Chemistry
  • Cheminformatics
  • Data Science

Background:

  • Homologous series are crucial in various scientific fields but often remain unannotated in chemical databases.
  • This lack of annotation hinders the comprehension of chemical diversity and analytical identification.

Purpose of the Study:

  • To develop and implement an algorithm for detecting homologous series within compound datasets.
  • To enhance the annotation of chemical compounds and facilitate their identification.

Main Methods:

  • Developed an algorithm using RDKit to classify homologous series based on repeating units (monomers) and core structures.
  • Applied the algorithm to three diverse datasets: NORMAN-SLE (environmental chemistry), PubChemLite for Exposomics, and COCONUT (natural products).
  • Utilized SMILES strings for molecules and SMARTS for monomers as primary inputs in an iterative process of substructure matching and fragmentation.
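
The matching-and-fragmentation idea in the methods above can be sketched roughly as follows. This is a simplified illustration, not the published OngLai implementation: the SMARTS pattern `[CH2;!R]`, the helper names `split_core_and_units` and `group_by_core`, and the grouping rule are all assumptions made for this example. It uses only standard RDKit calls (`MolFromSmiles`, `MolFromSmarts`, `GetSubstructMatches`, `DeleteSubstructs`, `MolToSmiles`).

```python
from collections import defaultdict

from rdkit import Chem

# Assumption: a simple SMARTS for an acyclic CH2 repeating unit.
MONOMER = Chem.MolFromSmarts("[CH2;!R]")


def split_core_and_units(smiles, monomer=MONOMER):
    """Return (core SMILES, number of monomer matches), or None if unusable."""
    mol = Chem.MolFromSmiles(smiles)
    if mol is None:  # SMILES could not be parsed
        return None
    n_units = len(mol.GetSubstructMatches(monomer))
    if n_units == 0:  # no repeating unit in this molecule
        return None
    # Fragmentation step: delete every atom matched by the monomer pattern.
    core = Chem.DeleteSubstructs(mol, monomer)
    return Chem.MolToSmiles(core), n_units


def group_by_core(smiles_list):
    """Group molecules whose leftover core is identical after monomer removal."""
    groups = defaultdict(list)
    for smi in smiles_list:
        result = split_core_and_units(smi)
        if result is not None:
            core, n_units = result
            groups[core].append((n_units, smi))
    # Keep cores shared by molecules with at least two distinct unit counts.
    return {
        core: sorted(members)
        for core, members in groups.items()
        if len({n for n, _ in members}) > 1
    }


if __name__ == "__main__":
    demo = ["CCCCO", "CCCCCO", "CCCCCCO", "CCCCN", "c1ccccc1"]
    for core, members in group_by_core(demo).items():
        print(core, members)
```

Because the core is written as a set of disconnected fragments, this sketch discards where the chain was attached and can therefore group molecules that a stricter, connectivity-aware procedure would separate.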

Main Results:

  • Successfully classified over 2,000, 12,000, and 5,000 homologous series with CH2 repeating units in the NORMAN-SLE, PubChemLite, and COCONUT datasets, respectively.
  • Validated the algorithm's performance against published homologous series and existing methods, including PFAS compound categorization.
  • Demonstrated the algorithm's effectiveness in identifying and grouping compounds belonging to homologous series.
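
To make "CH2 repeating unit" concrete at the formula level: adding one CH2 raises the carbon count by 1 and the hydrogen count by 2, so the quantity H − 2C and all other element counts stay constant across a CH2 series. The sketch below uses that invariant as a quick formula-only screen. It is a swapped-in simplification for illustration, not the structure-based method summarized above, and the function names (`parse_formula`, `ch2_invariant`, `candidate_series`) are hypothetical. Sharing the invariant is necessary but not sufficient: isomers and structurally unrelated compounds can share it too.

```python
import re
from collections import defaultdict


def parse_formula(formula):
    """Parse a simple molecular formula such as 'C4H10O' into element counts."""
    counts = defaultdict(int)
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] += int(number) if number else 1
    return dict(counts)


def ch2_invariant(formula):
    """Key that is unchanged when any number of CH2 units is added or removed."""
    counts = parse_formula(formula)
    carbons = counts.pop("C", 0)
    hydrogens = counts.pop("H", 0)
    # H - 2C is constant along a CH2 series; the remaining elements are too.
    return hydrogens - 2 * carbons, tuple(sorted(counts.items()))


def candidate_series(formulas):
    """Group formulas by CH2 invariant, keeping groups with more than one member."""
    groups = defaultdict(list)
    for formula in formulas:
        groups[ch2_invariant(formula)].append(formula)
    return {
        key: sorted(members, key=lambda f: parse_formula(f).get("C", 0))
        for key, members in groups.items()
        if len(members) > 1
    }


if __name__ == "__main__":
    print(candidate_series(["C4H10O", "C6H14O", "C5H12O", "C6H6"]))
```

For example, C4H10O, C5H12O, and C6H14O all give H − 2C = 2 with one oxygen, so they fall into the same candidate group, while C6H6 does not.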

Conclusions:

  • The developed algorithm, OngLai, effectively classifies homologous series in large chemical datasets.
  • This tool addresses a significant challenge in chemical data annotation, improving chemical diversity analysis and identification.
  • The open availability of the algorithm promotes further research and application in related scientific domains.