Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Proteomics01:33

Proteomics

8.8K
A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term...
8.8K
Classification of Titrimetric Analysis Based on Reaction Types01:01

Classification of Titrimetric Analysis Based on Reaction Types

1.1K
Titrimetric analysis in solution chemistry involves measuring the volume of solutions and is often called volumetric analysis. The standard solution of known concentration in the burette is called the titrant, whereas the solution of unknown concentration in the flask is called the analyte, or titrand. Titrimetric analyses can be classified into four types based on the reactions between the titrant and analyte.
Titrations between an acid and a base lead to neutralization reactions that form...
1.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Gene expression profiling of whole blood samples following marathon running in non-elite athletes.

Biology of sport·2026
Same author

Gene expression integration and similarity score-based modeling improve risk stratification in idiopathic venous thrombophilia.

Journal of thrombosis and haemostasis : JTH·2026
Same author

Phenotypic clustering of newly diagnosed type 2 diabetes in a Mediterranean cohort.

Cardiovascular diabetology·2026
Same author

singIST: An integrative method for comparative single-cell transcriptomics between disease models and humans.

PLoS computational biology·2026
Same author

Proof of concept for an age- and inflammation-adjusted model for the establishment of pediatric serum copper reference intervals.

Clinical nutrition (Edinburgh, Scotland)·2026
Same author

Guanine and pregnenolone sulfate are associated with incident type 2 diabetes in two independent populations.

Frontiers in endocrinology·2025
Same journal

PACEff Builder: An Efficient Platform for Constructing PACE Hybrid-Resolution Models for Molecular Dynamics Simulations of Aqueous Protein, Peptide Assembly, and Membrane Protein Systems.

Journal of chemical information and modeling·2026
Same journal

TransKla: A Local-Global Cross-Attention Based Transformer Approach for Prediction of Lysine Lactylation Sites.

Journal of chemical information and modeling·2026
Same journal

CondenSimAdapter: A Versatile Builder for Multiscale Simulations of Protein Condensates with Broad Force-Field Compatibility and Robust Dense-Phase Relaxation.

Journal of chemical information and modeling·2026
Same journal

Simulation Guided Design of a Potentially Hyperactive Ice Nucleating Protein.

Journal of chemical information and modeling·2026
Same journal

Setting the Bases of the Photogenotoxicity of <i>p</i>-Aminobenzoic Acid.

Journal of chemical information and modeling·2026
Same journal

Probing Charge-Controlled Inter-Domain Flexibility: Integrating Experimental and Coarse-Grained Approaches.

Journal of chemical information and modeling·2026
See all related articles

Related Experiment Video

Updated: Nov 11, 2025

Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.4K

Balancing Data on Deep Learning-Based Proteochemometric Activity Classification.

Angela Lopez-Del Rio1,2, Sergio Picart-Armada1,2, Alexandre Perera-Lluna1,2

  • 1B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, 08028 Barcelona, Spain.

Journal of Chemical Information and Modeling
|March 29, 2021
PubMed
Summary
This summary is machine-generated.

Data imbalance in proteochemometric models can affect performance. A semi-resampling strategy combining data augmentation and clustering is recommended for accurate ligand-target activity prediction in pharmaceutical development.

More Related Videos

Deep Proteome Profiling by Isobaric Labeling, Extensive Liquid Chromatography, Mass Spectrometry, and Software-assisted Quantification
10:37

Deep Proteome Profiling by Isobaric Labeling, Extensive Liquid Chromatography, Mass Spectrometry, and Software-assisted Quantification

Published on: November 15, 2017

12.3K
Author Spotlight: Generating Neuronal Phenotypic Profiles - A Protocol to Culture and Image Human Midbrain Dopaminergic Neurons
09:21

Author Spotlight: Generating Neuronal Phenotypic Profiles - A Protocol to Culture and Image Human Midbrain Dopaminergic Neurons

Published on: July 7, 2023

1.8K

Related Experiment Videos

Last Updated: Nov 11, 2025

Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.4K
Deep Proteome Profiling by Isobaric Labeling, Extensive Liquid Chromatography, Mass Spectrometry, and Software-assisted Quantification
10:37

Deep Proteome Profiling by Isobaric Labeling, Extensive Liquid Chromatography, Mass Spectrometry, and Software-assisted Quantification

Published on: November 15, 2017

12.3K
Author Spotlight: Generating Neuronal Phenotypic Profiles - A Protocol to Culture and Image Human Midbrain Dopaminergic Neurons
09:21

Author Spotlight: Generating Neuronal Phenotypic Profiles - A Protocol to Culture and Image Human Midbrain Dopaminergic Neurons

Published on: July 7, 2023

1.8K

Area of Science:

  • Computational chemistry
  • Cheminformatics
  • Machine learning in drug discovery

Background:

  • In silico analysis of biological activity data is crucial for pharmaceutical development.
  • Proteochemometric models leverage machine learning for ligand-target activity prediction by sharing information across targets.
  • Bioactivity datasets often exhibit imbalance, potentially compromising model performance.

Purpose of the Study:

  • To investigate the impact of various data balancing strategies on deep learning proteochemometric models.
  • To control for compound series bias using clustering techniques within these models.
  • To evaluate balancing strategies across diverse protein families (kinases, GPCRs, nuclear receptors, proteases).

Main Methods:

  • Exploration of four balancing strategies: no_resampling, resampling_after_clustering, resampling_before_clustering, and semi_resampling.
  • Application of clustering to mitigate compound series bias.
  • Evaluation of models on BindingDB datasets encompassing kinases, GPCRs, nuclear receptors, and proteases.
  • Utilizing deep learning for target-compound activity classification.

Main Results:

  • The predicted proportion of positive activity predictions correlated with the actual data balance in the test set.
  • Data balancing strategies demonstrably influenced the performance estimates of proteochemometric models.
  • The semi_resampling strategy, involving data augmentation and clustering, showed promise in mitigating imbalance effects.

Conclusions:

  • Data balance significantly impacts the performance estimation of proteochemometric models.
  • A semi-resampling approach, integrating data augmentation and clustering in the training set, is recommended for realistic scenarios.
  • This strategy helps mitigate data imbalance effects, leading to more reliable predictions in pharmaceutical development.