Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Distribution Reliability and Automation01:25

Distribution Reliability and Automation

Distribution reliability in electrical power systems is critical for ensuring an uninterrupted power supply to consumers at minimal cost. According to IEEE Standard Terms, reliability is the probability that a device will function without failure over a specified time period or amount of usage. For electric power distribution, this translates to maintaining continuous power supply and addressing customer concerns over power outages. Several indices, as defined by IEEE Standard 1366-2012, are...
Distribution and Dispersion00:54

Distribution and Dispersion

To understand intra-specific interactions in populations, scientists measure the spatial arrangement of species individuals. This geographic arrangement is known as the species distribution or dispersion. Highly territorial species exhibit a uniform distribution pattern, in which individuals are spaced at relatively equal distances from one another. Species that are highly tied to particular resources, such as food or shelter, tend to concentrate around those resources, and thus exhibit a...
Distribution of Molecular Speeds01:27

Distribution of Molecular Speeds

The motion of molecules in a gas is random in magnitude and direction for individual molecules, but a gas of many molecules has a predictable distribution of molecular speeds. This predictable distribution of molecular speeds is known as the Maxwell-Boltzmann distribution. The distribution of molecular speeds in liquids is comparable to that of gases but not identical and can help to understand the phenomenon of the boiling and vapor pressure of a liquid. Consider that a molecule requires a...
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
Distributed Loads: Problem Solving01:21

Distributed Loads: Problem Solving

Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...
Data: Types and Distribution01:19

Data: Types and Distribution

In biostatistics, data are the observations collected for analysis. There are two main types: parametric and non-parametric. Parametric data, which include continuous (e.g., weight) and discrete numerical data (e.g., number of tablets), assume a particular distribution pattern, often the normal distribution. Non-parametric data do not adhere to a specific distribution and typically comprise nominal (e.g., gender) and ordinal categorical data (e.g., pain scale ratings).
Distributions in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Homogeneously dispersed cobalt catalytic sites in the channels of zirconium-organic framework enhance rapid degradation of tetracycline.

Journal of colloid and interface science·2026
Same author

HACC-TNF-α-VLP Nanoparticles Facilitate HSP90-Dependent Antigen Cross-Presentation to Enhance CD8⁺ T Cell and Mucosal Immunity Against Foot-and-Mouth Disease.

International journal of nanomedicine·2026
Same author

DiSCO: deconvoluting spatial transcriptomics via combinatorial optimization with a foundational diffusion model.

Briefings in bioinformatics·2026
Same author

Keratinocyte-specific STAT3 knockout attenuates imiquimod-induced psoriasiform phenotype in mice.

Experimental animals·2026
Same author

Wnt7a mediates the Wnt/β-catenin signaling pathway to influence ferroptosis in T lymphocytes of HIV-infected individuals.

Molecular immunology·2026
Same author

Role of the STAT3 Signaling Pathway in Cell Proliferation and Inflammation in Psoriasis and Approaches for Targeted Therapies: A Review.

Medical science monitor : international medical journal of experimental and clinical research·2026
Same journal

Spectroscopic Investigation of the In Vivo Light-Dependent Photodynamics of the Marine Diatom Phaeodactylum tricornutum.

Chemphyschem : a European journal of chemical physics and physical chemistry·2026
Same journal

Atomistic Insights into the Thermal Decomposition and Runaway Mechanism of Peroxypropionic Acid.

Chemphyschem : a European journal of chemical physics and physical chemistry·2026
Same journal

Hydrazine Adsorption on Hexagonal Ice (0001): First-Principles Investigations on Stability, Dynamics, and Chirality Changes.

Chemphyschem : a European journal of chemical physics and physical chemistry·2026
Same journal

Sustainable Ball Milling-Assisted Synthesis of Bread Waste-Derived Highly Porous Carbons for Adsorption-Based Applications.

Chemphyschem : a European journal of chemical physics and physical chemistry·2026
Same journal

RNALig: An ML-Driven Structure-Based Scoring Function for Estimating Binding Affinities of RNA-Ligand Complexes.

Chemphyschem : a European journal of chemical physics and physical chemistry·2026
Same journal

Photoswitchable Polar Azobenzene-Based Liquid Crystals for Electro-Optic and Optical Data Storage Applications.

Chemphyschem : a European journal of chemical physics and physical chemistry·2026
See all related articles

Related Experiment Video

Updated: May 20, 2026

New Features in Visual Dynamics 3.0
05:00

New Features in Visual Dynamics 3.0

Published on: August 9, 2024

D2VP: A Visual First Platform for Diagnosing Data Distribution Mismatch Toward Reliable Machine Learning in Materials

Xuqiang Shao1, Zhaoyan Dong1, Piao Ma2,3

  • 1Department of Computer Science, North China Electric Power University, Baoding, China.

Chemphyschem : a European Journal of Chemical Physics and Physical Chemistry
|May 18, 2026
PubMed
Summary
This summary is machine-generated.

Data Distribution Mismatch (DDM) hinders machine learning reliability in chemistry. Our visual platform, D2VP, diagnoses and visualizes data issues, enhancing model trustworthiness for AI in science.

Keywords:
data distribution mismatchdata–centric Imachine learning reliabilitymaterials informaticsvisual analytics

More Related Videos

Novel 3D/VR Interactive Environment for MD Simulations, Visualization and Analysis
11:29

Novel 3D/VR Interactive Environment for MD Simulations, Visualization and Analysis

Published on: December 18, 2014

Visualization Method for Proprioceptive Drift on a 2D Plane Using Support Vector Machine
07:05

Visualization Method for Proprioceptive Drift on a 2D Plane Using Support Vector Machine

Published on: October 27, 2016

Related Experiment Videos

Last Updated: May 20, 2026

New Features in Visual Dynamics 3.0
05:00

New Features in Visual Dynamics 3.0

Published on: August 9, 2024

Novel 3D/VR Interactive Environment for MD Simulations, Visualization and Analysis
11:29

Novel 3D/VR Interactive Environment for MD Simulations, Visualization and Analysis

Published on: December 18, 2014

Visualization Method for Proprioceptive Drift on a 2D Plane Using Support Vector Machine
07:05

Visualization Method for Proprioceptive Drift on a 2D Plane Using Support Vector Machine

Published on: October 27, 2016

Area of Science:

  • Chemistry
  • Materials Science
  • Data Science
  • Machine Learning

Background:

  • Machine learning (ML) models in chemistry and materials discovery face reliability challenges due to Data Distribution Mismatch (DDM).
  • DDM manifests as over-density, sparsity, and extrapolation risk, compromising model predictions.
  • Existing reliability assessments often rely on abstract metrics, lacking intuitive scientific insight.

Purpose of the Study:

  • To introduce a data-centric, visual-first methodology for assessing ML model reliability.
  • To operationalize this methodology through an open-source interactive system called D2VP (Data Diagnosis & Visualization Platform).
  • To transform abstract reliability metrics into actionable, evidence-based visual inquiry.

Main Methods:

  • Developed D2VP, an interactive system with dual capabilities for diagnosing distribution anomalies and visualizing high-dimensional chemical spaces.
  • Employed a human-in-the-loop approach for interactive data exploration and reliability assessment.
  • Validated the methodology through case studies addressing over-density, sparsity, and extrapolation risk.

Main Results:

  • Redundancy-reduced sampling decreased dataset size by over 60% with minimal accuracy loss.
  • Exploration-based sampling, with only 10% targeted data, reduced model mean absolute error by over 80% for sparse datasets.
  • The credibility analysis toolkit identified high-risk extrapolation scenarios, including a 1:400 training-to-prediction ratio.

Conclusions:

  • The D2VP methodology effectively transforms abstract reliability metrics into actionable scientific insights.
  • The platform provides a domain-agnostic framework for building trustworthy predictive models in AI for science.
  • Visual inquiry and data-centric diagnosis are crucial for reliable ML applications in scientific discovery.