Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Distillation: Vapor–Liquid Equilibria01:01

Distillation: Vapor–Liquid Equilibria

5.6K
Distillation is a separation technique that takes advantage of the boiling point properties of disparate elements in a mixture. To perform distillation, we begin by heating a miscible mixture of two liquids with a significant difference in boiling points (at least 20°C). As the solution heats up and reaches the bubble point of the more volatile component, some molecules of the more volatile component transition into the gas phase and travel upward into the condenser, which is a glass tube...
5.6K
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

7.4K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
7.4K
Data Validation01:15

Data Validation

3.6K
Method validation is a crucial process in analytical chemistry designed to confirm that a given method consistently produces reliable and high-quality results. This process is essential when a method is applied to different sample matrices or when procedural modifications are made, ensuring that the results meet acceptable standards across various applications.
Key parameters for method validation include:
3.6K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.6K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.6K
Modified Boxplots00:57

Modified Boxplots

11.8K
A standard box and whisker plot informs us about the spread of the data in a given sample. One can identify the minimum value, maximum value, first quartile value, second quartile or median value, and third quartile.
However, the box plot does not tell the reader about outliers - values that lie far from the center of the data. We can modify the standard box and whisker plot to identify the outliers and visualize the actual spread of the data in a sample.
Initially, we calculate the adjusted...
11.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Prediction of diffusion coefficients in mixtures with tensor completion.

Physical chemistry chemical physics : PCCP·2026
Same author

Enabling Quantitative Benchtop <sup>13</sup>C NMR Spectroscopy in Fast Continuous Flow.

Magnetic resonance in chemistry : MRC·2026
Same author

Thermodynamically consistent machine learning model for excess Gibbs energy.

Nature communications·2026
Same author

CHAOS - A Large-scale Database for σ-Profiles and Other Molecular Descriptors.

Journal of chemical information and modeling·2026
Same author

Improvement of Diffusion Coefficient Prediction by Active Learning.

The journal of physical chemistry. B·2025
Same author

Quantitative Nuclear Magnetic Resonance Spectroscopy with Overhauser Dynamic Nuclear Polarization.

Chemphyschem : a European journal of chemical physics and physical chemistry·2025
Same journal

An On-Orbit Star Tracker Benchmark for Resident Space Object Detection and Attitude Estimation.

Scientific data·2026
Same journal

Chromosome-level genome assembly of an endangered bird Java Sparrow (Padda oryzivora).

Scientific data·2026
Same journal

A High-Fidelity Multi-Model Benchmark Dataset for General Aviation Anomaly Detection Generated via Physics-Based Fault Injection.

Scientific data·2026
Same journal

DriE-Cog dataset: A multimodal physiological-behavioural dataset for intelligent driving emergency response.

Scientific data·2026
Same journal

High-resolution 3D scan of the Devil's trails palaeontological ichnosite (Southern Italy) for the digital legacy.

Scientific data·2026
Same journal

Chromosome-level genome assembly of oil-tea tree Camellia brevistyla.

Scientific data·2026
See all related articles
  1. Home
  2. Batch Distillation Data For Developing Machine Learning Anomaly Detection Methods.
  1. Home
  2. Batch Distillation Data For Developing Machine Learning Anomaly Detection Methods.

Related Experiment Video

Single Droplet Digital Polymerase Chain Reaction for Comprehensive and Simultaneous Detection of Mutations in Hotspot Regions
08:23

Single Droplet Digital Polymerase Chain Reaction for Comprehensive and Simultaneous Detection of Mutations in Hotspot Regions

Published on: September 25, 2018

14.2K

Batch Distillation Data for Developing Machine Learning Anomaly Detection Methods.

Justus Arweiler1, Indra Jungjohann1, Aparna Muraleedharan2

  • 1Laboratory of Engineering Thermodynamics, RPTU Kaiserslautern, Erwin-Schrödinger-Straße 44, 67663, Kaiserslautern, Germany.

Scientific Data
|March 31, 2026

View abstract on PubMed

Summary
This summary is machine-generated.

Researchers created a new dataset for machine learning anomaly detection in chemical processes. This freely available data includes diverse sensor readings and expert annotations to train advanced methods.

More Related Videos

Chromatographic Fingerprinting by Template Matching for Data Collected by Comprehensive Two-Dimensional Gas Chromatography
10:14

Chromatographic Fingerprinting by Template Matching for Data Collected by Comprehensive Two-Dimensional Gas Chromatography

Published on: September 2, 2020

5.6K
Quantitative Detection of Trace Explosive Vapors by Programmed Temperature Desorption Gas Chromatography-Electron Capture Detector
07:57

Quantitative Detection of Trace Explosive Vapors by Programmed Temperature Desorption Gas Chromatography-Electron Capture Detector

Published on: July 25, 2014

20.5K

Related Experiment Videos

Single Droplet Digital Polymerase Chain Reaction for Comprehensive and Simultaneous Detection of Mutations in Hotspot Regions
08:23

Single Droplet Digital Polymerase Chain Reaction for Comprehensive and Simultaneous Detection of Mutations in Hotspot Regions

Published on: September 25, 2018

14.2K
Chromatographic Fingerprinting by Template Matching for Data Collected by Comprehensive Two-Dimensional Gas Chromatography
10:14

Chromatographic Fingerprinting by Template Matching for Data Collected by Comprehensive Two-Dimensional Gas Chromatography

Published on: September 2, 2020

5.6K
Quantitative Detection of Trace Explosive Vapors by Programmed Temperature Desorption Gas Chromatography-Electron Capture Detector
07:57

Quantitative Detection of Trace Explosive Vapors by Programmed Temperature Desorption Gas Chromatography-Electron Capture Detector

Published on: July 25, 2014

20.5K

Area of Science:

  • Chemical Engineering
  • Data Science
  • Process Control

Background:

  • Machine learning (ML) shows promise for anomaly detection (AD) in chemical processes.
  • Development of ML-based AD methods is limited by the scarcity of public experimental data.

Purpose of the Study:

  • To address the data gap for ML-based AD in chemical processes.
  • To generate a comprehensive experimental dataset for training and validating ML models.
  • To enable the development of interpretable and explainable AD methods.

Main Methods:

  • Established a laboratory-scale batch distillation plant for data generation.
  • Conducted 119 experiments with varying operating conditions and mixtures, including induced anomalies.
  • Collected time-series sensor/actuator data, measurement uncertainty, NMR spectroscopy, video, and audio recordings.

Main Results:

  • Generated an extensive, structured dataset with fault-free and anomalous experiments.
  • Included detailed metadata, expert annotations, and an anomaly ontology.
  • Dataset is publicly available, facilitating ML-based AD research.

Conclusions:

  • The new dataset supports the advancement of ML-based AD in chemical processes.
  • Enables development of interpretable, explainable, and mitigable AD solutions.
  • Promotes further research in data-driven process monitoring and control.