Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

15.4K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
15.4K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.7K
3.7K
Multiple Regression01:25

Multiple Regression

4.2K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
4.2K
Regression Analysis01:11

Regression Analysis

8.8K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
8.8K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

455
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
455
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.4K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The Poisson CUSUM Chart for Monitoring Small Counts: Addressing the Estimation Uncertainty.

Biometrical journal. Biometrische Zeitschrift·2026
Same author

A pipeline for meso- and microplastic identification in aquatic systems using machine learning and hyperspectral imaging.

Environmental pollution (Barking, Essex : 1987)·2026
Same author

Can atrial fibrillation ablation outcomes be properly predicted with electrocardiography and artificial intelligence?

European heart journal. Digital health·2026
Same author

Continuous non-invasive electrophysiological monitoring in high-risk pregnancies: study protocol of a cohort intervention random sampling study in a tertiary obstetrical care centre in the Netherlands (NIEM-O study).

BMJ open·2025
Same author

Optimizing clinical scientific research: the cohort intervention random sampling study with historical controls.

Journal of comparative effectiveness research·2025
Same author

An end-to-end data analysis framework for real-time detection and source identification of pollution events via e-nose networks.

Analytical and bioanalytical chemistry·2025
Same journal

Smartphone-assisted fluorescence and colorimetric dual-mode sensor for visual quantitative detection of nitrite and nitrate in real samples.

Analytica chimica acta·2026
Same journal

Folding integrated all-paper photoelectrochemical immunoassay using annealed ZnO for point-of-care detection of ferritin.

Analytica chimica acta·2026
Same journal

Dual-mode electrochemical-SERS detection of chloramphenicol based on dual-signal enhancement.

Analytica chimica acta·2026
Same journal

Multi-screening of beta-lactam antibiotics in milk based on Fe<sub>3</sub>O<sub>4</sub>@phage/bacteria system and aggregation induced emission luminogen.

Analytica chimica acta·2026
Same journal

A porous phosphate-rich β-cyclodextrin polymer for efficient and broad-spectrum enrichment of antibiotics.

Analytica chimica acta·2026
Same journal

Corrigendum to "LUMIN: A novel algorithm for automated mixture quantification using 1D <sup>1</sup>H NMR spectra" [Analytica Chimica Acta 1411 (2026) 345639].

Analytica chimica acta·2026
See all related articles

Related Experiment Videos

Boosting model performance and interpretation by entangling preprocessing selection and variable selection.

Jan Gerretzen1, Ewa Szymańska1, Jacob Bart2

  • 1Radboud University, Institute for Molecules and Materials, Heyendaalseweg 135, 6525 AJ Nijmegen, The Netherlands; TI-COAST, P.O. Box 18, 6160 MD Geleen, The Netherlands.

Analytica Chimica Acta
|September 14, 2016
PubMed
Summary
This summary is machine-generated.

Selecting the right data preprocessing and variable selection methods is crucial for accurate chemometric modeling. This study integrates both, improving model interpretation and predictive performance by considering prior knowledge of true variables.

Keywords:
ChemometricsDesign of experimentsPartial least squaresPreprocessing selectionVariable selection

Related Experiment Videos

Area of Science:

  • Chemometrics
  • Data Science
  • Analytical Chemistry

Background:

  • Data preprocessing is essential for removing artifacts and enhancing information in analytical data.
  • Selecting optimal preprocessing strategies is challenging due to numerous available methods.
  • Existing methods often focus solely on predictive performance, neglecting model interpretability.

Purpose of the Study:

  • To integrate variable selection into a Design of Experiments (DoE)-based preprocessing selection approach.
  • To enhance the interpretability and predictive performance of chemometric models.
  • To address the impact of data artifacts on variable selection and model interpretation.

Main Methods:

  • Utilized a Design of Experiments (DoE) framework for selecting preprocessing strategies.
  • Integrated variable selection methods within the DoE-based preprocessing selection process.
  • Employed Partial Least Squares (PLS) as the modeling technique and PPRV-FCAM for variable selection.

Main Results:

  • The integrated approach significantly improved both model interpretability and predictive performance.
  • Variable selection results showed higher compliance with true informative variables compared to individual optimization.
  • Demonstrated that combining preprocessing and variable selection yields superior outcomes.

Conclusions:

  • The presented generic approach enhances objective interpretation and predictive accuracy in chemometric modeling.
  • This integrated strategy offers a robust solution for selecting optimal preprocessing and variable selection methods.
  • The methodology is adaptable to various models, variable selection techniques, and preprocessing methods.