Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Correlation and Regression00:53

Correlation and Regression

In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a negative...
Calculating and Interpreting the Linear Correlation Coefficient01:11

Calculating and Interpreting the Linear Correlation Coefficient

The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable, x, and the dependent variable, y. Hence, it is also known as the Pearson product-moment correlation coefficient. It can be calculated using the following equation:
Coefficient of Correlation01:12

Coefficient of Correlation

The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable x and the dependent variable y.
If you suspect a linear relationship between x and y, then r can measure how strong the linear relationship is.
What the VALUE of r tells us:
The value of r is always between –1 and +1: –1 ≤ r ≤ 1.
The size of the correlation r indicates the strength of the linear...
Correlation of Experimental Data01:23

Correlation of Experimental Data

Dimensional analysis simplifies complex physical problems and guides experimental investigations, but it does not provide complete solutions. It identifies the dimensionless groups that influence a phenomenon, but experimental data is needed to establish the specific relationships and validate theoretical predictions.
For example, a spherical particle moving through a viscous fluid experiences drag. Dimensional analysis shows that the drag force depends on the particle's diameter, velocity, and...
Multiple Regression01:25

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
Calibration Curves: Correlation Coefficient01:10

Calibration Curves: Correlation Coefficient

In a linear calibration curve, there is a value called the calibration coefficient, denoted by 'r,' which measures the strength and the direction of association between two variables. The correlation coefficient value ranges from −1 to +1. A value of +1 indicates a perfect positive linear correlation, −1 denotes a perfect negative correlation, and 0 implies no correlation between the two variables. A positive correlation value establishes that as one variable increases, the other increases, and...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Classification of Apples (<i>Malus × domestica</i> borkh.) According to Geographical Origin, Variety and Production Method Using Liquid Chromatography Mass Spectrometry and Random Forest.

Foods (Basel, Switzerland)·2025
Same author

Competing technologies: determining the geographical origin of strawberries (<i>Fragaria</i> × <i>ananassa</i>) using laboratory based near-infrared spectroscopy compared to a simple portable device.

Molecular omics·2024
Same author

Detection of Sugar Syrups in Honey Using Untargeted Liquid Chromatography-Mass Spectrometry and Chemometrics.

Metabolites·2024
Same author

SERS microscopy as a tool for comprehensive biochemical characterization in complex samples.

Chemical Society reviews·2024
Same author

Food Monitoring: Limitations of Accelerated Storage to Predict Molecular Changes in Hazelnuts (<i>Corylus avellana</i> L.) under Realistic Conditions Using UPLC-ESI-IM-QTOF-MS.

Metabolites·2023
Same author

Opening the Random Forest Black Box of <sup>1</sup>H NMR Metabolomics Data by the Exploitation of Surrogate Variables.

Metabolites·2023
Same journal

Single-cell mass spectrometry imaging: platform advances for multimodal spatial omics.

Analytical and bioanalytical chemistry·2026
Same journal

Advancing total uronic acid quantification using a stable isotope dilution approach: validation and application to plant- and algal-derived polysaccharides.

Analytical and bioanalytical chemistry·2026
Same journal

Electroanalytical method development for the receptor tyrosine kinase inhibitor lenvatinib using a Ti<sub>3</sub>C<sub>2</sub>T<sub>x</sub>-MXene based molecularly imprinted polymer modified carbon electrode.

Analytical and bioanalytical chemistry·2026
Same journal

Impact of blood contamination on hydrophilic metabolomics in human meningioma tissue.

Analytical and bioanalytical chemistry·2026
Same journal

A UiO-66@MR paper-based colorimetric sensor for sensitive detection of food spoilage volatile organic compounds and visual freshness monitoring.

Analytical and bioanalytical chemistry·2026
Same journal

An electrochemical sensing platform based on UiO@TATF COF/CB composites for the detection of nitrofurazone.

Analytical and bioanalytical chemistry·2026
See all related articles

Related Experiment Videos

Understanding complex analytical data by a supervised correlation coefficient obtained from random forest.

Stephan Seifert1

  • 1Hamburg School of Food Science, Department of Chemistry, Universität Hamburg, Grindelallee 117, 20146, Hamburg, Germany. stephan.seifert@uni-hamburg.de.

Analytical and Bioanalytical Chemistry
|June 23, 2026
PubMed
Summary
This summary is machine-generated.

Surrogate Minimal Depth (SMD) enhances chemometric analysis by revealing variable relationships for better interpretation of complex analytical data. This method aids in understanding how variables impact predictive models, improving biological sample classification and component quantification.

Keywords:
BeveragesChemometricsFoodsICP-MSIR spectroscopyMass spectrometryRaman spectroscopySoftwareStatisticsVariable selection

Related Experiment Videos

Area of Science:

  • Chemometrics
  • Analytical Chemistry
  • Bioinformatics

Background:

  • Multivariate chemometric methods are crucial for analyzing complex analytical data from techniques like mass spectrometry and NMR.
  • Interpreting variables contributing to predictions is as important as accurate sample classification or quantification.
  • Existing variable selection algorithms, such as partial least squares regression and random forest, have limitations in fully capturing variable interdependencies.

Purpose of the Study:

  • To illustrate the application of Surrogate Minimal Depth (SMD) for variable selection and relationship analysis in complex analytical data.
  • To demonstrate how SMD incorporates variable interdependencies into variable importance assessment.
  • To highlight the utility of SMD in interpreting predictive models derived from diverse analytical data types.

Main Methods:

  • Utilizing surrogate variables derived from random forest models within the SMD framework.
  • Quantifying the mutual impact of variables on predictive models using a supervised correlation coefficient.
  • Applying SMD to various types of complex analytical data, including mass spectrometry, NMR, and vibrational spectroscopy.

Main Results:

  • SMD effectively incorporates variable relationships into the analysis of variable importance.
  • The method provides a supervised correlation coefficient to express the mutual impact of variables.
  • Demonstrated successful application of SMD across different analytical data types, showcasing its versatility.

Conclusions:

  • SMD offers a powerful approach for interpreting complex analytical data by elucidating variable relationships.
  • The method enhances the understanding of how individual variables and their interactions influence predictive models.
  • SMD holds significant potential for advancing chemometric analysis in various scientific domains.