Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Analysis01:11

Regression Analysis

6.0K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
6.0K
Multiple Regression01:25

Multiple Regression

3.1K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.1K
Regression Toward the Mean01:52

Regression Toward the Mean

6.5K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.5K
Correlation and Regression00:53

Correlation and Regression

1.7K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
1.7K
Woodward–Hoffmann Selection Rules and Microscopic Reversibility01:34

Woodward–Hoffmann Selection Rules and Microscopic Reversibility

3.3K
Electrocyclic reactions, cycloadditions, and sigmatropic rearrangements are concerted pericyclic reactions that proceed via a cyclic transition state. These reactions are stereospecific and regioselective. The stereochemistry of the products depends on the symmetry characteristics of the interacting orbitals and the reaction conditions. Accordingly, pericyclic reactions are classified as either symmetry-allowed or symmetry-forbidden. Woodward and Hoffmann presented the selection criteria for...
3.3K
Predicting Products: Substitution vs. Elimination02:52

Predicting Products: Substitution vs. Elimination

12.1K
When a nucleophile and an alkyl halide react, nucleophilic substitution and β-elimination reactions compete to generate products.
The following factors can influence the mechanisms competing against each other:
12.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Electron-Counting Controls Bifunctional Activity in Single-Atom Catalysts through a Three-Regime Adsorption Mechanism.

Journal of the American Chemical Society·2026
Same author

Geostructure-Induced Nonmonotonic Transition between Cassie and Wenzel States on Hydrophilic Substrates.

Langmuir : the ACS journal of surfaces and colloids·2026
Same author

Atomic-level insights into the high intrinsic thermostability of individual anatase TiO<sub>2</sub> nanocrystals through surface-locking effects.

Nature communications·2026
Same author

Unraveling the Temperature Effects on the Raman Spectra of V<sub>2</sub>O<sub>5</sub> by Combining Machine Learning and Molecular Dynamics.

The journal of physical chemistry letters·2026
Same author

Surface Coordination-Promoted Electrosynthesis of 1,4-Butanediol from Ethylene and Water via 2-Bromoethanol.

Journal of the American Chemical Society·2026
Same author

Upwind electromigration of sub-10-nm metallic nano-interconnects.

Nature communications·2026
Same journal

Improving PCM in Protic Media: Markov State Models for TD-DFT Calculations.

Journal of chemical theory and computation·2026
Same journal

Efficient Coupled-Cluster Python Frameworks for Next-Generation GPUs: A Comparative Study of CuPy and PyTorch on the Hopper and Grace Hopper Architecture.

Journal of chemical theory and computation·2026
Same journal

Extending the MARTINI 3 Coarse-Grained Force Field to Polypeptoids.

Journal of chemical theory and computation·2026
Same journal

Statistical Mechanics of Density- and Temperature-Dependent Potentials: Application to Condensed Phases within GenDPDE.

Journal of chemical theory and computation·2026
Same journal

BFEE-Docking: A User-Friendly and Customizable End-to-End Tool from High-Throughput Virtual Screening to Binding Free-Energy Calculations.

Journal of chemical theory and computation·2026
Same journal

On-the-Fly Trajectory Simulation of Two-Pulse, Three-Pulse, and Higher-Order Pump-Probe Signals.

Journal of chemical theory and computation·2026
See all related articles

Related Experiment Video

Updated: Sep 4, 2025

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

6.7K

Improving Symbolic Regression for Predicting Materials Properties with Iterative Variable Selection.

Zhen Guo1, Shunbo Hu1,2, Zhong-Kang Han3

  • 1Materials Genome Institute, Shanghai University, Shanghai 200444, China.

Journal of Chemical Theory and Computation
|July 14, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces VS-SISSO, a new algorithm combining symbolic regression and variable selection to discover explicit mathematical relationships in materials science. It efficiently handles complex datasets with numerous features, improving model accuracy.

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
Experimental Multiscale Methodology for Predicting Material Fouling Resistance
09:13

Experimental Multiscale Methodology for Predicting Material Fouling Resistance

1.5K

Related Experiment Videos

Last Updated: Sep 4, 2025

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

6.7K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
Experimental Multiscale Methodology for Predicting Material Fouling Resistance
09:13

Experimental Multiscale Methodology for Predicting Material Fouling Resistance

1.5K

Area of Science:

  • Materials Science
  • Computational Chemistry
  • Data Science

Background:

  • Symbolic regression (SR) is valuable for deriving explicit structure-property relationships in materials.
  • High-dimensional data and unclear key variables pose significant challenges for traditional SR methods.
  • Existing SR techniques struggle with computational complexity when dealing with a large number of input features.

Purpose of the Study:

  • To develop an automated approach for identifying important variables in complex materials science problems.
  • To enhance the efficiency and accuracy of symbolic regression for materials discovery.
  • To overcome the computational limitations of SR methods when faced with high-dimensional feature spaces.

Main Methods:

  • Integration of symbolic regression (using SISSO) with iterative variable selection (VS).
  • Development of a novel algorithm: VS-assisted SISSO (VS-SISSO).
  • Utilized random search for the variable selection component within the algorithm.

Main Results:

  • VS-SISSO effectively manages hundreds of input features, overcoming limitations of standalone SISSO.
  • The algorithm demonstrates fast convergence towards optimal or near-optimal solutions for models with moderate complexity.
  • Successfully applied to learn approximate equations for perovskite band gaps and single-atom alloy catalyst stability.

Conclusions:

  • The proposed VS-SISSO approach significantly improves the accuracy and applicability of symbolic regression in materials science.
  • Automated variable selection is crucial for enabling SR in complex, high-dimensional materials data.
  • This method accelerates the discovery of explicit mathematical models for material properties.