Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K
Regression Analysis01:11

Regression Analysis

6.0K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
6.0K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.8K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.8K
Multiple Regression01:25

Multiple Regression

3.1K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.1K
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

123
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
123
Calibration Curves: Linear Least Squares01:20

Calibration Curves: Linear Least Squares

1.6K
A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...
1.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Substitution Spectrum and Selection at G-quadruplexes in Great Ape Telomere-to-Telomere Genomes.

Genome biology and evolution·2026
Same author

Mammalian mitochondrial DNA accumulates insertions and deletions with age in energetically demanding tissues.

Molecular biology and evolution·2026
Same author

Allele Frequency Selection and No Age-Related Increase in Human Oocyte Mitochondrial Mutations.

Obstetrical & gynecological survey·2026
Same author

Contrasting pre-vaccine COVID-19 waves in Italy through functional data analysis.

Scientific reports·2025
Same author

Comparative analysis of single-stranded and non-canonical DNA formation in human and other ape cells with telomere-to-telomere genomes.

bioRxiv : the preprint server for biology·2025
Same author

Non-canonical DNA in bird telomere-to-telomere genomes.

bioRxiv : the preprint server for biology·2025
Same journal

Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

fastkqr: A Fast Algorithm for Kernel Quantile Regression.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Empirical Bayes Covariance Decomposition, and a Solution to the Multiple Tuning Problem in Sparse PCA.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Joint Registration and Conformal Prediction for Partially Observed Functional Data.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Efficient Decision Trees for Tensor Regressions.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Distributed Nonparametric Regression with Heterogeneity Through Prediction-Based Aggregation.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
See all related articles

Related Experiment Video

Updated: Aug 20, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K

MIP-BOOST: Efficient and Effective L 0 Feature Selection for Linear Regression.

Ana Kenney1, Francesca Chiaromonte2, Giovanni Felici3

  • 1Dept. of Statistics, Penn State University. University Park PA, USA.

Journal of Computational and Graphical Statistics : a Joint Publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America
|November 21, 2022
PubMed
Summary
This summary is machine-generated.

MIP-BOOST enhances Mixed Integer Programming (MIP) for L0 feature selection, improving efficiency and performance in regression problems with complex data. This method addresses challenges like parameter tuning and feature collinearity.

Keywords:
LASSOMixed Integer OptimizationRegressioncross-validationfeature selectionwhitening

More Related Videos

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

235
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K

Related Experiment Videos

Last Updated: Aug 20, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

235
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K

Area of Science:

  • Mathematical Optimization
  • Statistical Learning
  • Computational Statistics

Background:

  • Mixed Integer Programming (MIP) is an emerging feature selection technique.
  • Standard MIP methods face challenges in parameter tuning and handling feature collinearity.
  • Existing regularization methods are popular but may not be optimal for all regression scenarios.

Purpose of the Study:

  • To propose MIP-BOOST, an improved MIP-based L0 feature selection method.
  • To reduce the computational burden of parameter tuning in MIP feature selection.
  • To enhance performance in regression with collinear features and varying signal strengths.

Main Methods:

  • Revision of standard Mixed Integer Programming feature selection.
  • Development of strategies to reduce computational burden for the sparsity bound parameter.
  • Integration of cross-validation tuning and exact optimization of Mixed Integer Programs.
  • Implementation of three synergistic proposals for improved efficiency and effectiveness.

Main Results:

  • MIP-BOOST offers a more efficient and effective L0 feature selection method.
  • Reduced computational burden in tuning the sparsity bound parameter.
  • Improved performance with feature collinearity and diverse signal characteristics.
  • Demonstrated computational viability for realistic applications.

Conclusions:

  • MIP-BOOST provides a robust and efficient alternative for L0 feature selection.
  • The method is suitable for large-scale and complex regression problems.
  • Enhanced performance and reduced tuning complexity make MIP-BOOST practical for real-world applications.