Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

Regression Analysis

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

Calibration Curves: Linear Least Squares

Calibration Curves: Linear Least Squares

A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Substitution Spectrum and Selection at G-quadruplexes in Great Ape Telomere-to-Telomere Genomes.

Genome biology and evolution·2026

Same author

Mammalian mitochondrial DNA accumulates insertions and deletions with age in energetically demanding tissues.

Molecular biology and evolution·2026

Same author

Allele Frequency Selection and No Age-Related Increase in Human Oocyte Mitochondrial Mutations.

Obstetrical & gynecological survey·2026

Same author

Contrasting pre-vaccine COVID-19 waves in Italy through functional data analysis.

Scientific reports·2025

Same author

Comparative analysis of single-stranded and non-canonical DNA formation in human and other ape cells with telomere-to-telomere genomes.

bioRxiv : the preprint server for biology·2025

Same author

Non-canonical DNA in bird telomere-to-telomere genomes.

bioRxiv : the preprint server for biology·2025

Same journal

Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

fastkqr: A Fast Algorithm for Kernel Quantile Regression.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

Empirical Bayes Covariance Decomposition, and a Solution to the Multiple Tuning Problem in Sparse PCA.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

Joint Registration and Conformal Prediction for Partially Observed Functional Data.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

Efficient Decision Trees for Tensor Regressions.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

Distributed Nonparametric Regression with Heterogeneity Through Prediction-Based Aggregation.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Aug 20, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

MIP-BOOST: Efficient and Effective L ₀ Feature Selection for Linear Regression.

Ana Kenney¹, Francesca Chiaromonte², Giovanni Felici³

¹Dept. of Statistics, Penn State University. University Park PA, USA.

Journal of Computational and Graphical Statistics : a Joint Publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America

|November 21, 2022

Summary

This summary is machine-generated.

MIP-BOOST enhances Mixed Integer Programming (MIP) for L0 feature selection, improving efficiency and performance in regression problems with complex data. This method addresses challenges like parameter tuning and feature collinearity.

Keywords:

LASSO Mixed Integer Optimization Regression cross-validation feature selection whitening

More Related Videos

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Related Experiment Videos

Last Updated: Aug 20, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Area of Science:

Mathematical Optimization
Statistical Learning
Computational Statistics

Background:

Mixed Integer Programming (MIP) is an emerging feature selection technique.
Standard MIP methods face challenges in parameter tuning and handling feature collinearity.
Existing regularization methods are popular but may not be optimal for all regression scenarios.

Purpose of the Study:

To propose MIP-BOOST, an improved MIP-based L0 feature selection method.
To reduce the computational burden of parameter tuning in MIP feature selection.
To enhance performance in regression with collinear features and varying signal strengths.

Main Methods:

Revision of standard Mixed Integer Programming feature selection.
Development of strategies to reduce computational burden for the sparsity bound parameter.
Integration of cross-validation tuning and exact optimization of Mixed Integer Programs.
Implementation of three synergistic proposals for improved efficiency and effectiveness.

Main Results:

MIP-BOOST offers a more efficient and effective L0 feature selection method.
Reduced computational burden in tuning the sparsity bound parameter.
Improved performance with feature collinearity and diverse signal characteristics.
Demonstrated computational viability for realistic applications.

Conclusions:

MIP-BOOST provides a robust and efficient alternative for L0 feature selection.
The method is suitable for large-scale and complex regression problems.
Enhanced performance and reduced tuning complexity make MIP-BOOST practical for real-world applications.