Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

Residual Plots

Residual Plots

A residual plot is a statistical representation of data used to analyze correlation and regression results. It helps verify the requirements for drawing specific conclusions about correlation and regression. To obtain the residual plot, first, the residual for each data value is calculated, which is simply the vertical distance between the observed and the predicted value obtained from the regression equation.
When the residual values are plotted against the variable x, it is called a residual...

Expected Frequencies in Goodness-of-Fit Tests

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).

Linearization and Approximation

Linearization and Approximation

Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...

Quadratic Models

Quadratic Models

Quadratic models are mathematical representations used to describe relationships in which the rate of change changes at a constant rate. These models appear in a wide variety of natural and engineered systems, especially those involving motion, forces, and optimization. One common application is analyzing the vertical motion of objects influenced by gravity, such as a ball thrown into the air.In such scenarios, the object's height changes over time in a curved pattern, rising to a maximum point...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Finding distributions that differ, with false discovery rate control.

Biometrika·2026

Same author

EFFICIENT AND MULTIPLY ROBUST RISK ESTIMATION UNDER GENERAL FORMS OF DATASET SHIFT.

Annals of statistics·2026

Same author

Anomalous Saturation of CO Adsorption at 26% on Cu(111) Governed by Nanometer-Scale Substrate-Mediated Interactions.

Journal of the American Chemical Society·2025

Same author

Sharp-SSL: Selective High-Dimensional Axis-Aligned Random Projections for Semi-Supervised Learning.

Journal of the American Statistical Association·2025

Same author

Opportunities and challenges of diffusion models for generative AI.

National science review·2024

Same author

Are Latent Factor Regression and Sparse Regression Adequate?

Journal of the American Statistical Association·2024

Same journal

A Class of Structured High-Dimensional Dynamic Covariance Matrices.

Communications in mathematics and statistics·2025

Same journal

Nested Group Testing Procedure.

Communications in mathematics and statistics·2022

Same journal

Multicategory Classification via Forward-Backward Support Vector Machine.

Communications in mathematics and statistics·2021

Same journal

Effects of Nonmonotonic Functional Responses on a Disease Transmission Model: Modeling and Simulation.

Communications in mathematics and statistics·2021

Same journal

Sharp Convergence of Nonlinear Functionals of a Class of Gaussian Random Fields.

Communications in mathematics and statistics·2019

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 19, 2026

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

Regularity Properties for Sparse Regression.

Edgar Dobriban¹, Jianqing Fan²

¹Department of Statistics, Stanford University, dobriban@stanford.edu.

Communications in Mathematics and Statistics

|June 23, 2016

Summary

This summary is machine-generated.

Checking conditions for high-dimensional sparse regression is NP-hard. However, the [Formula: see text] sensitivity condition is more robust and holds in many practical scenarios, offering guidance for data analysis.

Keywords:

computational complexity high-dimensional statistics restricted eigenvalue sparse regression ℓq sensitivity

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Related Experiment Videos

Last Updated: Mar 19, 2026

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Area of Science:

Statistics
Machine Learning
Computational Complexity

Background:

High-dimensional sparse regression relies on theoretical conditions like restricted eigenvalue and compatibility for estimator performance.
The practical verifiability of these core theoretical conditions has remained an open and critical question.

Purpose of the Study:

To rigorously investigate the computational complexity of checking central conditions in high-dimensional sparse regression theory.
To explore alternative conditions that may be more computationally tractable and broadly applicable.

Main Methods:

Proving the NP-hardness of checking restricted eigenvalue, compatibility, and [Formula: see text] sensitivity conditions.
Analyzing the [Formula: see text] sensitivity condition from an average-case complexity perspective.
Demonstrating probabilistic guarantees for the [Formula: see text] sensitivity condition under specific model assumptions.

Main Results:

Established that checking key conditions for Lasso and Dantzig selector performance is NP-hard.
Showed that the [Formula: see text] sensitivity condition is computationally weaker and more general.
Proved that [Formula: see text] sensitivity holds with high probability in well-behaved populations and is robust to data processing.

Conclusions:

The computational intractability of verifying core sparse regression conditions raises concerns about their direct application.
The [Formula: see text] sensitivity condition offers a more practical and robust alternative, providing valuable insights for analyzing high-dimensional correlated data.