CANONICAL THRESHOLDING FOR NON-SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION.

Igor Silin¹, Jianqing Fan¹

¹Princeton University.

Annals of Statistics

September 23, 2022

Summary

This summary is machine-generated.

This study introduces canonical thresholding estimators for high-dimensional linear regression that replace the usual sparsity assumption with a decay condition on the covariance eigenvalues. The estimators are linked to LASSO and Principal Component Regression (PCR), and their analysis yields a new measure of problem complexity, the joint effective dimension.

Keywords:

High-dimensional linear regression; covariance eigenvalues decay; principal component regression; relative errors; thresholding
MSC classification: Primary 62J05; secondary 62H12, 62H25

Area of Science:

Statistics
Machine Learning
Data Science

Background:

High-dimensional linear regression often assumes sparse coefficients.
Existing methods may struggle when coefficients are not sparse.
Covariance matrix properties are crucial for understanding data structure.

Purpose of the Study:

To develop novel estimators for high-dimensional linear regression without sparsity assumptions.
To introduce a new family of estimators whose guarantees rest on eigenvalue decay of the covariance matrix.
To analyze the theoretical properties and performance of these estimators.

Main Methods:

Proposing canonical thresholding estimators.
Analyzing mean squared error and prediction error bounds.
Investigating relative errors and introducing joint effective dimension.
Establishing minimax lower bounds for optimality.
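
The general idea of thresholding in canonical form can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' estimator: it assumes a hard-thresholding rule applied to least-squares coefficients expressed in the principal-component basis, scaled by the square root of the sample covariance eigenvalues. The function name `canonical_threshold_fit`, the threshold `tau`, and the toy data are all hypothetical; the exact rule and the choice of threshold in the paper may differ.

```python
import numpy as np


def canonical_threshold_fit(X, y, tau):
    """Hard-threshold least-squares coefficients in the principal-component basis.

    Assumed rule (illustrative only): keep direction j when
    sqrt(lambda_j) * |theta_j| > tau, where lambda_j is the j-th sample
    covariance eigenvalue and theta_j the least-squares coefficient
    along the j-th eigenvector.
    """
    n = X.shape[0]
    # Thin SVD: X = U @ diag(s) @ Vt. Rows of Vt are eigenvectors of X.T @ X / n,
    # with eigenvalues lambda_j = s_j**2 / n.
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    # Drop numerically null directions to avoid dividing by zero.
    nonnull = s > 1e-10 * s.max()
    U, s, Vt = U[:, nonnull], s[nonnull], Vt[nonnull]
    # Least-squares coefficients in the canonical (eigenvector) basis.
    theta = (U.T @ y) / s
    # sqrt(lambda_j) * |theta_j| simplifies to |u_j' y| / sqrt(n).
    score = np.abs(s * theta) / np.sqrt(n)
    theta_kept = np.where(score > tau, theta, 0.0)
    # Map the thresholded coefficients back to the original coordinates.
    return Vt.T @ theta_kept, theta_kept


# Toy data (hypothetical): dense coefficients, polynomially decaying eigenvalues.
rng = np.random.default_rng(0)
n, p = 200, 50
eigvals = 1.0 / np.arange(1, p + 1) ** 2
X = rng.standard_normal((n, p)) * np.sqrt(eigvals)
beta = np.ones(p)  # not sparse
y = X @ beta + 0.1 * rng.standard_normal(n)

beta_hat, theta_hat = canonical_threshold_fit(X, y, tau=0.05)
n_kept = int(np.count_nonzero(theta_hat))
```

With `tau = 0` the sketch reduces to ordinary least squares. Unlike PCR, which keeps the leading eigen-directions regardless of the response, this rule selects directions by the size of their scaled coefficients, so a direction with a small eigenvalue can survive if its coefficient is large enough.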

Main Results:

Canonical thresholding estimators are proposed and analyzed.
Sufficient conditions for convergence based on eigenvalue decay are identified.
A new concept, joint effective dimension, is introduced to characterize problem complexity.
Minimax lower bounds demonstrate the optimality of the proposed methods.

Conclusions:

The proposed canonical thresholding estimators perform well in high-dimensional linear regression.
Eigenvalue decay is a key structural assumption for effective estimation.
The joint effective dimension provides a comprehensive measure of regression problem complexity.