Sparse Reduced Rank Huber Regression in High Dimensions.

Kean Ming Tan¹, Qiang Sun², Daniela Witten³

¹Department of Statistics, University of Michigan, Ann Arbor, MI.

Journal of the American Statistical Association

January 29, 2024

Summary

This summary is machine-generated.

We introduce a sparse reduced rank Huber regression method for analyzing high-dimensional data with heavy-tailed noise. The analysis establishes non-asymptotic estimation error bounds and quantifies the trade-off between the heavy-tailedness of the noise and statistical bias.

Keywords:

Convex relaxation; Huber loss; Low rank approximation; Sparsity

Area of Science:

Statistics
Machine Learning
Data Science

Background:

High-dimensional data analysis presents challenges due to noise and complexity.
Existing reduced rank regression methods often overlook heavy-tailed noise characteristics.
Robust statistical methods are crucial for reliable analysis of complex datasets.
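
To make the motivation concrete, the following is a minimal sketch of why a robust loss matters when a single observation is extreme. It is not taken from the paper: the toy data, the threshold `tau`, and the helper name `huber_location` are all assumptions chosen for illustration. It compares the sample mean with a Huber-type estimate of location computed by iteratively reweighted averaging.

```python
import numpy as np

def huber_location(x, tau=1.0, n_iter=100):
    """Huber M-estimate of location via iteratively reweighted averaging.

    Hypothetical helper for illustration; tau is the robustification threshold.
    """
    x = np.asarray(x, dtype=float)
    mu = np.median(x)  # robust starting point
    for _ in range(n_iter):
        r = np.abs(x - mu)
        # Weight 1 for residuals within tau, tau/|r| for larger residuals.
        w = np.where(r <= tau, 1.0, tau / np.maximum(r, 1e-12))
        mu = np.sum(w * x) / np.sum(w)
    return mu

# Toy data (assumed): five well-behaved points and one gross outlier.
data = np.array([-1.0, -0.5, 0.0, 0.5, 1.0, 50.0])

mean_est = data.mean()                      # pulled strongly toward the outlier
huber_est = huber_location(data, tau=1.0)   # stays near the bulk of the data

print("sample mean:   ", mean_est)
print("Huber estimate:", huber_est)
```

The outlier receives a weight of roughly `tau / |residual|`, so its influence on the estimate is bounded, whereas it enters the sample mean with full weight.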

Purpose of the Study:

To develop a robust regression method for high-dimensional data with heavy-tailed noise.
To establish theoretical guarantees for the proposed method's estimation accuracy.
To analyze the trade-off between noise properties and statistical bias.

Main Methods:

Proposing a sparse reduced rank Huber regression.
Employing convex relaxation of a non-convex optimization problem.
Utilizing block coordinate descent and alternating direction method of multipliers algorithms.
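
The ingredients named above can be sketched in a few lines. This is an illustrative sketch, not the authors' implementation: the exact form of the objective (Huber loss on the residuals plus a nuclear-norm penalty and an entrywise l1 penalty weighted by `gamma`) and all names (`huber_loss`, `soft_threshold`, `singular_value_threshold`, `objective`, `tau`, `lam`, `gamma`) are assumptions. The two thresholding functions are the standard proximal operators of the l1 and nuclear norms, which commonly appear as sub-steps in ADMM-type solvers for penalties of this kind.

```python
import numpy as np

def huber_loss(r, tau):
    """Elementwise Huber loss: quadratic for |r| <= tau, linear beyond."""
    r = np.asarray(r, dtype=float)
    a = np.abs(r)
    return np.where(a <= tau, 0.5 * r**2, tau * a - 0.5 * tau**2)

def soft_threshold(A, t):
    """Proximal operator of t * (entrywise l1 norm): shrink entries toward 0."""
    A = np.asarray(A, dtype=float)
    return np.sign(A) * np.maximum(np.abs(A) - t, 0.0)

def singular_value_threshold(A, t):
    """Proximal operator of t * (nuclear norm): shrink singular values toward 0."""
    U, s, Vt = np.linalg.svd(np.asarray(A, dtype=float), full_matrices=False)
    s_shrunk = np.maximum(s - t, 0.0)
    return (U * s_shrunk) @ Vt  # scales the columns of U by the shrunk values

def objective(A, X, Y, tau, lam, gamma):
    """Assumed convex objective: average Huber fit plus low-rank and sparsity penalties."""
    R = Y - X @ A                      # residual matrix, one row per sample
    fit = huber_loss(R, tau).sum() / X.shape[0]
    penalty = lam * (np.linalg.norm(A, "nuc") + gamma * np.abs(A).sum())
    return fit + penalty

# Small usage example on assumed toy inputs.
X = np.eye(2)
Y = np.array([[0.5, 3.0], [0.0, 0.0]])
A0 = np.zeros((2, 2))
print(objective(A0, X, Y, tau=1.0, lam=0.1, gamma=1.0))
```

Singular value thresholding drives small singular values to exactly zero, which encourages low rank, while soft thresholding drives small entries to exactly zero, which encourages sparsity.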

Main Results:

Established non-asymptotic estimation error bounds under Frobenius and nuclear norms.
Quantified the trade-off between noise heavy-tailedness and statistical bias.
Demonstrated convergence rates dependent on noise moment bounds, matching sub-Gaussian rates for second-moment bounded noise.
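
For readers less familiar with the two norms in which the error bounds are stated, the short sketch below computes both for the difference between an estimate and a target coefficient matrix. The function name `estimation_errors` and the example matrices are assumptions for illustration only; the Frobenius norm is the square root of the sum of squared entries, and the nuclear norm is the sum of singular values.

```python
import numpy as np

def estimation_errors(A_hat, A_true):
    """Return the Frobenius-norm and nuclear-norm errors of an estimate."""
    D = np.asarray(A_hat, dtype=float) - np.asarray(A_true, dtype=float)
    return {
        "frobenius": np.linalg.norm(D, "fro"),  # sqrt of sum of squared entries
        "nuclear": np.linalg.norm(D, "nuc"),    # sum of singular values
    }

# Assumed toy matrices: the difference has singular values 3 and 4.
A_true = np.zeros((2, 2))
A_hat = np.diag([3.0, 4.0])
errors = estimation_errors(A_hat, A_true)
print(errors)
```

The nuclear-norm error is never smaller than the Frobenius-norm error, so a bound in the nuclear norm is the stronger of the two statements for a given magnitude.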

Conclusions:

The proposed sparse reduced rank Huber regression effectively handles high-dimensional data with heavy-tailed noise.
Theoretical analysis provides crucial insights into the method's performance under varying noise conditions.
Numerical studies and a data application validate the method's practical utility.