Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

197
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
197
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

1.6K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
1.6K
Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.4K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.4K
Routh-Hurwitz Criterion II01:19

Routh-Hurwitz Criterion II

248
In the application of the Routh-Hurwitz criterion, two specific scenarios can arise that complicate stability analysis.
The first scenario occurs when a singular zero appears in the first column of the Routh table. This situation creates a division by zero issues. To resolve this, a small positive or negative number, denoted as epsilon (∈), is substituted for the zero. The stability analysis proceeds by assuming a sign for ∈. If ∈ is positive, any sign change in the first...
248
Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Educational disparities in STEM during COVID-induced distance learning and a potential strategy to address them.

Nature communications·2026
Same author

Discussion of "Data fission: splitting a single data point".

Journal of the American Statistical Association·2025
Same author

Inferring independent sets of Gaussian variables after thresholding correlations.

Journal of the American Statistical Association·2025
Same author

Generalized data thinning using sufficient statistics.

Journal of the American Statistical Association·2025
Same author

Selective Inference for Hierarchical Clustering.

Journal of the American Statistical Association·2024
Same author

Controlling costs: Feature selection on a budget.

Stat·2024
Same journal

Instrumental Variable Estimation of Marginal Structural Mean Models for Time-Varying Treatment.

Journal of the American Statistical Association·2026
Same journal

Semiparametric Joint Modeling for Survival Analysis with Longitudinal Covariates.

Journal of the American Statistical Association·2026
Same journal

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Association·2026
Same journal

Facilitating Heterogeneous Effect Estimation via Statistically Efficient Categorical Modifiers.

Journal of the American Statistical Association·2026
Same journal

Nonparametric Density Estimation of a Long-Term Trend from Repeated Semicontinuous Data.

Journal of the American Statistical Association·2026
Same journal

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Clinicogenomic Data.

Journal of the American Statistical Association·2026
See all related articles

Related Experiment Video

Updated: Jul 4, 2025

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.5K

Sparse Reduced Rank Huber Regression in High Dimensions.

Kean Ming Tan1, Qiang Sun2, Daniela Witten3

  • 1Department of Statistics, University of Michigan, Ann Arbor, MI.

Journal of the American Statistical Association
|January 29, 2024
PubMed
Summary
This summary is machine-generated.

We introduce a novel sparse reduced rank Huber regression method for high-dimensional data analysis with heavy-tailed noise. This approach offers improved statistical bias analysis and error bounds, outperforming existing methods.

Keywords:
Convex relaxationHuberLow rankSparsityapproximationloss

More Related Videos

Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.9K
O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

6.6K

Related Experiment Videos

Last Updated: Jul 4, 2025

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.5K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.9K
O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

6.6K

Area of Science:

  • Statistics
  • Machine Learning
  • Data Science

Background:

  • High-dimensional data analysis presents challenges due to noise and complexity.
  • Existing reduced rank regression methods often overlook heavy-tailed noise characteristics.
  • Robust statistical methods are crucial for reliable analysis of complex datasets.

Purpose of the Study:

  • To develop a robust regression method for high-dimensional data with heavy-tailed noise.
  • To establish theoretical guarantees for the proposed method's estimation accuracy.
  • To analyze the trade-off between noise properties and statistical bias.

Main Methods:

  • Proposing a sparse reduced rank Huber regression.
  • Employing convex relaxation of a non-convex optimization problem.
  • Utilizing block coordinate descent and alternating direction method of multipliers algorithms.

Main Results:

  • Established non-asymptotic estimation error bounds under Frobenius and nuclear norms.
  • Quantified the trade-off between noise heavy-tailedness and statistical bias.
  • Demonstrated convergence rates dependent on noise moment bounds, matching sub-Gaussian rates for second-moment bounded noise.

Conclusions:

  • The proposed sparse reduced rank Huber regression effectively handles high-dimensional data with heavy-tailed noise.
  • Theoretical analysis provides crucial insights into the method's performance under varying noise conditions.
  • Numerical studies and a data application validate the method's practical utility.