Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Regression01:25

Multiple Regression

4.2K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
4.2K
Regression Analysis01:11

Regression Analysis

8.7K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
8.7K
Regression Toward the Mean01:52

Regression Toward the Mean

7.2K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
7.2K
Calculating and Interpreting the Linear Correlation Coefficient01:11

Calculating and Interpreting the Linear Correlation Coefficient

8.3K
The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable, x, and the dependent variable, y. Hence, it is also known as the Pearson product-moment correlation coefficient. It can be calculated using the following equation:
8.3K
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

522
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
522
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

9.7K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
9.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Delphinidin targets voltage-dependent anion channel 1 to inhibit ferroptosis and protect against retinal photochemical damage.

Phytomedicine : international journal of phytotherapy and phytopharmacology·2026
Same author

Genome-wide analysis of the <i>GID</i> gene family in soybean and analysis of expression under gibberellin treatment.

Frontiers in plant science·2026
Same author

Decentralized EM algorithm for Gaussian mixtures under data heterogeneity and partial labeling.

Biometrics·2026
Same author

Multiscale multimodal graph convolutional networks for identifying essential tremor and dystonic tremor.

Neurobiology of disease·2026
Same author

Is splenic artery embolization beneficial when splenic angiography is negative? A multicenter observational study.

The journal of trauma and acute care surgery·2026
Same author

Quantitative venous outflow profiles based on four-dimensional computed tomography angiography are associated with tissue level collaterals and clinical outcomes of acute ischemic stroke patients.

Quantitative imaging in medicine and surgery·2026
Same journal

High-dimensional model-assisted inference for treatment effects with multi-valued treatments.

Journal of econometrics·2025
Same journal

Feature-splitting Algorithms for Ultrahigh Dimensional Quantile Regression.

Journal of econometrics·2025
Same journal

Semiparametric approach to estimation of marginal mean effects and marginal quantile effects.

Journal of econometrics·2025
Same journal

Profiling the plight of disconnected youth in America.

Journal of econometrics·2025
Same journal

Econometric Causality: The Central Role of Thought Experiments.

Journal of econometrics·2024
Same journal

Dealing with imperfect randomization: Inference for the highscope perry preschool program.

Journal of econometrics·2024
See all related articles

Related Experiment Video

Updated: Feb 27, 2026

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.8K

Testing a single regression coefficient in high dimensional linear models.

Wei Lan1, Ping-Shou Zhong2, Runze Li3

  • 1Statistics School and Center of Statistical Research, Southwestern University of Finance and Economics, Chengdu, PR China.

Journal of Econometrics
|July 1, 2017
PubMed
Summary
This summary is machine-generated.

High-dimensional linear regression faces challenges with standard significance tests. The Correlated Predictors Screening (CPS) method offers a novel solution for testing covariate significance, enabling consistent model selection even with correlated predictors.

Keywords:
Correlated Predictors ScreeningFalse discovery rateHigh dimensional dataSingle coefficient test

More Related Videos

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.7K
The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

6.4K

Related Experiment Videos

Last Updated: Feb 27, 2026

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.8K
Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.7K
The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

6.4K

Area of Science:

  • Statistics
  • Econometrics
  • Machine Learning

Background:

  • Classical z-tests (or t-tests) for regression coefficients are invalid in high-dimensional data where covariates outnumber samples.
  • High dimensionality and multicollinearity in linear regression models pose significant challenges for traditional statistical inference.

Purpose of the Study:

  • To propose a novel and simple method, Correlated Predictors Screening (CPS), for testing the significance of regression coefficients in high-dimensional settings.
  • To enable the application of z-tests for covariate significance assessment even when predictors are highly correlated.
  • To achieve consistent model selection through multiple hypothesis testing with controlled false discovery rate.

Main Methods:

  • Introduction of the Correlated Predictors Screening (CPS) method to manage highly correlated predictors.
  • Application of ordinary least squares (OLS) for estimating regression coefficients after CPS.
  • Demonstration of estimator consistency and asymptotic normality, even with heteroscedastic errors.
  • Multiple hypothesis testing using p-values from covariate significance tests, controlling the false discovery rate (FDR).

Main Results:

  • The proposed CPS method allows for the valid application of z-tests to assess individual covariate significance.
  • The resulting estimators are proven to be consistent and asymptotically normal, robust to heteroscedasticity.
  • Multiple hypothesis testing with controlled FDR leads to consistent model selection.
  • Simulation studies and empirical examples validate the method's finite sample performance and practical utility.

Conclusions:

  • The Correlated Predictors Screening (CPS) method provides a statistically sound and practical approach for significance testing in high-dimensional linear regression.
  • This method effectively addresses challenges posed by multicollinearity, enabling reliable inference and model selection.
  • The CPS approach enhances the applicability of classical statistical tests in modern, data-rich environments.