



Updated: Dec 24, 2025


Factor-Adjusted Regularized Model Selection.

Jianqing Fan¹, Yuan Ke², Kaizheng Wang¹

¹Department of ORFE, Princeton University, USA.

Journal of Econometrics

April 10, 2020

Summary

This summary is machine-generated.

This study introduces Factor-Adjusted Regularized Model Selection (FarmSelect) for high-dimensional sparse regression with dependent data. FarmSelect consistently identifies the true model even with highly correlated covariates.

Keywords:

C52; C58; Correlated covariates; Factor model; Model selection consistency; Regularized M-estimator; Time series


Area of Science:

Statistics
Econometrics
Machine Learning

Background:

High-dimensional sparse regression models often face challenges with correlated covariates.
Existing model selection methods struggle with cross-sectional and serial dependencies in data.
Factor models offer a way to reduce covariate dependence in econometric and financial studies.

Purpose of the Study:

To develop a consistent model selection strategy for high-dimensional sparse regression with dependent data.
To address the limitations of current methods when dealing with highly correlated covariates.
To propose a novel approach that handles both cross-sectional and serial dependencies.

Main Methods:

Proposing Factor-Adjusted Regularized Model Selection (FarmSelect).
Utilizing latent factors and idiosyncratic components as predictors.
Transforming the problem from correlated to weakly correlated covariates via lifting.
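
The steps above can be illustrated with a minimal sketch for the least-squares case. It is not the authors' implementation and not the API of their R package. It assumes the covariates follow an approximate factor model (X ≈ F Bᵀ + U), estimates the factors F by principal components, and then runs an L1-penalized regression on the idiosyncratic part U after projecting the factors out of the response. The function names (`factor_adjust`, `lasso_cd`, `farm_select_sketch`), the hand-written coordinate-descent lasso, the fixed number of factors, and the penalty level are all illustrative assumptions.

```python
import numpy as np

def factor_adjust(X, n_factors):
    """Split X (n x p) into estimated factors F (n x K) and an
    idiosyncratic remainder U (n x p) using principal components."""
    Xc = X - X.mean(axis=0)
    n = Xc.shape[0]
    left, _, _ = np.linalg.svd(Xc, full_matrices=False)
    F = np.sqrt(n) * left[:, :n_factors]   # scaled so that F'F / n = I
    B = Xc.T @ F / n                       # estimated loadings (p x K)
    U = Xc - F @ B.T                       # idiosyncratic components
    return F, U

def soft_threshold(z, t):
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_cd(X, y, lam, n_iter=200):
    """Plain coordinate descent for (1/2n)||y - Xb||^2 + lam * ||b||_1."""
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    resid = y.astype(float).copy()
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            rho = X[:, j] @ resid / n + col_sq[j] * beta[j]
            new = soft_threshold(rho, lam) / col_sq[j]
            resid += X[:, j] * (beta[j] - new)
            beta[j] = new
    return beta

def farm_select_sketch(X, y, n_factors, lam):
    """Factor-adjusted selection sketch: lasso on the idiosyncratic
    components after removing the factor directions from y.
    n_factors and lam are assumed given; choosing them is not covered here."""
    n = X.shape[0]
    F, U = factor_adjust(X, n_factors)
    yc = y - y.mean()
    y_tilde = yc - F @ (F.T @ yc) / n      # project out the estimated factors
    beta = lasso_cd(U, y_tilde, lam)
    return np.flatnonzero(beta), beta

# Illustrative simulated data (hypothetical, not from the paper):
# two common factors make all 50 covariates strongly correlated.
rng = np.random.default_rng(0)
n, p, K = 200, 50, 2
X_demo = rng.standard_normal((n, K)) @ rng.standard_normal((p, K)).T \
    + rng.standard_normal((n, p))
beta_true = np.zeros(p)
beta_true[:3] = 3.0
y_demo = X_demo @ beta_true + 0.5 * rng.standard_normal(n)
support, coef = farm_select_sketch(X_demo, y_demo, n_factors=K, lam=0.3)
print("selected covariates:", support)
```

In this sketch the penalized regression sees only the weakly correlated columns of U rather than the original covariates, which is the sense in which the problem is transformed. The number of factors and the penalty level are passed in by hand here; in practice they would need to be chosen from the data.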

Main Results:

Achieving model selection consistency under mild conditions.
Obtaining optimal rates of convergence.
Demonstrating strong finite sample performance in model selection and prediction.
Showing flexibility for weakly correlated and uncorrelated cases.

Conclusions:

FarmSelect provides a robust solution for model selection in high-dimensional sparse regression with dependent data.
The method effectively handles covariate dependence by leveraging factor models.
FarmSelect is applicable to a broad range of problems and is available as an R package.