Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Goodness-of-Fit Test01:16

Goodness-of-Fit Test

7.8K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
7.8K
Multiple Regression01:25

Multiple Regression

3.7K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.7K
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

6.6K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
6.6K
Regression Toward the Mean01:52

Regression Toward the Mean

6.8K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.8K
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

438
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
438
Regression Analysis01:11

Regression Analysis

7.6K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
7.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Fostering a culture of inquiry through a dedicated Nursing Research Clinic: a mixed-methods evaluation of the CREATE model.

Frontiers in medicine·2026
Same author

The CDK4/6 inhibitor dalpiciclib augments the antitumor efficacy of enzalutamide in preclinical models of castration-resistant prostate cancer through inhibition of MCM4-mediated DNA replication.

Cell death & disease·2026
Same author

The GHK-Cu delays aging in Caenorhabditis elegans via coordinated regulation of mitochondrial function and activation of DAF-16/SKN-1 pathways.

Biogerontology·2026
Same author

DysNet: Learning Implicit Many-Body Interactions via Dynamically Attending to Body-Orders in Equivariant Graph Networks.

Journal of chemical theory and computation·2026
Same author

The Effect of Medication Beliefs on Medication Adherence in Patients After TKA: A Moderated Mediation Model.

International journal of behavioral medicine·2026
Same author

Mapping spatial and social inequities of long COVID across the United States: a retrospective cohort study.

Lancet regional health. Americas·2026
Same journal

High-dimensional model-assisted inference for treatment effects with multi-valued treatments.

Journal of econometrics·2025
Same journal

Feature-splitting Algorithms for Ultrahigh Dimensional Quantile Regression.

Journal of econometrics·2025
Same journal

Semiparametric approach to estimation of marginal mean effects and marginal quantile effects.

Journal of econometrics·2025
Same journal

Profiling the plight of disconnected youth in America.

Journal of econometrics·2025
Same journal

Econometric Causality: The Central Role of Thought Experiments.

Journal of econometrics·2024
Same journal

Dealing with imperfect randomization: Inference for the highscope perry preschool program.

Journal of econometrics·2024
See all related articles

Related Experiment Video

Updated: Dec 24, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K

Factor-Adjusted Regularized Model Selection.

Jianqing Fan1, Yuan Ke2, Kaizheng Wang1

  • 1Department of ORFE, Princeton University, USA.

Journal of Econometrics
|April 10, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces Factor-Adjusted Regularized Model Selection (FarmSelect) for high-dimensional sparse regression with dependent data. FarmSelect consistently identifies the true model even with highly correlated covariates.

Keywords:
C52C58Correlated covariatesFactor modelModel selection consistencyRegularized M-estimatorTime series

More Related Videos

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.6K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.4K

Related Experiment Videos

Last Updated: Dec 24, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K
Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.6K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.4K

Area of Science:

  • Statistics
  • Econometrics
  • Machine Learning

Background:

  • High-dimensional sparse regression models often face challenges with correlated covariates.
  • Existing model selection methods struggle with cross-sectional and serial dependencies in data.
  • Factor models offer a way to reduce covariate dependence in econometric and financial studies.

Purpose of the Study:

  • To develop a consistent model selection strategy for high-dimensional sparse regression with dependent data.
  • To address the limitations of current methods when dealing with highly correlated covariates.
  • To propose a novel approach that handles both cross-sectional and serial dependencies.

Main Methods:

  • Proposing Factor-Adjusted Regularized Model Selection (FarmSelect).
  • Utilizing latent factors and idiosyncratic components as predictors.
  • Transforming the problem from correlated to weakly correlated covariates via lifting.

Main Results:

  • Achieving model selection consistency under mild conditions.
  • Obtaining optimal rates of convergence.
  • Demonstrating strong finite sample performance in model selection and prediction.
  • Showing flexibility for weakly correlated and uncorrelated cases.

Conclusions:

  • FarmSelect provides a robust solution for model selection in high-dimensional sparse regression with dependent data.
  • The method effectively handles covariate dependence by leveraging factor models.
  • FarmSelect is applicable to a broad range of problems and is available via an R-package.