Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Toward the Mean01:52

Regression Toward the Mean

6.9K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.9K
Multiple Regression01:25

Multiple Regression

3.8K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.8K
Correlation and Regression00:53

Correlation and Regression

3.2K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
3.2K
Regression Analysis01:11

Regression Analysis

8.1K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
8.1K
Theory of Attribution II: Kelley's Covariation Theory01:29

Theory of Attribution II: Kelley's Covariation Theory

550
Attribution theory plays a crucial role in social psychology, helping to explain how individuals interpret the causes of behavior. One prominent model within this field is Harold Kelley's covariation theory, which provides a systematic approach to determining whether internal traits or external circumstances drive a person's actions. The model posits that individuals rely on three key types of information—consensus, consistency, and distinctiveness—to make these judgments.Consensus:...
550
Microsoft Excel: Regression Analysis01:18

Microsoft Excel: Regression Analysis

1.5K
Regression analysis in Microsoft Excel is a powerful statistical method for examining the relationship between a dependent variable and one or more independent variables. It's used extensively in fields such as economics, biology, and business to predict outcomes, understand relationships, and make data-driven decisions. The most common type is linear regression, which attempts to fit a straight line through the data points to model the relationship between variables.
To perform regression...
1.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multimetabolite Scores of Alignment with the 2018 World Cancer Research Fund/American Institute for Cancer Research Cancer Prevention Recommendations in the Interactive Diet and Activity Tracking in American Association of Retired Persons Study.

The Journal of nutrition·2026
Same author

Title: Ultra-processed food intake and postmenopausal breast cancer risk in the NIH-AARP diet and health study.

Journal of the National Cancer Institute·2026
Same author

Step Counts and Stepping Intensity Among U.S. Adults: National Health and Nutrition Examination Survey 2011-2014.

Medicine and science in sports and exercise·2026
Same author

Reduced varying coefficient models for regional quantile regression with multiple responses.

Biometrics·2026
Same author

Modifying Influence of Neighborhood Socioeconomic Deprivation in the Association of Diet Quality and Mortality: The National Institutes of Health (NIH)-AARP Diet and Health Study.

The Journal of nutrition·2026
Same author

Quantile regression of the relationship between demographic factors and leukocyte telomere length, measured by Southern blot and qPCR.

GeroScience·2026
Same journal

A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values.

Journal of multivariate analysis·2026
Same journal

Hierarchical structure-guided high-dimensional multi-view clustering.

Journal of multivariate analysis·2026
Same journal

Quadratic inference with dense functional responses.

Journal of multivariate analysis·2025
Same journal

Graph-constrained Analysis for Multivariate Functional Data.

Journal of multivariate analysis·2025
Same journal

From multivariate to functional data analysis: fundamentals, recent developments, and emerging areas.

Journal of multivariate analysis·2024
Same journal

Modeling the Cholesky factors of covariance matrices of multivariate longitudinal data.

Journal of multivariate analysis·2024
See all related articles

Related Experiment Video

Updated: Jan 26, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.8K

Forward regression for Cox models with high-dimensional covariates.

Hyokyoung G Hong1, Qi Zheng2, Yi Li3

  • 1Department of Statistics and Probability, Michigan State University, 19 Red Cedar Road, East Lansing, MI 48823, USA.

Journal of Multivariate Analysis
|April 23, 2019
PubMed
Summary
This summary is machine-generated.

Forward regression with an extended Bayesian information criterion (EBIC) stopping rule successfully identifies all relevant predictors in high-dimensional Cox models. This method ensures selection consistency for survival data analysis.

Keywords:
Forward selectionextended Bayesian information criteriahigh-dimensional predictorspartial likelihoodsure screening properties

More Related Videos

Measurements of CO2 Fluxes at Non-Ideal Eddy Covariance Sites
09:05

Measurements of CO2 Fluxes at Non-Ideal Eddy Covariance Sites

Published on: June 24, 2019

8.4K
Imaging Neurons within Thick Brain Sections Using the Golgi-Cox Method
10:26

Imaging Neurons within Thick Brain Sections Using the Golgi-Cox Method

Published on: April 18, 2017

19.2K

Related Experiment Videos

Last Updated: Jan 26, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.8K
Measurements of CO2 Fluxes at Non-Ideal Eddy Covariance Sites
09:05

Measurements of CO2 Fluxes at Non-Ideal Eddy Covariance Sites

Published on: June 24, 2019

8.4K
Imaging Neurons within Thick Brain Sections Using the Golgi-Cox Method
10:26

Imaging Neurons within Thick Brain Sections Using the Golgi-Cox Method

Published on: April 18, 2017

19.2K

Area of Science:

  • Statistics
  • Biostatistics
  • Machine Learning

Background:

  • Forward regression is a classical variable screening method, but its application is limited in high-dimensional settings due to computational complexity and theoretical uncertainties.
  • Existing research shows forward regression with an extended Bayesian information criterion (EBIC) can identify predictors in linear models, but its use in survival analysis remains unexplored.

Purpose of the Study:

  • To introduce and investigate a novel forward variable selection procedure for high-dimensional Cox proportional hazards models.
  • To establish theoretical guarantees for the selection consistency of this procedure in survival data settings.

Main Methods:

  • A sequential variable selection approach is proposed, utilizing the increment of partial likelihood to identify important predictors.
  • An extended Bayesian information criterion (EBIC) is employed as a stopping rule for the forward selection process.
  • New theoretical results on partial likelihood are developed to prove sure screening properties.

Main Results:

  • The proposed forward regression method consistently identifies all relevant predictors in high-dimensional Cox models.
  • The order of variable entry is determined by the magnitude of the partial likelihood increment.
  • The method demonstrates practical utility through simulations and analysis of lung cancer survival data.

Conclusions:

  • This study presents the first partial likelihood-based forward regression for high-dimensional survival analysis.
  • The developed method offers a theoretically sound and practically useful tool for variable selection in complex survival data.
  • The findings have implications for biomarker discovery in cancer research and other survival-related studies.