Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K
Regression Analysis01:11

Regression Analysis

5.9K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.9K
Bootstrapping01:24

Bootstrapping

656
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
656
Residual Plots01:07

Residual Plots

5.0K
A residual plot is a statistical representation of data used to analyze correlation and regression results. It helps verify the requirements for drawing specific conclusions about correlation and regression. To obtain the residual plot, first, the residual for each data value is calculated, which is simply the vertical distance between the observed and the predicted value obtained from the regression equation.
When the residual values are plotted against the variable x, it is called a residual...
5.0K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.8K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.8K
Survival Tree01:19

Survival Tree

131
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
131

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Breastfeeding and risk of maternal type 2 diabetes: a prospective cohort study of 280 000 women in China.

BMJ open·2026
Same author

Burden and Correlates of Multiple Chronic Infections and Their Associations With Cancer Incidence in Chinese Adults: A Large Case-Cohort Study.

International journal of cancer·2026
Same author

Associations of 6600 SomaScan proteins with demographic, lifestyle, environmental and health characteristics in Chinese adults.

Scientific reports·2026
Same author

Diagnostic accuracy, treatment and prognosis of myocardial infarction: an 11-year follow-up of a community-based cohort of 0.5 million Chinese adults.

BMJ public health·2026
Same author

Phenome-Wide Associations of Polygenic Scores for Schizophrenia and Major Depression in 100,000 Chinese Adults.

Biological psychiatry global open science·2026
Same author

Dairy Consumption and Risk of Cardiometabolic Diseases: A Prospective Cohort Study of the China Kadoorie Biobank.

The Journal of nutrition·2026
Same journal

A model specification test for semiparametric nonignorable missing data modeling.

Econometrics and statistics·2024
Same journal

Constructing a polygenic risk score for childhood obesity using functional data analysis.

Econometrics and statistics·2023
Same journal

A Hierarchical Bayesian Model for Differential Connectivity in Multi-trial Brain Signals.

Econometrics and statistics·2020
Same journal

Identifying gene-environment interactions for prognosis using a robust approach.

Econometrics and statistics·2019
Same journal

Estimation of a Semiparametric Varying-Coefficient Mixed Regressive Spatial Autoregressive Model.

Econometrics and statistics·2019
See all related articles

Related Experiment Video

Updated: Aug 12, 2025

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data
10:46

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

10.7K

Regression Reconstruction from a Retrospective Sample.

Christiana Kartsonaki1, D R Cox2

  • 1MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford OX3 7LF, UK.

Econometrics and Statistics
|February 2, 2023
PubMed
Summary
This summary is machine-generated.

This study explores reconstructing originating distributions from case-control data. The linear regression coefficient of explanatory variables on outcomes shows remarkable stability, offering a reliable statistical measure.

Keywords:
Bias removalCase-control studyIndirect sampling

More Related Videos

Eye-tracking to Distinguish Comprehension-based and Oculomotor-based Regressive Eye Movements During Reading
05:54

Eye-tracking to Distinguish Comprehension-based and Oculomotor-based Regressive Eye Movements During Reading

Published on: October 18, 2018

6.3K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.3K

Related Experiment Videos

Last Updated: Aug 12, 2025

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data
10:46

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

10.7K
Eye-tracking to Distinguish Comprehension-based and Oculomotor-based Regressive Eye Movements During Reading
05:54

Eye-tracking to Distinguish Comprehension-based and Oculomotor-based Regressive Eye Movements During Reading

Published on: October 18, 2018

6.3K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.3K

Area of Science:

  • Biostatistics
  • Epidemiological Research Methods

Background:

  • Case-control studies are fundamental in retrospective research.
  • Understanding variable relationships is crucial for disease analysis.

Purpose of the Study:

  • To reconstruct originating distributions from case-control data.
  • To analyze the stability of regression coefficients between distinct explanatory variables and outcomes.

Main Methods:

  • Utilizing retrospective study designs.
  • Applying linear regression analysis for variable dependence.
  • Conducting theoretical analysis and simulations.

Main Results:

  • The linear regression coefficient of explanatory variables on outcomes demonstrates significant stability.
  • The intercept term shows less stability compared to the coefficient.
  • An approximation for the coefficient, independent of a specific variable, is derived.

Conclusions:

  • The stability of the linear regression coefficient provides a robust method for analysis.
  • This finding enhances the interpretation of case-control study results.
  • The derived approximation offers a valuable tool for statistical modeling.