Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Toward the Mean01:52

Regression Toward the Mean

7.2K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
7.2K
Regression Analysis01:11

Regression Analysis

8.5K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
8.5K
Microsoft Excel: Regression Analysis01:18

Microsoft Excel: Regression Analysis

1.6K
Regression analysis in Microsoft Excel is a powerful statistical method for examining the relationship between a dependent variable and one or more independent variables. It's used extensively in fields such as economics, biology, and business to predict outcomes, understand relationships, and make data-driven decisions. The most common type is linear regression, which attempts to fit a straight line through the data points to model the relationship between variables.
To perform regression...
1.6K
Variability: Analysis01:11

Variability: Analysis

541
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
541
Multiple Regression01:25

Multiple Regression

4.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
4.0K
Correlation and Regression00:53

Correlation and Regression

3.5K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
3.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Association of lipoprotein(a) and peripheral artery disease: Findings from The hispanic community health study/study of latinos (HCHS/SOL).

American journal of preventive cardiology·2026
Same author

Feasibility of Escape Room Simulation in Community Health Nursing Education: A Quasi-Experimental Study of Student Perceptions of Competence and Confidence.

Nursing reports (Pavia, Italy)·2026
Same author

Non-Vigorous Physical Activity Associated with Reduced Hospitalization Risk with or without Diabetes or Peripheral Artery Disease: Study of Latinos.

Medicine and science in sports and exercise·2026
Same author

Visual Impairment and Cardiovascular Risk Factors in Hispanic and Latino Adults.

JAMA network open·2026
Same author

Interpersonal Victimization and Post-Traumatic Stress Among Transgender and Gender Expansive People: A Systematic Review.

International journal of environmental research and public health·2026
Same author

Types and frequencies of adverse events across clinical trials for patients with amyotrophic lateral sclerosis: an analysis of the Pooled Resource Open-Access ALS Clinical Trials (PRO-ACT) database.

Amyotrophic lateral sclerosis & frontotemporal degeneration·2026
Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026
Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026
See all related articles

Related Experiment Video

Updated: Feb 16, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.9K

Regression analysis for secondary response variable in a case-cohort study.

Yinghao Pan1, Jianwen Cai1, Sangmi Kim2

  • 1Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, U.S.A.

Biometrics
|December 30, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces a new statistical method for analyzing secondary outcomes in case-cohort studies. The approach efficiently uses existing data to explore relationships between exposures and additional health outcomes beyond the primary failure time.

Keywords:
Case-cohort designEstimated likelihoodSecondary outcomeSemiparametricValidation sample

More Related Videos

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.8K
Evaluating Leaf Responses to Microbial Secondary Metabolites Using A High-Throughput Format
05:51

Evaluating Leaf Responses to Microbial Secondary Metabolites Using A High-Throughput Format

Published on: December 5, 2025

513

Related Experiment Videos

Last Updated: Feb 16, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.9K
Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.8K
Evaluating Leaf Responses to Microbial Secondary Metabolites Using A High-Throughput Format
05:51

Evaluating Leaf Responses to Microbial Secondary Metabolites Using A High-Throughput Format

Published on: December 5, 2025

513

Area of Science:

  • Biostatistics
  • Epidemiology
  • Health Research Methods

Background:

  • Case-cohort study designs are cost-effective epidemiological tools.
  • Existing case-cohort data often contain valuable information on secondary outcomes not initially planned for analysis.
  • Methods for analyzing secondary outcomes within case-cohort studies are underdeveloped.

Purpose of the Study:

  • To propose a novel statistical approach for analyzing secondary outcomes using existing case-cohort study data.
  • To develop a method that efficiently integrates primary failure time data with secondary outcome data.
  • To provide a statistically robust framework for exploring exposure-secondary outcome relationships in case-cohort studies.

Main Methods:

  • A non-parametric estimated likelihood approach is proposed.
  • The method maximizes a semiparametric likelihood function incorporating both time-to-failure and secondary outcomes.
  • Statistical properties including consistency, efficiency, and asymptotic normality are theoretically established.

Main Results:

  • The proposed non-parametric estimated likelihood estimator is demonstrated to be consistent and efficient.
  • Asymptotic normality of the estimator is proven, supporting its use in statistical inference.
  • Simulation studies confirm the finite sample performance of the method.

Conclusions:

  • The developed method offers a statistically sound and efficient way to analyze secondary outcomes in case-cohort studies.
  • This approach maximizes the utility of existing case-cohort data for broader research questions.
  • The method was successfully illustrated using data from the Sister Study, demonstrating its practical applicability.