Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

88
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
88
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

200
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
200
Kaplan-Meier Approach01:24

Kaplan-Meier Approach

154
The Kaplan-Meier estimator is a non-parametric method used to estimate the survival function from time-to-event data. In medical research, it is frequently employed to measure the proportion of patients surviving for a certain period after treatment. This estimator is fundamental in analyzing time-to-event data, making it indispensable in clinical trials, epidemiological studies, and reliability engineering. By estimating survival probabilities, researchers can evaluate treatment effectiveness,...
154
Introduction To Survival Analysis01:18

Introduction To Survival Analysis

250
Survival analysis is a statistical method used to study time-to-event data, where the "event" might represent outcomes like death, disease relapse, system failure, or recovery. A unique feature of survival data is censoring, which occurs when the event of interest has not been observed for some individuals during the study period. This requires specialized techniques to handle incomplete data effectively.
The primary goal of survival analysis is to estimate survival time—the time...
250
Survival Curves01:18

Survival Curves

167
Survival curves are graphical representations that depict the survival experience of a population over time, offering an intuitive way to track the proportion of individuals who remain event-free at each time point. These curves are widely used in fields such as medicine, public health, and reliability engineering to visualize and compare survival probabilities across different groups or conditions.
The Kaplan-Meier estimator is the most common method for constructing survival curves. This...
167
Cancer Survival Analysis01:21

Cancer Survival Analysis

357
Cancer survival analysis focuses on quantifying and interpreting the time from a key starting point, such as diagnosis or the initiation of treatment, to a specific endpoint, such as remission or death. This analysis provides critical insights into treatment effectiveness and factors that influence patient outcomes, helping to shape clinical decisions and guide prognostic evaluations. A cornerstone of oncology research, survival analysis tackles the challenges of skewed, non-normally...
357

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Radiologic response assessment in patients with desmoid-type fibromatosis treated with percutaneous cryoablation.

European radiology experimental·2026
Same author

Targeted Therapy-Induced Erythrocytosis in Thyroid Cancers: An Underrecognized Safety Signal from a Retrospective Study.

Thyroid : official journal of the American Thyroid Association·2026
Same author

End-of-life care: A retrospective study comparing sarcoma and patients with carcinoma.

Tumori·2026
Same author

Adjuvant chemotherapy in stage I triple-negative breast cancer: A systematic review and meta-analysis of survival outcomes.

Cancer treatment reviews·2026
Same author

Toxicity-over-time analysis of anti-EGFR-based maintenance versus stop&go in left-sided, RAS/BRAF wild-type metastatic colorectal cancer.

Journal of the National Cancer Institute·2026
Same author

Bayesian sequential learning for prognostication in extremity soft tissue sarcoma (BayeSarc): a retrospective, multicentre cohort study.

The Lancet. Oncology·2026
Same journal

Optimal Weighted Tests for Replication Studies and the 'Two-Trials Rule' With Multiple Hypotheses.

Statistics in medicine·2026
Same journal

Identifiable Copula-Double-Cox Models: A Fully Parametric Framework for Dependent Right-Censored Survival Data.

Statistics in medicine·2026
Same journal

Moving From Individualized Risk-Based Prevention to Benefit-Based Prevention: Estimating Individualized Life-Years Gained From Prevention Services as a Basis for Eligibility.

Statistics in medicine·2026
Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026
Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026
Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026
See all related articles

Related Experiment Video

Updated: Jul 11, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.1K

Sample size and predictive performance of machine learning methods with survival data: A simulation study.

Gabriele Infante1,2, Rosalba Miceli2, Federico Ambrogi1,3

  • 1Department of Clinical Sciences and Community Health, University of Milan, Milan, Italy.

Statistics in Medicine
|November 10, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a simulation framework to assess sample size needs for survival prediction models. Machine learning methods like random survival forests and neural networks require careful consideration of sample size for optimal performance.

Keywords:
machine learningpredictionsample sizesimulationtime-to-event

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.2K
Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma
04:09

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Published on: October 10, 2018

8.3K

Related Experiment Videos

Last Updated: Jul 11, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.1K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.2K
Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma
04:09

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Published on: October 10, 2018

8.3K

Area of Science:

  • Biostatistics
  • Machine Learning in Healthcare
  • Survival Analysis

Background:

  • Machine learning (ML) methods are increasingly popular for diagnostic and prognostic prediction models, often surpassing traditional regression techniques.
  • While the Cox proportional hazards model is standard for survival outcomes, ML offers potential for improved performance by capturing complex data patterns.
  • Determining adequate sample size for developing ML-based survival prediction models remains a challenge, unlike traditional statistical models.

Purpose of the Study:

  • To develop a time-to-event simulation framework for evaluating the performance of survival prediction models.
  • To compare the performance of Cox regression against various machine learning techniques, including random survival forest, gradient boosting, and neural networks.
  • To investigate the impact of varying sample sizes on the performance of these prediction models.

Main Methods:

  • A time-to-event simulation framework was developed using subject replications from publicly available databases.
  • Event times were simulated based on a Cox model incorporating nonlinearities, covariate interactions, and time-varying effects.
  • The performance of Cox regression was evaluated against tuned random survival forest, gradient boosting, and neural networks across different sample sizes.

Main Results:

  • The simulation framework allowed for direct comparison of model performances under various conditions.
  • Performance differences between Cox regression and ML techniques were observed at different sample sizes.
  • The study provides insights into the sample size requirements for developing robust survival prediction models using ML.

Conclusions:

  • The developed framework is valuable for understanding sample size requirements in survival prediction modeling.
  • Machine learning techniques show promise but necessitate careful sample size considerations for optimal predictive performance.
  • Further research is needed to establish specific sample size guidelines for various ML methods in survival analysis.