Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

548
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
548
Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

391
Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
391
Actuarial Approach01:20

Actuarial Approach

284
The actuarial approach, a statistical method originally developed for life insurance risk assessment, is widely used to calculate survival rates in clinical and population studies. This method accounts for participants lost to follow-up or those who die from causes unrelated to the study, ensuring a more accurate representation of survival probabilities.
Consider the example of a high-risk surgical procedure with significant early-stage mortality. A two-year clinical study is conducted,...
284
Kaplan-Meier Approach01:24

Kaplan-Meier Approach

556
The Kaplan-Meier estimator is a non-parametric method used to estimate the survival function from time-to-event data. In medical research, it is frequently employed to measure the proportion of patients surviving for a certain period after treatment. This estimator is fundamental in analyzing time-to-event data, making it indispensable in clinical trials, epidemiological studies, and reliability engineering. By estimating survival probabilities, researchers can evaluate treatment effectiveness,...
556
Survival Tree01:19

Survival Tree

382
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
382
Censoring Survival Data01:09

Censoring Survival Data

518
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
518

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adjusted predictions for generalized estimating equations.

Biometrics·2025
Same author

Robust variable selection methods with Cox model-a selective practical benchmark study.

Briefings in bioinformatics·2024
Same author

Endoplasmic reticulum protein 5 attenuates platelet endoplasmic reticulum stress and secretion in a mouse model.

Blood advances·2022
Same author

SurvBenchmark: comprehensive benchmarking study of survival analysis methods using both omics data and clinical data.

GigaScience·2022
Same author

Cross-Platform Omics Prediction procedure: a statistical machine learning framework for wider implementation of precision medicine.

NPJ digital medicine·2022
Same journal

Methods for incorporating test result information within the high-dimensional propensity score framework: application in UK electronic health record data.

BMC medical research methodology·2026
Same journal

Sparse multi-way DMDC for longitudinal classification in high dimension low sample size data.

BMC medical research methodology·2026
Same journal

Tree-based exploratory identification of predictive biomarkers in non-randomized data.

BMC medical research methodology·2026
Same journal

Comparative evaluation of interrupted time series analytical methods for healthcare quality improvement research: a Monte Carlo simulation study.

BMC medical research methodology·2026
Same journal

Methodological advances in claims-based dementia algorithms: integrating medication and clinical data for medicare populations.

BMC medical research methodology·2026
Same journal

An interpretable XGboost algorithm for predicting 30-day mortality in acute pancreatitis using routine biomarkers.

BMC medical research methodology·2026
See all related articles

Related Experiment Video

Updated: Jan 13, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.7K

Two-stage sampling for better survival model performance.

Yunwei Zhang1,2,3, Samuel Muller4,5

  • 1School of Mathematics, Statistics, Chemistry and Physics, Murdoch University, Perth, WA, Australia. yunwei.zhang@murdoch.edu.au.

BMC Medical Research Methodology
|October 29, 2025
PubMed
Summary
This summary is machine-generated.

Careful data splitting improves high-dimensional survival model performance. A novel two-stage purposive sampling method effectively reduces data diversity, enhancing risk prediction accuracy for censored survival data.

Keywords:
Lasso Cox modelSimple random samplingStratified samplingSurvival analysisSurvival model performance

More Related Videos

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

482
Competing-Risk Nomogram for Predicting Cancer-Specific Survival in Multiple Primary Colorectal Cancer Patients after Surgery
06:46

Competing-Risk Nomogram for Predicting Cancer-Specific Survival in Multiple Primary Colorectal Cancer Patients after Surgery

Published on: September 27, 2024

637

Related Experiment Videos

Last Updated: Jan 13, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.7K
Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

482
Competing-Risk Nomogram for Predicting Cancer-Specific Survival in Multiple Primary Colorectal Cancer Patients after Surgery
06:46

Competing-Risk Nomogram for Predicting Cancer-Specific Survival in Multiple Primary Colorectal Cancer Patients after Surgery

Published on: September 27, 2024

637

Area of Science:

  • Biostatistics
  • Bioinformatics
  • Genomics

Background:

  • High-dimensional censored survival data is increasingly used in health and medicine for risk prediction.
  • Existing data splitting techniques for model training and evaluation lack examination regarding data splitting ratios and survival-specific characteristics in high-dimensional settings.

Purpose of the Study:

  • To investigate the impact of data splitting ratios and survival-specific characteristics on survival model performance using high-dimensional censored data.
  • To develop and validate an improved data sampling approach for enhanced survival model development.

Main Methods:

  • Empirical study comparing simple random sampling and stratified sampling techniques on gene expression datasets using Lasso Cox models.
  • Investigation of various data splitting ratios for simple random sampling and survival-specific variables for stratified sampling.
  • Development and validation of a two-stage purposive sampling approach.

Main Results:

  • Survival-specific characteristics significantly influence survival model performance across training, testing, and validation datasets.
  • The proposed two-stage purposive sampling approach effectively mitigates excessive diversity in training data.
  • This mitigation leads to improved survival model performance in both simulation and real-world data analyses.

Conclusions:

  • Selecting appropriate sampling techniques and considering key factors are crucial for developing and validating survival models.
  • The proposed two-stage purposive sampling method offers a viable solution for reducing data diversity and enhancing model performance.