Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Toward the Mean01:52

Regression Toward the Mean

6.9K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.9K
Cancer Survival Analysis01:21

Cancer Survival Analysis

654
Cancer survival analysis focuses on quantifying and interpreting the time from a key starting point, such as diagnosis or the initiation of treatment, to a specific endpoint, such as remission or death. This analysis provides critical insights into treatment effectiveness and factors that influence patient outcomes, helping to shape clinical decisions and guide prognostic evaluations. A cornerstone of oncology research, survival analysis tackles the challenges of skewed, non-normally...
654
Statistical Significance01:50

Statistical Significance

21.2K
Once data is collected from both the experimental and the control groups, a statistical analysis is conducted to find out if there are meaningful differences between the two groups. A statistical analysis determines how likely any difference found is due to chance (and thus not meaningful). In psychology, group differences are considered meaningful, or significant, if the odds that these differences occurred by chance alone are 5 percent or less. Stated another way, if we repeated this...
21.2K
Multiple Regression01:25

Multiple Regression

3.8K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.8K
Correlation and Regression00:53

Correlation and Regression

3.2K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
3.2K
Regression Analysis01:11

Regression Analysis

8.1K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
8.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Glioma region-specific metabolites associated with patient seizures.

Journal of neuro-oncology·2026
Same author

Electrospun nanofibers for antibiotic and probiotic co-delivery to target bacterial vaginosis.

International journal of pharmaceutics·2026
Same author

Glioblastoma region-specific metabolic signatures reflect patient survival duration.

Journal of neuro-oncology·2026
Same author

Fabrication of bioprints self-coated with thermally sensitive lactobacilli for CAUTI applications.

European journal of pharmaceutics and biopharmaceutics : official journal of Arbeitsgemeinschaft fur Pharmazeutische Verfahrenstechnik e.V·2026
Same author

Idiopathic Pulmonary Fibrosis Identification and Prediction of Disease Severity via Combination of Different Biomarker Types.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Non-invasive hair metabolome analysis reveals GERD and thyroid dysfunction in pulmonary fibrosis patients.

Metabolism open·2025
Same journal

An Ensemble Feature Selection Method for Biomarker Discovery.

Proceedings of the ... IEEE International Symposium on Signal Processing and Information Technology. IEEE International Symposium on Signal Processing and Information Technology·2019
Same journal

Evidence of bias towards buffered codons in human transcripts.

Proceedings of the ... IEEE International Symposium on Signal Processing and Information Technology. IEEE International Symposium on Signal Processing and Information Technology·2011
See all related articles

Related Experiment Video

Updated: Jan 22, 2026

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

497

Lung Cancer Survival Prediction via Machine Learning Regression, Classification, and Statistical Techniques.

James A Bartholomai1, Hermann B Frieboes1

  • 1Dept. of Bioengineering University of Louisville Louisville, KY.

Proceedings of the ... IEEE International Symposium on Signal Processing and Information Technology. IEEE International Symposium on Signal Processing and Information Technology
|July 18, 2019
PubMed
Summary
This summary is machine-generated.

This study enhances lung cancer survival prediction by combining regression and classification models. The hybrid approach improves accuracy, particularly for shorter survival times, offering better patient outcome insights.

Keywords:
SEER databasebiomedical big datalung cancermachine learningsupervised classification

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.8K
Competing-Risk Nomogram for Predicting Cancer-Specific Survival in Multiple Primary Colorectal Cancer Patients after Surgery
06:46

Competing-Risk Nomogram for Predicting Cancer-Specific Survival in Multiple Primary Colorectal Cancer Patients after Surgery

Published on: September 27, 2024

657

Related Experiment Videos

Last Updated: Jan 22, 2026

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

497
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.8K
Competing-Risk Nomogram for Predicting Cancer-Specific Survival in Multiple Primary Colorectal Cancer Patients after Surgery
06:46

Competing-Risk Nomogram for Predicting Cancer-Specific Survival in Multiple Primary Colorectal Cancer Patients after Surgery

Published on: September 27, 2024

657

Area of Science:

  • Oncology
  • Biostatistics
  • Machine Learning

Background:

  • Predictive models for lung cancer survival often struggle with accuracy for longer timeframes.
  • Existing regression models show reduced performance when predicting survival beyond 6 months.

Purpose of the Study:

  • To develop an improved approach for predicting lung cancer patient survival time.
  • To enhance model accuracy by combining regression and classification techniques.

Main Methods:

  • Utilized de-identified lung cancer patient data from the Surveillance, Epidemiology, and End Results (SEER) database.
  • Employed ANOVA for feature selection, Random Forests for classification, and Linear Regression, Gradient Boosted Machines (GBM), and Random Forests for regression.
  • Assessed classification accuracy using a confusion matrix and regression accuracy via Root Mean Square Error (RMSE).

Main Results:

  • Random Forests (RF) demonstrated superior performance for survival times less than or equal to 6 months (RMSE 10.52) and greater than 24 months (RMSE 20.51).
  • Gradient Boosted Machines (GBM) achieved the best performance for predicting survival times between 7 and 24 months (RMSE 15.65).
  • Comparative analysis indicated that regression models generally perform better for shorter survival durations than RMSE values alone suggest.

Conclusions:

  • Combining regression and classification models offers a more robust method for predicting lung cancer survival.
  • The developed hybrid models show promise in improving the accuracy of survival time predictions across different timeframes.
  • Further analysis suggests that visual comparisons may reveal model performance nuances not fully captured by RMSE, especially for shorter survival predictions.