Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cancer Survival Analysis01:21

Cancer Survival Analysis

478
Cancer survival analysis focuses on quantifying and interpreting the time from a key starting point, such as diagnosis or the initiation of treatment, to a specific endpoint, such as remission or death. This analysis provides critical insights into treatment effectiveness and factors that influence patient outcomes, helping to shape clinical decisions and guide prognostic evaluations. A cornerstone of oncology research, survival analysis tackles the challenges of skewed, non-normally...
478
Survival Tree01:19

Survival Tree

180
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
180
Multiple Regression01:25

Multiple Regression

3.3K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.3K
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

339
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
339
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

18.1K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
18.1K
Regression Analysis01:11

Regression Analysis

6.4K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
6.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

[<sup>68</sup>Ga]Ga-PSMA-11 PET/CT for baseline staging of high-risk prostate cancer: A real-world study.

BJUI compass·2026
Same author

Toward a standardized and interoperable imaging biomarker catalog.

Insights into imaging·2026
Same author

The AI doctor will see you now: public perspectives on artificial intelligence in healthcare.

BJR artificial intelligence·2026
Same author

A Randomized, Phase II Clinical Trial of FLT-PET and FDG-PET for Early Response Assessment of Neoadjuvant Systemic Therapy in Triple Negative Breast Cancer.

Clinical cancer research : an official journal of the American Association for Cancer Research·2026
Same author

Diagnostic Accuracy of [<sup>68</sup>Ga]Ga-PSMA-11 PET-CT in Characterising Bone Lesions in Prostate Cancer: A Single-Centre Study.

Cancers·2026
Same author

Current patient attitudes to artificial intelligence applications in radiology.

The British journal of radiology·2026
Same journal

Widening Health Inequality and Causal Metabolic Drivers in Global Colorectal Cancer: A Multi-Dimensional Study.

Cancer informatics·2026
Same journal

GFAP-Dependent Transcriptional Dynamics and Cellular Heterogeneity in Primary, Recurrent, and Grade III Gliomas.

Cancer informatics·2026
Same journal

Translating Data Into Clinical Tools: An Integrative Strategy for Precision Biomarker Identification in Soft Tissue Sarcoma Diagnosis and Prognosis.

Cancer informatics·2026
Same journal

The MAPK Pathway Coordinates an Immunosuppressive Microenvironment in Colorectal Cancer: A Single-Cell Guided Prognostic Model.

Cancer informatics·2026
Same journal

Multi-Scale Cross-Attention Multiple Instance Learning Network for Automated Classification of Colorectal Polyps.

Cancer informatics·2026
Same journal

LEPR Contributes to Lung Squamous Cell Carcinoma: Insights From Mendelian Randomization and Experimental Studies.

Cancer informatics·2026
See all related articles

Related Experiment Video

Updated: Oct 11, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

954

Sparse Regression in Cancer Genomics: Comparing Variable Selection and Predictions in Real World Data.

Robert J O'Shea1, Sophia Tsoka2, Gary Jr Cook1,3

  • 1Department of Cancer Imaging, School of Biomedical Engineering and Imaging Sciences, King's College London, London, UK.

Cancer Informatics
|December 6, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces a novel method to evaluate gene interaction models using real cancer genomics data. L0L2 penalisation excelled in structural selection, while L1L2 penalisation improved coefficient recovery, outperforming traditional cross-validation.

Keywords:
Artificial intelligencecomputational biologygene regulatory networksgenomicsmodelsstatistical

More Related Videos

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

247
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K

Related Experiment Videos

Last Updated: Oct 11, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

954
Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

247
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Evaluating gene interaction models in cancer genomics is difficult due to uncertain true distributions.
  • Existing methods using synthetic or incomplete experimental data have limitations.
  • A real-world data-driven approach is needed for robust model comparison.

Purpose of the Study:

  • To develop and apply a novel benchmarking approach for genomic model inference algorithms using real-world data.
  • To compare the performance of LASSO, elastic net, best-subset selection, L0L1, and L0L2 penalisation methods.
  • To assess the efficacy of algorithmic preselection versus internal cross-validation for model selection.

Main Methods:

  • Extracted five large genomic datasets (n=4000) from Gene Expression Omnibus.
  • Trained 'gold-standard' regression models on data subspaces (n=4000, p=500).
  • Trained penalised regression models on smaller samples (n=25, 75, 150) and validated against gold-standard models, assessing variable selection and prediction accuracy.

Main Results:

  • L1L2 penalisation showed the highest cosine similarity for coefficient recovery.
  • L0L2 penalisation explained the most variance in test responses and achieved the highest variable selection F1 score.
  • Algorithmic preselection significantly outperformed internal cross-validation across all evaluated metrics.

Conclusions:

  • The study presents a novel, data-driven approach for comparing model selection methods in cancer genomics.
  • Benchmarking datasets are publicly available for future research.
  • L0L2 penalisation is recommended for structural selection, and L1L2 penalisation for coefficient recovery in genomic data analysis.