Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

Regression Analysis

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

Microsoft Excel: Regression Analysis

Microsoft Excel: Regression Analysis

Regression analysis in Microsoft Excel is a powerful statistical method for examining the relationship between a dependent variable and one or more independent variables. It's used extensively in fields such as economics, biology, and business to predict outcomes, understand relationships, and make data-driven decisions. The most common type is linear regression, which attempts to fit a straight line through the data points to model the relationship between variables.
To perform regression...

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Correlation and Regression

Correlation and Regression

In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...

Cancer Survival Analysis

Cancer Survival Analysis

Cancer survival analysis focuses on quantifying and interpreting the time from a key starting point, such as diagnosis or the initiation of treatment, to a specific endpoint, such as remission or death. This analysis provides critical insights into treatment effectiveness and factors that influence patient outcomes, helping to shape clinical decisions and guide prognostic evaluations. A cornerstone of oncology research, survival analysis tackles the challenges of skewed, non-normally...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A foundation model for continuous glucose monitoring data.

Nature·2026

Same author

The hidden risk of round numbers and sharp thresholds in clinical practice.

NPJ digital medicine·2025

Same author

Deep phenotyping of health-disease continuum in the Human Phenotype Project.

Nature medicine·2025

Same author

Causal Representation Learning from Multi-modal Biomedical Observations.

ArXiv·2025

Same author

Scaling Structure Aware Virtual Screening to Billions of Molecules with SPRINT.

ArXiv·2025

Same author

Interpretable adenylation domain specificity prediction using protein language models.

bioRxiv : the preprint server for biology·2025

Same journal

Biomedical Concept Recognition with Error-aware Negative-enhanced Ranking Framework.

Bioinformatics (Oxford, England)·2026

Same journal

TEDLH: Domain HMMs for sensitive detection of remote homologues.

Bioinformatics (Oxford, England)·2026

Same journal

PLNFGL: Joint Estimation of Multi-Condition Gene Networks from Single-cell RNA-seq Data.

Bioinformatics (Oxford, England)·2026

Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026

Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026

Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 8, 2026

Author Spotlight: Unveiling Transmembrane Protein Family-Related Markers in Gastric Cancer and Implications for Targeted Therapies

Author Spotlight: Unveiling Transmembrane Protein Family-Related Markers in Gastric Cancer and Implications for Targeted Therapies

Published on: September 15, 2023

Personalized regression enables sample-specific pan-cancer analysis.

Benjamin J Lengerich¹, Bryon Aragam², Eric P Xing^1,2,3

¹Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, USA.

Bioinformatics (Oxford, England)

|June 29, 2018

Summary

This summary is machine-generated.

This study introduces a novel regularizer to uncover inter-sample heterogeneity in genomic analysis, enabling personalized statistical models for complex diseases like cancer. The method reveals sample-specific aberrations missed by traditional approaches.

More Related Videos

Using a Pan-Viral Microarray Assay Virochip to Screen Clinical Samples for Viral Pathogens

Using a Pan-Viral Microarray Assay Virochip to Screen Clinical Samples for Viral Pathogens

Published on: April 27, 2011

Isolation of Cancer Stem Cells From Human Prostate Cancer Samples

Isolation of Cancer Stem Cells From Human Prostate Cancer Samples

Published on: March 14, 2014

Related Experiment Videos

Last Updated: Feb 8, 2026

Author Spotlight: Unveiling Transmembrane Protein Family-Related Markers in Gastric Cancer and Implications for Targeted Therapies

Author Spotlight: Unveiling Transmembrane Protein Family-Related Markers in Gastric Cancer and Implications for Targeted Therapies

Published on: September 15, 2023

Using a Pan-Viral Microarray Assay Virochip to Screen Clinical Samples for Viral Pathogens

Using a Pan-Viral Microarray Assay Virochip to Screen Clinical Samples for Viral Pathogens

Published on: April 27, 2011

Isolation of Cancer Stem Cells From Human Prostate Cancer Samples

Isolation of Cancer Stem Cells From Human Prostate Cancer Samples

Published on: March 14, 2014

Area of Science:

Genomics
Statistical modeling
Bioinformatics

Background:

Inter-sample heterogeneity is crucial for understanding complex biological processes, particularly in cancer genomics.
Traditional genomic analysis methods often average data, masking individual patient variations and hindering the identification of causal mutations.
Developing personalized statistical models is essential for accurately analyzing patient heterogeneity.

Purpose of the Study:

To propose a novel regularizer for achieving patient-specific personalized estimation.
To address the limitations of population-level analysis in uncovering inter-sample heterogeneity.
To develop interpretable, patient-specific models for complex diseases.

Main Methods:

A novel regularizer is proposed that learns two latent distance metrics (between personalized parameters and clinical covariates).
The method allows data to dictate the structure of these latent distance metrics, avoiding prior assumptions.
Applied to a pan-cancer gene expression dataset from over 30 cancer types.

Main Results:

The method successfully learned patient-specific, interpretable models for a pan-cancer gene expression dataset.
Strong evidence of personalization effects was found both between cancer types and between individual patients.
Sample-specific aberrations, overlooked by population-level methods, were uncovered.

Conclusions:

The proposed regularizer offers a promising new path for the precision analysis of complex diseases like cancer.
Personalized statistical models can effectively capture inter-sample heterogeneity, leading to more accurate biological insights.
The findings highlight the importance of moving beyond averaged views in genomic analysis.