Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Toward the Mean01:52

Regression Toward the Mean

7.2K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
7.2K
Regression Analysis01:11

Regression Analysis

8.4K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
8.4K
Microsoft Excel: Regression Analysis01:18

Microsoft Excel: Regression Analysis

1.6K
Regression analysis in Microsoft Excel is a powerful statistical method for examining the relationship between a dependent variable and one or more independent variables. It's used extensively in fields such as economics, biology, and business to predict outcomes, understand relationships, and make data-driven decisions. The most common type is linear regression, which attempts to fit a straight line through the data points to model the relationship between variables.
To perform regression...
1.6K
Multiple Regression01:25

Multiple Regression

4.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
4.0K
Correlation and Regression00:53

Correlation and Regression

3.5K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
3.5K
Cancer Survival Analysis01:21

Cancer Survival Analysis

774
Cancer survival analysis focuses on quantifying and interpreting the time from a key starting point, such as diagnosis or the initiation of treatment, to a specific endpoint, such as remission or death. This analysis provides critical insights into treatment effectiveness and factors that influence patient outcomes, helping to shape clinical decisions and guide prognostic evaluations. A cornerstone of oncology research, survival analysis tackles the challenges of skewed, non-normally...
774

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A foundation model for continuous glucose monitoring data.

Nature·2026
Same author

The hidden risk of round numbers and sharp thresholds in clinical practice.

NPJ digital medicine·2025
Same author

Deep phenotyping of health-disease continuum in the Human Phenotype Project.

Nature medicine·2025
Same author

Causal Representation Learning from Multi-modal Biomedical Observations.

ArXiv·2025
Same author

Scaling Structure Aware Virtual Screening to Billions of Molecules with SPRINT.

ArXiv·2025
Same author

Interpretable adenylation domain specificity prediction using protein language models.

bioRxiv : the preprint server for biology·2025
Same journal

Biomedical Concept Recognition with Error-aware Negative-enhanced Ranking Framework.

Bioinformatics (Oxford, England)·2026
Same journal

TEDLH: Domain HMMs for sensitive detection of remote homologues.

Bioinformatics (Oxford, England)·2026
Same journal

PLNFGL: Joint Estimation of Multi-Condition Gene Networks from Single-cell RNA-seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Feb 8, 2026

Author Spotlight: Unveiling Transmembrane Protein Family-Related Markers in Gastric Cancer and Implications for Targeted Therapies
07:47

Author Spotlight: Unveiling Transmembrane Protein Family-Related Markers in Gastric Cancer and Implications for Targeted Therapies

Published on: September 15, 2023

2.3K

Personalized regression enables sample-specific pan-cancer analysis.

Benjamin J Lengerich1, Bryon Aragam2, Eric P Xing1,2,3

  • 1Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, USA.

Bioinformatics (Oxford, England)
|June 29, 2018
PubMed
Summary
This summary is machine-generated.

This study introduces a novel regularizer to uncover inter-sample heterogeneity in genomic analysis, enabling personalized statistical models for complex diseases like cancer. The method reveals sample-specific aberrations missed by traditional approaches.

More Related Videos

Using a Pan-Viral Microarray Assay Virochip to Screen Clinical Samples for Viral Pathogens
13:45

Using a Pan-Viral Microarray Assay Virochip to Screen Clinical Samples for Viral Pathogens

Published on: April 27, 2011

19.4K
Isolation of Cancer Stem Cells From Human Prostate Cancer Samples
07:16

Isolation of Cancer Stem Cells From Human Prostate Cancer Samples

Published on: March 14, 2014

14.6K

Related Experiment Videos

Last Updated: Feb 8, 2026

Author Spotlight: Unveiling Transmembrane Protein Family-Related Markers in Gastric Cancer and Implications for Targeted Therapies
07:47

Author Spotlight: Unveiling Transmembrane Protein Family-Related Markers in Gastric Cancer and Implications for Targeted Therapies

Published on: September 15, 2023

2.3K
Using a Pan-Viral Microarray Assay Virochip to Screen Clinical Samples for Viral Pathogens
13:45

Using a Pan-Viral Microarray Assay Virochip to Screen Clinical Samples for Viral Pathogens

Published on: April 27, 2011

19.4K
Isolation of Cancer Stem Cells From Human Prostate Cancer Samples
07:16

Isolation of Cancer Stem Cells From Human Prostate Cancer Samples

Published on: March 14, 2014

14.6K

Area of Science:

  • Genomics
  • Statistical modeling
  • Bioinformatics

Background:

  • Inter-sample heterogeneity is crucial for understanding complex biological processes, particularly in cancer genomics.
  • Traditional genomic analysis methods often average data, masking individual patient variations and hindering the identification of causal mutations.
  • Developing personalized statistical models is essential for accurately analyzing patient heterogeneity.

Purpose of the Study:

  • To propose a novel regularizer for achieving patient-specific personalized estimation.
  • To address the limitations of population-level analysis in uncovering inter-sample heterogeneity.
  • To develop interpretable, patient-specific models for complex diseases.

Main Methods:

  • A novel regularizer is proposed that learns two latent distance metrics (between personalized parameters and clinical covariates).
  • The method allows data to dictate the structure of these latent distance metrics, avoiding prior assumptions.
  • Applied to a pan-cancer gene expression dataset from over 30 cancer types.

Main Results:

  • The method successfully learned patient-specific, interpretable models for a pan-cancer gene expression dataset.
  • Strong evidence of personalization effects was found both between cancer types and between individual patients.
  • Sample-specific aberrations, overlooked by population-level methods, were uncovered.

Conclusions:

  • The proposed regularizer offers a promising new path for the precision analysis of complex diseases like cancer.
  • Personalized statistical models can effectively capture inter-sample heterogeneity, leading to more accurate biological insights.
  • The findings highlight the importance of moving beyond averaged views in genomic analysis.