Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.
The...

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Regression Analysis

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when researchers try to extrapolate results...

DNA Microarrays

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a survival tree begins...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Silica dioxide nanoparticles combined with cold exposure induce stronger systemic inflammatory response.

Environmental science and pollution research international·2016

Same author

Surgical readmissions: results of integrating pre-, peri- and postsurgical care.

Nursing open·2016

Same author

On Robust Association Testing for Quantitative Traits and Rare Variants.

G3 (Bethesda, Md.)·2016

Same author

In vitro and in vivo study of additive manufactured porous Ti6Al4V scaffolds for repairing bone defects.

Scientific reports·2016

Same author

Different Antibody Response against the Coxsackievirus A16 VP1 Capsid Protein: Specific or Non-Specific.

PloS one·2016

Same author

Depression-like behaviors and heme oxygenase-1 are regulated by Lycopene in lipopolysaccharide-induced neuroinflammation.

Journal of neuroimmunology·2016

Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026

Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026

Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026

Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026

Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026

Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 21, 2026

Generating the Transcriptional Regulation View of Transcriptomic Features for Prediction Task and Dark Biomarker Detection on Small Datasets

Generating the Transcriptional Regulation View of Transcriptomic Features for Prediction Task and Dark Biomarker Detection on Small Datasets

Published on: March 1, 2024

Incorporating predictor network in penalized regression with application to microarray data.

Wei Pan¹, Benhuai Xie, Xiaotong Shen

¹Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota 55455, USA. weip@biostat.umn.edu

|August 4, 2009

Summary

This summary is machine-generated.

This study introduces a novel network-based penalized regression method for "large p, small n" problems. The approach enhances grouped variable selection and outperforms existing methods in simulations and glioblastoma patient survival prediction.

Related Experiment Videos

Last Updated: Jun 21, 2026

Generating the Transcriptional Regulation View of Transcriptomic Features for Prediction Task and Dark Biomarker Detection on Small Datasets

Generating the Transcriptional Regulation View of Transcriptomic Features for Prediction Task and Dark Biomarker Detection on Small Datasets

Published on: March 1, 2024

Area of Science:

Statistics
Bioinformatics
Genomics

Background:

Penalized linear regression is crucial for high-dimensional data ('large p, small n').
Prior network information, like gene pathways, can improve predictive modeling.
Existing methods may not fully leverage network structures for variable selection.

Purpose of the Study:

To develop a penalized regression method incorporating prior network information.
To enable grouped variable selection by smoothing coefficients over a network.
To improve prediction accuracy and variable selection in 'large p, small n' settings.

Main Methods:

A novel grouped penalty based on the L(gamma)-norm is proposed.
The penalty smooths regression coefficients of connected predictors in a network.
The method is evaluated using simulation studies and a glioblastoma microarray dataset.

Main Results:

The proposed method demonstrates superior finite-sample performance compared to Lasso, elastic net, and other network-based methods.
It excels in grouped variable selection across various simulation scenarios.
The method successfully predicts glioblastoma patient survival using gene expression and network data.

Conclusions:

The proposed network-based penalized regression method effectively utilizes prior network information for variable selection.
It offers an advantage over existing methods, particularly in 'large p, small n' scenarios.
The approach has practical applications in analyzing complex biological data, such as gene expression.