Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Regression Analysis

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...

Correlation and Regression

Correlation and Regression

In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...

How Data are Classified: Categorical Data

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Prognostic value of peri-operative circulating tumour DNA levels estimated by cell-free DNA methylation in patients with resectable colorectal liver metastases.

EBioMedicine·2026

Same author

Signpost Testing to Navigate the Parameter Space of the Gaussian Graphical Model With High-Dimensional Data.

Biometrical journal. Biometrische Zeitschrift·2026

Same author

False Discovery Estimation in Record Linkage.

Statistics in medicine·2025

Same author

Leveraging external information by guided adaptive shrinkage to improve variable selection in high-dimensional regression settings.

The international journal of biostatistics·2025

Same author

Alternatives to default shrinkage methods can improve prediction accuracy, calibration, and coverage: A methods comparison study.

Statistical methods in medical research·2025

Same author

High-throughput 3D spheroid screens identify microRNA sensitizers for improved thermoradiotherapy in locally advanced cancers.

Molecular therapy. Nucleic acids·2025

Same journal

Ensuring Quality in Preclinical Research: The Importance of Being Human.

Biometrical journal. Biometrische Zeitschrift·2026

Same journal

Addressing Cluster-Level Treatment Effect Heterogeneity in Sample Size Determination for Hierarchical 2 × 2 Factorial Designs.

Biometrical journal. Biometrische Zeitschrift·2026

Same journal

A Multiple Imputation Approach to Distinguish Curative From Life-Prolonging Effects in the Presence of Missing Covariates.

Biometrical journal. Biometrische Zeitschrift·2026

Same journal

Tests for Categorical Data Beyond Pearson: A Distance Covariance and Energy Distance Approach.

Biometrical journal. Biometrische Zeitschrift·2026

Same journal

Nonparametric Estimation of the Patient-Weighted While-Alive Estimand.

Biometrical journal. Biometrische Zeitschrift·2026

Same journal

Two-Stage Multiple Test Procedures Controlling False Discovery Rate With Auxiliary Variable and Their Application to Set4 <math><semantics><mi>Δ</mi> <annotation>$\Delta$</annotation></semantics></math> Mutant Data.

Biometrical journal. Biometrische Zeitschrift·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 7, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Informative Co-Data Learning for High-Dimensional Horseshoe Regression.

Claudio Busatto¹, Mark A van de Wiel²

¹Department of Statistics, Computer Science, Applications "G. Parenti,", University of Florence, Florence, Italy.

Biometrical Journal. Biometrische Zeitschrift

|December 30, 2025

Summary

This summary is machine-generated.

We introduce informative Horseshoe regression (infHS), a Bayesian model that improves high-dimensional regression by incorporating prior knowledge (co-data). This method enhances variable selection and prediction accuracy in genomics.

Keywords:

Bayesian inference Horseshoe prior Variational Bayes co‐data information informative shrinkage prior

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Related Experiment Videos

Last Updated: Jan 7, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Area of Science:

Genomics
Biostatistics
Computational Biology

Background:

High-dimensional data are common in clinical genomics for identifying trait predictors.
Incorporating prior knowledge (co-data) can enhance predictive model performance.

Purpose of the Study:

To develop a novel Bayesian regression model for high-dimensional data that integrates co-data.
To improve variable selection and prediction accuracy by leveraging external information.

Main Methods:

Developed the informative Horseshoe regression (infHS) model.
Implemented Gibbs sampler for moderate dimensions and Variational approximation for large-scale data.
Regressed prior variances of regression parameters on co-data variables.

Main Results:

The simulation study demonstrated the benefits of including co-data.
The infHS model showed superior performance compared to existing methods in two genomics applications.
The Variational approximation enabled efficient analysis of very large datasets.

Conclusions:

The infHS model effectively incorporates co-data into high-dimensional regression.
This approach offers improved variable selection and predictive performance in genomics.
infHS provides flexible computational tools for different data scales and inference goals.