Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Regression01:25

Multiple Regression

3.7K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.7K
Regression Analysis01:11

Regression Analysis

7.7K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
7.7K
Prediction Intervals01:03

Prediction Intervals

3.1K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
3.1K
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

414
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
414
Correlation and Regression00:53

Correlation and Regression

3.0K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
3.0K
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

42.4K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
42.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Prognostic value of peri-operative circulating tumour DNA levels estimated by cell-free DNA methylation in patients with resectable colorectal liver metastases.

EBioMedicine·2026
Same author

Signpost Testing to Navigate the Parameter Space of the Gaussian Graphical Model With High-Dimensional Data.

Biometrical journal. Biometrische Zeitschrift·2026
Same author

False Discovery Estimation in Record Linkage.

Statistics in medicine·2025
Same author

Leveraging external information by guided adaptive shrinkage to improve variable selection in high-dimensional regression settings.

The international journal of biostatistics·2025
Same author

Alternatives to default shrinkage methods can improve prediction accuracy, calibration, and coverage: A methods comparison study.

Statistical methods in medical research·2025
Same author

High-throughput 3D spheroid screens identify microRNA sensitizers for improved thermoradiotherapy in locally advanced cancers.

Molecular therapy. Nucleic acids·2025
Same journal

Ensuring Quality in Preclinical Research: The Importance of Being Human.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Addressing Cluster-Level Treatment Effect Heterogeneity in Sample Size Determination for Hierarchical 2 × 2 Factorial Designs.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

A Multiple Imputation Approach to Distinguish Curative From Life-Prolonging Effects in the Presence of Missing Covariates.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Tests for Categorical Data Beyond Pearson: A Distance Covariance and Energy Distance Approach.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Nonparametric Estimation of the Patient-Weighted While-Alive Estimand.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Two-Stage Multiple Test Procedures Controlling False Discovery Rate With Auxiliary Variable and Their Application to Set4 <math><semantics><mi>Δ</mi> <annotation>$\Delta$</annotation></semantics></math> Mutant Data.

Biometrical journal. Biometrische Zeitschrift·2026
See all related articles

Related Experiment Video

Updated: Jan 7, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.7K

Informative Co-Data Learning for High-Dimensional Horseshoe Regression.

Claudio Busatto1, Mark A van de Wiel2

  • 1Department of Statistics, Computer Science, Applications "G. Parenti,", University of Florence, Florence, Italy.

Biometrical Journal. Biometrische Zeitschrift
|December 30, 2025
PubMed
Summary
This summary is machine-generated.

We introduce informative Horseshoe regression (infHS), a Bayesian model that improves high-dimensional regression by incorporating prior knowledge (co-data). This method enhances variable selection and prediction accuracy in genomics.

Keywords:
Bayesian inferenceHorseshoe priorVariational Bayesco‐data informationinformative shrinkage prior

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.9K

Related Experiment Videos

Last Updated: Jan 7, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.7K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.9K

Area of Science:

  • Genomics
  • Biostatistics
  • Computational Biology

Background:

  • High-dimensional data are common in clinical genomics for identifying trait predictors.
  • Incorporating prior knowledge (co-data) can enhance predictive model performance.

Purpose of the Study:

  • To develop a novel Bayesian regression model for high-dimensional data that integrates co-data.
  • To improve variable selection and prediction accuracy by leveraging external information.

Main Methods:

  • Developed the informative Horseshoe regression (infHS) model.
  • Implemented Gibbs sampler for moderate dimensions and Variational approximation for large-scale data.
  • Regressed prior variances of regression parameters on co-data variables.

Main Results:

  • The simulation study demonstrated the benefits of including co-data.
  • The infHS model showed superior performance compared to existing methods in two genomics applications.
  • The Variational approximation enabled efficient analysis of very large datasets.

Conclusions:

  • The infHS model effectively incorporates co-data into high-dimensional regression.
  • This approach offers improved variable selection and predictive performance in genomics.
  • infHS provides flexible computational tools for different data scales and inference goals.