Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Analysis01:11

Regression Analysis

5.5K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.5K
Microsoft Excel: Regression Analysis01:18

Microsoft Excel: Regression Analysis

426
Regression analysis in Microsoft Excel is a powerful statistical method for examining the relationship between a dependent variable and one or more independent variables. It's used extensively in fields such as economics, biology, and business to predict outcomes, understand relationships, and make data-driven decisions. The most common type is linear regression, which attempts to fit a straight line through the data points to model the relationship between variables.
To perform regression...
426
Multiple Regression01:25

Multiple Regression

2.9K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
2.9K
Statistical Analysis: Overview01:11

Statistical Analysis: Overview

5.6K
When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
5.6K
Correlation and Regression00:53

Correlation and Regression

1.2K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
1.2K
Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test01:09

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

1.6K
In parametric statistics, two fundamental tests stand out for their utility and wide application: the Student's t-test and goodness-of-fit tests. These tests provide researchers with a robust method for drawing insights from data, testing hypotheses, and making informed decisions based on their findings.
The Student's t-test is a statistical test that examines if there is a statistically significant difference between the means of two groups. This test is instrumental when dealing with...
1.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Bayesian Structured Mediation analysis with Unobserved confounders.

Biometrics·2026
Same author

Scalable Bayesian Image-on-Scalar Regression for Population-Scale Neuroimaging Data Analysis.

Journal of the American Statistical Association·2026
Same author

Bayesian Image Mediation Analysis.

Journal of the American Statistical Association·2026
Same author

Synergy Area With FDR-Controlled Evaluation (SAFE) to Robustly Assess Safety Profile in Clinical Trials.

Statistics in medicine·2026
Same author

Bayesian Signal Matching for Transfer Learning in ERP-Based Brain Computer Interface.

Journal of the American Statistical Association·2026
Same author

Adding highly variable genes to spatially variable genes can improve cell type clustering performance in spatial transcriptomics data.

Bioinformatics advances·2026
Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026
Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026
See all related articles

Related Experiment Video

Updated: May 24, 2025

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K

A general, flexible, and harmonious framework to construct interpretable functions in regression analysis.

Tianyu Zhan1, Jian Kang2

  • 1Data and Statistical Sciences, AbbVie Inc., 1 Waukegan Road, North Chicago, IL 60064, United States.

Biometrics
|March 4, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a flexible framework for creating interpretable regression models, enhancing reliability and transparency. The approach uses a novel Mallows's Cp-based measure for model selection, balancing accuracy and generalizability.

Keywords:
complexityestimationgeneralizabilityinterpretabilitymodel selection

More Related Videos

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
20:24

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Published on: January 31, 2014

16.4K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K

Related Experiment Videos

Last Updated: May 24, 2025

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
20:24

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Published on: January 31, 2014

16.4K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K

Area of Science:

  • Statistics
  • Machine Learning
  • Biostatistics

Background:

  • Interpretability in models is crucial for reliability, transparency, and communication.
  • Defining and evaluating interpretability remains subjective, with factors like simplicity, accuracy, and generalizability being key.
  • Existing methods may not offer a unified approach to constructing interpretable functions.

Purpose of the Study:

  • To present a general, flexible framework for constructing interpretable functions in regression analysis, focusing on continuous outcomes.
  • To introduce a new model selection measure based on Mallows's Cp-statistic.
  • To demonstrate the framework's application in clinical trial design and Bayesian decision-making.

Main Methods:

  • Formulation of a functional skeleton guided by user expectations of interpretability.
  • Development of a new model selection criterion using Mallows's Cp-statistic to balance approximation, generalizability, and interpretability.
  • Application of the framework to derive sample size formulas for adaptive clinical trials and analyze operating characteristics in Bayesian Go/No-Go designs.

Main Results:

  • A novel framework for building interpretable regression models is established.
  • A new Mallows's Cp-based statistic is proposed for effective model selection.
  • The framework is successfully applied to adaptive clinical trials, Bayesian Go/No-Go paradigms, and hypothesis testing for categorical outcomes.

Conclusions:

  • The proposed framework offers a harmonious approach to constructing interpretable functions in regression analysis.
  • The new model selection measure aids in balancing key aspects of model evaluation.
  • The method demonstrates broad applicability across various statistical and biomedical applications, including real-world data analysis.