Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K
Pharmacokinetic Models: Comparison and Selection Criterion01:26

Pharmacokinetic Models: Comparison and Selection Criterion

52
Physiological and compartmental models are valuable tools used in studying biological systems. These models rely on differential equations to maintain mass balance within the system, ensuring an accurate representation of the dynamic processes at play.
Physiological models take a detailed approach by considering specific molecular processes. They can predict drug distribution, metabolism, and elimination changes, providing a comprehensive understanding of how drugs interact with the body.
52
Survival Tree01:19

Survival Tree

74
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
74
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

105
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
105
Regression Analysis01:11

Regression Analysis

5.7K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.7K
Variation01:19

Variation

6.8K
An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...
6.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

When to Adjust for Multiple Testing: A Unifying Guiding Principle.

Biometrical journal. Biometrische Zeitschrift·2026
Same author

Donor bone marrow together with recipient regulatory T cells induces chimerism without irradiation in kidney transplantation.

Science translational medicine·2026
Same author

Detection of previously undiagnosed conditions in midlife preventive health examinations.

Scientific reports·2026
Same author

Using routinely collected data for research purposes: challenges and mitigation strategies.

BMJ (Clinical research ed.)·2026
Same author

First attempt success rate of intraosseous access in preterm infants and neonates: a systematic review.

Resuscitation plus·2026
Same author

Multiple linear regression modeling with values below a lower limit of quantification - a statistical method comparison.

BMC medical research methodology·2026
Same journal

Thymidylate synthase inhibitory drugs induce p53-dependent pathways differently.

PloS one·2026
Same journal

Top-down and bottom-up attention for joint pattern classification and reconstruction.

PloS one·2026
Same journal

Short- and long-term scaling behavior of blood pressure and pulse arrival time during sleep in healthy controls and patients with obstructive sleep apnea.

PloS one·2026
Same journal

Double DQN-based secrecy energy efficiency and fairness performance in IRS-assisted NOMA systems with friendly jamming.

PloS one·2026
Same journal

10 recommendations for strengthening citizen science for improved societal and ecological outcomes: A co-produced analysis of challenges and opportunities in the 21st century.

PloS one·2026
Same journal

Paying in public: Peer effects, impression management, and willingness to pay on digital payment platforms.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Jun 17, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K

Evaluating variable selection methods for multivariable regression models: A simulation study protocol.

Theresa Ullmann1, Georg Heinze1, Lorena Hafermann2

  • 1Institute of Clinical Biometrics, Center for Medical Data Science, Medical University of Vienna, Vienna, Austria.

Plos One
|August 9, 2024
PubMed
Summary
This summary is machine-generated.

This study protocol outlines a simulation to compare variable selection methods in regression analysis. It aims to neutrally evaluate their impact on model accuracy and variable identification, using real-world data.

More Related Videos

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Related Experiment Videos

Last Updated: Jun 17, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Area of Science:

  • Statistical modeling
  • Regression analysis
  • Data-driven variable selection

Background:

  • Variable selection in regression analysis aims to enhance model interpretability and predictive accuracy.
  • However, it can lead to issues like biased coefficients, underestimated standard errors, and model instability.
  • Few large-scale simulations have neutrally compared these methods' consequences.

Purpose of the Study:

  • To present a simulation study protocol for evaluating various data-driven variable selection methods.
  • To neutrally compare these methods based on their impact on model performance and variable identification.
  • To inform best practices in regression modeling through rigorous simulation.

Main Methods:

  • The study protocol details a simulation comparing forward selection, backward elimination, and penalized likelihood (Lasso) approaches.
  • Methods will be assessed on variable inclusion/exclusion accuracy, coefficient bias/variance, confidence interval validity, and predictive performance.
  • Simulations will use linear and logistic regression on National Health and Nutrition Examination Survey (NHANES) data.

Main Results:

  • This section will report findings from the simulation study, detailing the performance of each variable selection method.
  • Results will quantify false variable inclusions/exclusions and assess impacts on regression coefficient estimation and model predictive accuracy.
  • The study will provide empirical evidence on the relative strengths and weaknesses of different variable selection techniques.

Conclusions:

  • The simulation results will offer insights into the trade-offs associated with various data-driven variable selection methods.
  • Findings will guide researchers in choosing appropriate methods to avoid common pitfalls like biased estimates and invalid inference.
  • This research contributes to more reliable and accurate statistical modeling in applied research.