Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Regression01:25

Multiple Regression

4.2K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
4.2K
Pharmacokinetic Models: Comparison and Selection Criterion01:26

Pharmacokinetic Models: Comparison and Selection Criterion

423
Physiological and compartmental models are valuable tools used in studying biological systems. These models rely on differential equations to maintain mass balance within the system, ensuring an accurate representation of the dynamic processes at play.
Physiological models take a detailed approach by considering specific molecular processes. They can predict drug distribution, metabolism, and elimination changes, providing a comprehensive understanding of how drugs interact with the body.
423
Frequency-dependent Selection01:21

Frequency-dependent Selection

24.3K
When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.
24.3K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.2K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.2K
Prediction Intervals01:03

Prediction Intervals

3.5K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
3.5K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

436
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
436

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

When to Adjust for Multiple Testing: A Unifying Guiding Principle.

Biometrical journal. Biometrische Zeitschrift·2026
Same author

Methodological guidance on clinical prediction models in mental health research.

Psychological medicine·2026
Same author

Using routinely collected data for research purposes: challenges and mitigation strategies.

BMJ (Clinical research ed.)·2026
Same author

A large-scale neutral comparison study of survival models on low-dimensional data.

Bioinformatics (Oxford, England)·2026
Same author

Detecting gene-environment interactions to guide personalized intervention: Boosting distributional regression for polygenic scores.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

The Influence of Anesthesiologist Gender and Experience on Risk Understanding and Anxiety Changes After Online Preoperative Patient Education: A Sub-Analysis of the iPREDICT Randomized Controlled Trial.

Journal of clinical medicine·2025
Same journal

Correction to "Mathematical Modelling of COVID-19 Transmission in Kenya: A Model with Reinfection Transmission Mechanism".

Computational and mathematical methods in medicine·2025
Same journal

RETRACTION: Ligustrazine Inhibits Lung Phosphodiesterase Activity in a Rat Model of Allergic Asthma.

Computational and mathematical methods in medicine·2025
Same journal

RETRACTION: Delivery of miR-224-5p by Exosomes from Cancer-Associated Fibroblasts Potentiates Progression of Clear Cell Renal Cell Carcinoma.

Computational and mathematical methods in medicine·2025
Same journal

RETRACTION: Empirical Analysis of the Nursing Effect of Intelligent Medical Internet of Things in Postoperative Osteoarthritis.

Computational and mathematical methods in medicine·2025
Same journal

RETRACTION: Evaluation and Analysis of the Intervention Effect of Systematic Parent Training Based on Computational Intelligence on Child Autism.

Computational and mathematical methods in medicine·2024
Same journal

RETRACTION: Humanistic Spirit Training of Medical Students Based on Multisource Medical Data Fusion.

Computational and mathematical methods in medicine·2024
See all related articles

Related Experiment Video

Updated: Feb 24, 2026

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.6K

Probing for Sparse and Fast Variable Selection with Model-Based Boosting.

Janek Thomas1, Tobias Hepp2, Andreas Mayr2,3

  • 1Department of Statistics, LMU München, München, Germany.

Computational and Mathematical Methods in Medicine
|August 24, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces a novel variable selection technique using gradient boosting and shadow variables. This method efficiently identifies important variables in a single model fit, outperforming existing approaches in high-dimensional data analysis.

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

Related Experiment Videos

Last Updated: Feb 24, 2026

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.6K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

Area of Science:

  • Machine Learning
  • Statistical Modeling
  • Bioinformatics

Background:

  • Model-based gradient boosting offers simultaneous statistical modeling and variable selection.
  • Current methods require multiple data alterations (e.g., cross-validation) to prevent overfitting, increasing computational cost.
  • Efficient variable selection is crucial for high-dimensional datasets.

Purpose of the Study:

  • To develop a novel, computationally efficient variable selection method.
  • To integrate variable selection directly into the model fitting process.
  • To evaluate the performance of the new method against established techniques.

Main Methods:

  • A new approach augmenting data with randomly permuted 'shadow variables'.
  • Stopping the boosting process when a shadow variable is selected, enabling single-fit selection.
  • Benchmarking against stability selection in high-dimensional classification tasks.

Main Results:

  • The proposed probing method achieves competitive performance compared to state-of-the-art selection techniques.
  • Variable selection is accomplished in a single model fit, eliminating the need for parameter tuning.
  • Successful application demonstrated on three gene expression datasets.

Conclusions:

  • The novel method provides an efficient and effective alternative for variable selection in gradient boosting.
  • This approach simplifies the process by integrating selection into a single model fit.
  • The technique shows promise for analyzing complex biological data, such as gene expression profiles.