Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Pharmacokinetic Models: Comparison and Selection Criterion

Pharmacokinetic Models: Comparison and Selection Criterion

Physiological and compartmental models are valuable tools used in studying biological systems. These models rely on differential equations to maintain mass balance within the system, ensuring an accurate representation of the dynamic processes at play.
Physiological models take a detailed approach by considering specific molecular processes. They can predict drug distribution, metabolism, and elimination changes, providing a comprehensive understanding of how drugs interact with the body.

Frequency-dependent Selection

Frequency-dependent Selection

When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

When to Adjust for Multiple Testing: A Unifying Guiding Principle.

Biometrical journal. Biometrische Zeitschrift·2026

Same author

Methodological guidance on clinical prediction models in mental health research.

Psychological medicine·2026

Same author

Using routinely collected data for research purposes: challenges and mitigation strategies.

BMJ (Clinical research ed.)·2026

Same author

A large-scale neutral comparison study of survival models on low-dimensional data.

Bioinformatics (Oxford, England)·2026

Same author

Detecting gene-environment interactions to guide personalized intervention: Boosting distributional regression for polygenic scores.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same author

The Influence of Anesthesiologist Gender and Experience on Risk Understanding and Anxiety Changes After Online Preoperative Patient Education: A Sub-Analysis of the iPREDICT Randomized Controlled Trial.

Journal of clinical medicine·2025

Same journal

Correction to "Mathematical Modelling of COVID-19 Transmission in Kenya: A Model with Reinfection Transmission Mechanism".

Computational and mathematical methods in medicine·2025

Same journal

RETRACTION: Ligustrazine Inhibits Lung Phosphodiesterase Activity in a Rat Model of Allergic Asthma.

Computational and mathematical methods in medicine·2025

Same journal

RETRACTION: Delivery of miR-224-5p by Exosomes from Cancer-Associated Fibroblasts Potentiates Progression of Clear Cell Renal Cell Carcinoma.

Computational and mathematical methods in medicine·2025

Same journal

RETRACTION: Empirical Analysis of the Nursing Effect of Intelligent Medical Internet of Things in Postoperative Osteoarthritis.

Computational and mathematical methods in medicine·2025

Same journal

RETRACTION: Evaluation and Analysis of the Intervention Effect of Systematic Parent Training Based on Computational Intelligence on Child Autism.

Computational and mathematical methods in medicine·2024

Same journal

RETRACTION: Humanistic Spirit Training of Medical Students Based on Multisource Medical Data Fusion.

Computational and mathematical methods in medicine·2024

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 24, 2026

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

Probing for Sparse and Fast Variable Selection with Model-Based Boosting.

Janek Thomas¹, Tobias Hepp², Andreas Mayr^2,3

¹Department of Statistics, LMU München, München, Germany.

Computational and Mathematical Methods in Medicine

|August 24, 2017

Summary

This summary is machine-generated.

This study introduces a novel variable selection technique using gradient boosting and shadow variables. This method efficiently identifies important variables in a single model fit, outperforming existing approaches in high-dimensional data analysis.

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Related Experiment Videos

Last Updated: Feb 24, 2026

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

Machine Learning
Statistical Modeling
Bioinformatics

Background:

Model-based gradient boosting offers simultaneous statistical modeling and variable selection.
Current methods require multiple data alterations (e.g., cross-validation) to prevent overfitting, increasing computational cost.
Efficient variable selection is crucial for high-dimensional datasets.

Purpose of the Study:

To develop a novel, computationally efficient variable selection method.
To integrate variable selection directly into the model fitting process.
To evaluate the performance of the new method against established techniques.

Main Methods:

A new approach augmenting data with randomly permuted 'shadow variables'.
Stopping the boosting process when a shadow variable is selected, enabling single-fit selection.
Benchmarking against stability selection in high-dimensional classification tasks.

Main Results:

The proposed probing method achieves competitive performance compared to state-of-the-art selection techniques.
Variable selection is accomplished in a single model fit, eliminating the need for parameter tuning.
Successful application demonstrated on three gene expression datasets.

Conclusions:

The novel method provides an efficient and effective alternative for variable selection in gradient boosting.
This approach simplifies the process by integrating selection into a single model fit.
The technique shows promise for analyzing complex biological data, such as gene expression profiles.