Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Parametric Survival Analysis: Weibull and Exponential Methods

Parametric Survival Analysis: Weibull and Exponential Methods

Parametric survival analysis models survival data by assuming a specific probability distribution for the time until an event occurs. The Weibull and exponential distributions are two of the most commonly used methods in this context, due to their versatility and relatively straightforward application.
Weibull Distribution
The Weibull distribution is a flexible model used in parametric survival analysis. It can handle both increasing and decreasing hazard rates, depending on its shape parameter...

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

Assumptions of Survival Analysis

Assumptions of Survival Analysis

Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.

Truncation in Survival Analysis

Truncation in Survival Analysis

Truncation in survival analysis refers to the exclusion of individuals or events from the dataset based on specific criteria related to the time of the event. This exclusion can happen in two primary forms: left truncation and right truncation.
Left truncation occurs when individuals who experienced the event of interest before a certain time are not included in the study. This is often due to a "delayed entry" into the study where only those who survive until a certain entry point are...

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

When to Adjust for Multiple Testing: A Unifying Guiding Principle.

Biometrical journal. Biometrische Zeitschrift·2026

Same author

Polygenic modeling of genetic effects on both phenotypic mean and variance: distributional regression for BMI, blood and urine biomarkers in the UK Biobank.

Frontiers in bioinformatics·2026

Same author

Methodological guidance on clinical prediction models in mental health research.

Psychological medicine·2026

Same author

CleanFinder: a scalable framework for comprehensive genome editing analysis.

Trends in biotechnology·2026

Same author

Sunscreen Efficacy Against UVA1- And Visible Light-Induced Skin Pigmentation Is Influenced by Ancestry.

Photodermatology, photoimmunology & photomedicine·2026

Same author

Detecting gene-environment interactions to guide personalized intervention: Boosting distributional regression for polygenic scores.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Targeted maximum likelihood estimation (TMLE) in regulatory submissions and research: a landscape analysis.

The international journal of biostatistics·2026

Same journal

Predicting birth weight by multivariate functional principal component regressions.

The international journal of biostatistics·2026

Same journal

Robust median regression for count data with general lower truncation using a contaminated discrete Weibull model.

The international journal of biostatistics·2026

Same journal

Handling the uncertainty issue of missingness via a mixture-structure-based method.

The international journal of biostatistics·2026

Same journal

Statistical method for pooling categorical biomarker data from multi-center matched/nested case-control studies.

The international journal of biostatistics·2026

Same journal

Prognostic score methods for the estimation of the average causal effect.

The international journal of biostatistics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 2, 2025

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Robust statistical boosting with quantile-based adaptive loss functions.

Jan Speller¹, Christian Staerk¹, Andreas Mayr¹

¹Medical Faculty, Institute of Medical Biometrics, Informatics and Epidemiology (IMBIE), University of Bonn, Bonn, Germany.

The International Journal of Biostatistics

|August 11, 2022

Summary

This summary is machine-generated.

This study introduces adaptive robust loss functions for boosting algorithms, improving variable selection and predictive modeling in biomedical data, especially with outliers. The new method enhances accuracy and model sparsity, outperforming standard approaches.

Keywords:

Bisquare loss Huber loss gradient boosting robust regression

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Psychophysically-anchored, Robust Thresholding in Studying Pain-related Lateralization of Oscillatory Prestimulus Activity

Psychophysically-anchored, Robust Thresholding in Studying Pain-related Lateralization of Oscillatory Prestimulus Activity

Published on: January 21, 2017

Related Experiment Videos

Last Updated: Sep 2, 2025

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Psychophysically-anchored, Robust Thresholding in Studying Pain-related Lateralization of Oscillatory Prestimulus Activity

Psychophysically-anchored, Robust Thresholding in Studying Pain-related Lateralization of Oscillatory Prestimulus Activity

Published on: January 21, 2017

Area of Science:

Biostatistics
Machine Learning
Bioinformatics

Background:

High-dimensional biomedical data often contains outliers, complicating predictive modeling and variable selection.
Traditional methods struggle with data corrupted by outliers, leading to inaccurate results.
Robust statistical methods are needed to handle such data effectively.

Purpose of the Study:

To develop and evaluate adaptive robust loss functions within statistical boosting algorithms.
To enhance variable selection and predictive modeling for high-dimensional biomedical data with potential outliers.
To improve robustness against vertical outliers in the outcome variable.

Main Methods:

Proposed an adaptive approach for composite robust loss functions (Huber, Bisquare) in boosting.
Adapted the threshold parameter of loss functions based on residual sizes in each boosting iteration.
Compared performance against M-regression, standard boosting losses, and lasso using simulated data and NCI-60 cell line expression data.

Main Results:

Adaptive Huber and Bisquare losses demonstrated superior performance in prediction accuracy and variable selection when data contained outliers or corruption.
For non-corrupted data, the adaptive approach performed comparably to standard L2 loss boosting and lasso.
In analyzing KRT19 protein expression data, the adaptive loss functions yielded favorable prediction accuracy and highly sparse models.

Conclusions:

Adaptive robust loss functions integrated with boosting algorithms offer a powerful tool for analyzing complex biomedical data.
The proposed method effectively handles outliers, leading to more reliable variable selection and predictive models.
This approach shows promise for applications in bioinformatics, particularly with gene expression data.