Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

1.8K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
1.8K
Parametric Survival Analysis: Weibull and Exponential Methods01:14

Parametric Survival Analysis: Weibull and Exponential Methods

565
Parametric survival analysis models survival data by assuming a specific probability distribution for the time until an event occurs. The Weibull and exponential distributions are two of the most commonly used methods in this context, due to their versatility and relatively straightforward application.
Weibull Distribution
The Weibull distribution is a flexible model used in parametric survival analysis. It can handle both increasing and decreasing hazard rates, depending on its shape parameter...
565
Regression Toward the Mean01:52

Regression Toward the Mean

6.5K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.5K
Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

180
Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
180
Truncation in Survival Analysis01:09

Truncation in Survival Analysis

288
Truncation in survival analysis refers to the exclusion of individuals or events from the dataset based on specific criteria related to the time of the event. This exclusion can happen in two primary forms: left truncation and right truncation.
Left truncation occurs when individuals who experienced the event of interest before a certain time are not included in the study. This is often due to a "delayed entry" into the study where only those who survive until a certain entry point are...
288
Prediction Intervals01:03

Prediction Intervals

2.3K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

When to Adjust for Multiple Testing: A Unifying Guiding Principle.

Biometrical journal. Biometrische Zeitschrift·2026
Same author

Polygenic modeling of genetic effects on both phenotypic mean and variance: distributional regression for BMI, blood and urine biomarkers in the UK Biobank.

Frontiers in bioinformatics·2026
Same author

Methodological guidance on clinical prediction models in mental health research.

Psychological medicine·2026
Same author

CleanFinder: a scalable framework for comprehensive genome editing analysis.

Trends in biotechnology·2026
Same author

Sunscreen Efficacy Against UVA1- And Visible Light-Induced Skin Pigmentation Is Influenced by Ancestry.

Photodermatology, photoimmunology & photomedicine·2026
Same author

Detecting gene-environment interactions to guide personalized intervention: Boosting distributional regression for polygenic scores.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Targeted maximum likelihood estimation (TMLE) in regulatory submissions and research: a landscape analysis.

The international journal of biostatistics·2026
Same journal

Predicting birth weight by multivariate functional principal component regressions.

The international journal of biostatistics·2026
Same journal

Robust median regression for count data with general lower truncation using a contaminated discrete Weibull model.

The international journal of biostatistics·2026
Same journal

Handling the uncertainty issue of missingness via a mixture-structure-based method.

The international journal of biostatistics·2026
Same journal

Statistical method for pooling categorical biomarker data from multi-center matched/nested case-control studies.

The international journal of biostatistics·2026
Same journal

Prognostic score methods for the estimation of the average causal effect.

The international journal of biostatistics·2026
See all related articles

Related Experiment Video

Updated: Sep 2, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.2K

Robust statistical boosting with quantile-based adaptive loss functions.

Jan Speller1, Christian Staerk1, Andreas Mayr1

  • 1Medical Faculty, Institute of Medical Biometrics, Informatics and Epidemiology (IMBIE), University of Bonn, Bonn, Germany.

The International Journal of Biostatistics
|August 11, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces adaptive robust loss functions for boosting algorithms, improving variable selection and predictive modeling in biomedical data, especially with outliers. The new method enhances accuracy and model sparsity, outperforming standard approaches.

Keywords:
Bisquare lossHuber lossgradient boostingrobust regression

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.3K
Psychophysically-anchored, Robust Thresholding in Studying Pain-related Lateralization of Oscillatory Prestimulus Activity
07:28

Psychophysically-anchored, Robust Thresholding in Studying Pain-related Lateralization of Oscillatory Prestimulus Activity

Published on: January 21, 2017

7.0K

Related Experiment Videos

Last Updated: Sep 2, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.2K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.3K
Psychophysically-anchored, Robust Thresholding in Studying Pain-related Lateralization of Oscillatory Prestimulus Activity
07:28

Psychophysically-anchored, Robust Thresholding in Studying Pain-related Lateralization of Oscillatory Prestimulus Activity

Published on: January 21, 2017

7.0K

Area of Science:

  • Biostatistics
  • Machine Learning
  • Bioinformatics

Background:

  • High-dimensional biomedical data often contains outliers, complicating predictive modeling and variable selection.
  • Traditional methods struggle with data corrupted by outliers, leading to inaccurate results.
  • Robust statistical methods are needed to handle such data effectively.

Purpose of the Study:

  • To develop and evaluate adaptive robust loss functions within statistical boosting algorithms.
  • To enhance variable selection and predictive modeling for high-dimensional biomedical data with potential outliers.
  • To improve robustness against vertical outliers in the outcome variable.

Main Methods:

  • Proposed an adaptive approach for composite robust loss functions (Huber, Bisquare) in boosting.
  • Adapted the threshold parameter of loss functions based on residual sizes in each boosting iteration.
  • Compared performance against M-regression, standard boosting losses, and lasso using simulated data and NCI-60 cell line expression data.

Main Results:

  • Adaptive Huber and Bisquare losses demonstrated superior performance in prediction accuracy and variable selection when data contained outliers or corruption.
  • For non-corrupted data, the adaptive approach performed comparably to standard L2 loss boosting and lasso.
  • In analyzing KRT19 protein expression data, the adaptive loss functions yielded favorable prediction accuracy and highly sparse models.

Conclusions:

  • Adaptive robust loss functions integrated with boosting algorithms offer a powerful tool for analyzing complex biomedical data.
  • The proposed method effectively handles outliers, leading to more reliable variable selection and predictive models.
  • This approach shows promise for applications in bioinformatics, particularly with gene expression data.