Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Types of Errors: Detection and Minimization

Types of Errors: Detection and Minimization

Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or...

Random and Systematic Errors

Random and Systematic Errors

Scientists always try their best to record measurements with the utmost accuracy and precision. However, sometimes errors do occur. These errors can be random or systematic. Random errors are observed due to the inconsistency or fluctuation in the measurement process, or variations in the quantity itself that is being measured. Such errors fluctuate from being greater than or less than the true value in repeated measurements. Consider a scientist measuring the length of an earthworm using a...

Propagation of Uncertainty from Systematic Error

Propagation of Uncertainty from Systematic Error

The atomic mass of an element varies due to the relative ratio of its isotopes. A sample's relative proportion of oxygen isotopes influences its average atomic mass. For instance, if we were to measure the atomic mass of oxygen from a sample, the mass would be a weighted average of the isotopic masses of oxygen in that sample. Since a single sample is not likely to perfectly reflect the true atomic mass of oxygen for all the molecules of oxygen on Earth, the mass we obtain from this...

Margin of Error

Margin of Error

The margin of error is also called the maximum error of an estimate. The margin of error is the maximum possible or expected difference between the observed sample parameter value and the actual population parameter value. For proportion, it is the maximum difference between the value of sample proportion obtained from the data and the true value of population proportion. As the true value of the population parameter is not known, the margin of error is calculated using the sample statistic.

Accuracy and Errors in Hypothesis Testing

Accuracy and Errors in Hypothesis Testing

Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5%...

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Identification of functional modules in a PPI network by bounded diameter clustering.

Journal of bioinformatics and computational biology·2010

Same journal

CardiaTics: An explainable AI integrated heart disease diagnosis model with feature engineering and stacked ensemble approach.

Journal of big data·2026

Same journal

Comprehensive representation of health-related phenotypes in one million dogs using topic modelling of electronic health records.

Journal of big data·2026

Same journal

UniqueNOSD: a novel framework for NoSQL over SQL databases.

Journal of big data·2025

Same journal

<i>F</i>u<i>n</i>Da: scalable serverless data analytics and in situ query processing.

Journal of big data·2025

Same journal

Integrating Big Data, Artificial Intelligence, and motion analysis for emerging precision medicine applications in Parkinson's Disease.

Journal of big data·2024

Same journal

Interpolation-split: a data-centric deep learning approach with big interpolated data to boost airway segmentation performance.

Journal of big data·2024

See all related articles

Search research articles

Related Experiment Video

Updated: Aug 11, 2025

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Error and optimism bias regularization.

Nassim Sohaee¹

¹Department of Information Technology and Decision Science, University of North Texas, 1155 Union Circle, Denton, TX 76203 USA.

Journal of Big Data

|February 6, 2023

Summary

This summary is machine-generated.

This study introduces a new regularization term for regression models to better manage over-predicted and under-predicted instances. This method enhances prediction quality by analyzing specific error types, improving machine learning model evaluation.

Keywords:

Convex cost function Cost function Optimism bias Over-estimation Regression Regularization Under-estimation

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Errors as a Means of Reducing Impulsive Food Choice

Errors as a Means of Reducing Impulsive Food Choice

Published on: June 5, 2016

Related Experiment Videos

Last Updated: Aug 11, 2025

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Errors as a Means of Reducing Impulsive Food Choice

Errors as a Means of Reducing Impulsive Food Choice

Published on: June 5, 2016

Area of Science:

Machine Learning
Statistical Modeling

Background:

Current regression model evaluation focuses on minimizing overall error, neglecting specific types of prediction inaccuracies.
Existing methods lack detailed analysis of over-predicted versus under-predicted instances, limiting model interpretability.

Purpose of the Study:

To introduce a novel regularization term for regression models.
To specifically manage and evaluate the count of over-predicted and under-predicted instances.
To enhance the granular evaluation of prediction errors in machine learning.

Main Methods:

A simple regularization term is proposed and integrated into regression models.
The method focuses on differentiating and controlling over-prediction and under-prediction.
Evaluation involves analyzing the impact of the regularization term on error distribution.

Main Results:

The proposed regularization term effectively manages the number of over-predicted and under-predicted instances.
This approach provides a more detailed insight into regression model error types.
Improved ability to diagnose and address specific prediction biases.

Conclusions:

The introduced regularization term offers a valuable enhancement for regression model evaluation.
It allows for more nuanced analysis of prediction quality beyond simple error minimization.
This technique aids in building more robust and reliable machine learning models.