Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Types of Errors: Detection and Minimization01:12

Types of Errors: Detection and Minimization

1.9K
Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or...
1.9K
Random and Systematic Errors01:20

Random and Systematic Errors

11.5K
Scientists always try their best to record measurements with the utmost accuracy and precision. However, sometimes errors do occur. These errors can be random or systematic. Random errors are observed due to the inconsistency or fluctuation in the measurement process, or variations in the quantity itself that is being measured. Such errors fluctuate from being greater than or less than the true value in repeated measurements. Consider a scientist measuring the length of an earthworm using a...
11.5K
Propagation of Uncertainty from Systematic Error01:10

Propagation of Uncertainty from Systematic Error

580
The atomic mass of an element varies due to the relative ratio of its isotopes. A sample's relative proportion of oxygen isotopes influences its average atomic mass. For instance, if we were to measure the atomic mass of oxygen from a sample, the mass would be a weighted average of the isotopic masses of oxygen in that sample. Since a single sample is not likely to perfectly reflect the true atomic mass of oxygen for all the molecules of oxygen on Earth, the mass we obtain from this...
580
Margin of Error01:27

Margin of Error

4.4K
The margin of error is also called the maximum error of an estimate. The margin of error is the maximum possible or expected difference between the observed sample parameter value and the actual population parameter value. For proportion, it is the maximum difference between the value of sample proportion obtained from the data and the true value of population proportion. As the true value of the population parameter is not known, the margin of error is calculated using the sample statistic.
4.4K
Accuracy and Errors in Hypothesis Testing01:13

Accuracy and Errors in Hypothesis Testing

249
Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5%...
249
Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Identification of functional modules in a PPI network by bounded diameter clustering.

Journal of bioinformatics and computational biology·2010
Same journal

CardiaTics: An explainable AI integrated heart disease diagnosis model with feature engineering and stacked ensemble approach.

Journal of big data·2026
Same journal

Comprehensive representation of health-related phenotypes in one million dogs using topic modelling of electronic health records.

Journal of big data·2026
Same journal

UniqueNOSD: a novel framework for NoSQL over SQL databases.

Journal of big data·2025
Same journal

<i>F</i>u<i>n</i>Da: scalable serverless data analytics and in situ query processing.

Journal of big data·2025
Same journal

Integrating Big Data, Artificial Intelligence, and motion analysis for emerging precision medicine applications in Parkinson's Disease.

Journal of big data·2024
Same journal

Interpolation-split: a data-centric deep learning approach with big interpolated data to boost airway segmentation performance.

Journal of big data·2024
See all related articles

Related Experiment Video

Updated: Aug 11, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.2K

Error and optimism bias regularization.

Nassim Sohaee1

  • 1Department of Information Technology and Decision Science, University of North Texas, 1155 Union Circle, Denton, TX 76203 USA.

Journal of Big Data
|February 6, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a new regularization term for regression models to better manage over-predicted and under-predicted instances. This method enhances prediction quality by analyzing specific error types, improving machine learning model evaluation.

Keywords:
Convex cost functionCost functionOptimism biasOver-estimationRegressionRegularizationUnder-estimation

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

431
Errors as a Means of Reducing Impulsive Food Choice
07:07

Errors as a Means of Reducing Impulsive Food Choice

Published on: June 5, 2016

8.7K

Related Experiment Videos

Last Updated: Aug 11, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.2K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

431
Errors as a Means of Reducing Impulsive Food Choice
07:07

Errors as a Means of Reducing Impulsive Food Choice

Published on: June 5, 2016

8.7K

Area of Science:

  • Machine Learning
  • Statistical Modeling

Background:

  • Current regression model evaluation focuses on minimizing overall error, neglecting specific types of prediction inaccuracies.
  • Existing methods lack detailed analysis of over-predicted versus under-predicted instances, limiting model interpretability.

Purpose of the Study:

  • To introduce a novel regularization term for regression models.
  • To specifically manage and evaluate the count of over-predicted and under-predicted instances.
  • To enhance the granular evaluation of prediction errors in machine learning.

Main Methods:

  • A simple regularization term is proposed and integrated into regression models.
  • The method focuses on differentiating and controlling over-prediction and under-prediction.
  • Evaluation involves analyzing the impact of the regularization term on error distribution.

Main Results:

  • The proposed regularization term effectively manages the number of over-predicted and under-predicted instances.
  • This approach provides a more detailed insight into regression model error types.
  • Improved ability to diagnose and address specific prediction biases.

Conclusions:

  • The introduced regularization term offers a valuable enhancement for regression model evaluation.
  • It allows for more nuanced analysis of prediction quality beyond simple error minimization.
  • This technique aids in building more robust and reliable machine learning models.