Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Margin of Error

Margin of Error

The margin of error is also called the maximum error of an estimate. The margin of error is the maximum possible or expected difference between the observed sample parameter value and the actual population parameter value. For proportion, it is the maximum difference between the value of sample proportion obtained from the data and the true value of population proportion. As the true value of the population parameter is not known, the margin of error is calculated using the sample statistic.

Central Limit Theorem

Central Limit Theorem

The central limit theorem, abbreviated as clt, is one of the most powerful and useful ideas in all of statistics. The central limit theorem for sample means says that if you repeatedly draw samples of a given size and calculate their means, and create a histogram of those means, then the resulting histogram will tend to have an approximate normal bell shape. In other words, as sample sizes increase, the distribution of means follows the normal distribution more closely.
The sample size, n, that...

Contaminants and Errors

Contaminants and Errors

Effective sample preparation is crucial for accurate and reliable laboratory analysis. During this process, two significant sources of error can arise: concentration bias from improper sample splitting and contamination caused by methods used to reduce particle size, such as grinding or homogenization. Identifying and minimizing these potential errors is crucial to ensuring the validity of the analysis.
Another key consideration is determining the appropriate number of samples required to...

Sampling Distribution

Sampling Distribution

Given simple random samples of size n from a given population with a measured characteristic such as mean, proportion, or standard deviation for each sample, the probability distribution of all the measured characteristics is called a sampling distribution. How much the statistic varies from one sample to another is known as the sampling variability of a statistic. You typically measure the sampling variability of a statistic by its standard error. The standard error of the mean is an example...

Estimating Population Mean with Known Standard Deviation

Estimating Population Mean with Known Standard Deviation

To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate + error bound)
The...

Estimating Population Mean with Unknown Standard Deviation

Estimating Population Mean with Unknown Standard Deviation

In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the Guinness...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Is More Always Better With Digital Health Interventions? Shifting Engagement From Maximizing Use to Supporting Health.

Mayo Clinic proceedings. Digital health·2026

Same author

Effective monitoring of online AI decision-making algorithms in just-in-time adaptive interventions.

NPJ digital medicine·2026

Same author

SigmaScheduling: Uncertainty-Informed Scheduling of Decision Points for Intelligent Mobile Health Interventions.

... International Conference on Wearable and Implantable Body Sensor Networks. International Conference on Wearable and Implantable Body Sensor Networks·2026

Same author

Non-Stationary Latent Auto-Regressive Bandits.

Reinforcement learning journal·2026

Same author

Harnessing Causality in Reinforcement Learning With Bagged Decision Times.

Proceedings of machine learning research·2026

Same author

Digital Twins for Just-in-Time Adaptive Interventions (JITAIs): Framework for Optimizing and Continually Improving JITAIs.

Journal of medical Internet research·2026

Same journal

Calibrated and Conformal Propensity Scores for Causal Effect Estimation.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2025

Same journal

Non-stationary Domain Generalization: Theory and Algorithm.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2025

Same journal

Offline Reward Perturbation Boosts Distributional Shift in Online RL.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2024

Same journal

A Variational Approximation for Analyzing the Dynamics of Panel Data.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2021

Same journal

Graph Reparameterizations for Enabling 1000+ Monte Carlo Iterations in Bayesian Deep Neural Networks.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2021

Same journal

Sampling-free Uncertainty Estimation in Gated Recurrent Units with Applications to Normative Modeling in Neuroimaging.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2020

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 12, 2026

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Published on: August 9, 2024

Small Sample Inference for Generalization Error in Classification Using the CUD Bound.

Eric B Laber¹, Susan A Murphy

¹Department of Statistics, University of Michigan, Ann Arbor, MI 48104.

Uncertainty in Artificial Intelligence : Proceedings of the ... Conference. Conference on Uncertainty in Artificial Intelligence

|May 29, 2010

Summary

This summary is machine-generated.

Estimating generalization error for classifiers with small training data is challenging. This study introduces a novel method using a smooth upper bound and bootstrapping to create reliable confidence sets for generalization error, improving classifier performance.

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Related Experiment Videos

Last Updated: Jun 12, 2026

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Published on: August 9, 2024

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Area of Science:

Machine Learning
Statistical Learning Theory

Background:

Estimating generalization error is critical for classifier performance, especially with limited training data.
Existing methods using resampling and distribution assumptions often fail to provide accurate confidence sets due to the non-smooth nature of generalization error.
The non-normal distribution of estimated generalization error complicates reliable confidence interval construction.

Purpose of the Study:

To develop a robust method for constructing confidence sets for generalization error in machine learning classifiers.
To address the limitations of existing techniques that struggle with the inherent non-smoothness of generalization error.
To provide a computationally efficient algorithm for parametric additive models.

Main Methods:

Constructing a confidence set for generalization error using a smooth upper bound on the deviation between resampled estimates and true error.
Bootstrapping this smooth upper bound to form the confidence set.
Developing a computationally efficient algorithm for classifiers represented as parametric additive models.

Main Results:

The proposed method demonstrates superior performance in constructing confidence sets for generalization error compared to traditional approaches.
The technique effectively handles the non-smoothness issue, leading to more reliable confidence intervals.
Validated through performance on test and simulated datasets.

Conclusions:

The novel bootstrapping approach using a smooth upper bound offers a reliable solution for confidence measures in generalization error estimation.
This method improves the accuracy and trustworthiness of confidence sets, particularly for small training sample sizes.
The computationally efficient algorithm enhances practical applicability in specific classifier settings.