Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Margin of Error01:27

Margin of Error

The margin of error is also called the maximum error of an estimate. The margin of error is the maximum possible or expected difference between the observed sample parameter value and the actual population parameter value. For proportion, it is the maximum difference between the value of sample proportion obtained from the data and the true value of population proportion. As the true value of the population parameter is not known, the margin of error is calculated using the sample statistic.
Central Limit Theorem01:14

Central Limit Theorem

The central limit theorem, abbreviated as clt, is one of the most powerful and useful ideas in all of statistics. The central limit theorem for sample means says that if you repeatedly draw samples of a given size and calculate their means, and create a histogram of those means, then the resulting histogram will tend to have an approximate normal bell shape. In other words, as sample sizes increase, the distribution of means follows the normal distribution more closely.
The sample size, n, that...
Contaminants and Errors01:16

Contaminants and Errors

Effective sample preparation is crucial for accurate and reliable laboratory analysis. During this process, two significant sources of error can arise: concentration bias from improper sample splitting and contamination caused by methods used to reduce particle size, such as grinding or homogenization. Identifying and minimizing these potential errors is crucial to ensuring the validity of the analysis.
Another key consideration is determining the appropriate number of samples required to...
Sampling Distribution01:12

Sampling Distribution

Given simple random samples of size n from a given population with a measured characteristic such as mean, proportion, or standard deviation for each sample, the probability distribution of all the measured characteristics is called a sampling distribution. How much the statistic varies from one sample to another is known as the sampling variability of a statistic. You typically measure the sampling variability of a statistic by its standard error. The standard error of the mean is an example...
Estimating Population Mean with Known Standard Deviation01:16

Estimating Population Mean with Known Standard Deviation

To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate + error bound)
The...
Estimating Population Mean with Unknown Standard Deviation01:22

Estimating Population Mean with Unknown Standard Deviation

In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the Guinness...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Is More Always Better With Digital Health Interventions? Shifting Engagement From Maximizing Use to Supporting Health.

Mayo Clinic proceedings. Digital health·2026
Same author

Effective monitoring of online AI decision-making algorithms in just-in-time adaptive interventions.

NPJ digital medicine·2026
Same author

SigmaScheduling: Uncertainty-Informed Scheduling of Decision Points for Intelligent Mobile Health Interventions.

... International Conference on Wearable and Implantable Body Sensor Networks. International Conference on Wearable and Implantable Body Sensor Networks·2026
Same author

Non-Stationary Latent Auto-Regressive Bandits.

Reinforcement learning journal·2026
Same author

Harnessing Causality in Reinforcement Learning With Bagged Decision Times.

Proceedings of machine learning research·2026
Same author

Digital Twins for Just-in-Time Adaptive Interventions (JITAIs): Framework for Optimizing and Continually Improving JITAIs.

Journal of medical Internet research·2026
Same journal

Calibrated and Conformal Propensity Scores for Causal Effect Estimation.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2025
Same journal

Non-stationary Domain Generalization: Theory and Algorithm.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2025
Same journal

Offline Reward Perturbation Boosts Distributional Shift in Online RL.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2024
Same journal

A Variational Approximation for Analyzing the Dynamics of Panel Data.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2021
Same journal

Graph Reparameterizations for Enabling 1000+ Monte Carlo Iterations in Bayesian Deep Neural Networks.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2021
Same journal

Sampling-free Uncertainty Estimation in Gated Recurrent Units with Applications to Normative Modeling in Neuroimaging.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2020
See all related articles

Related Experiment Video

Updated: Jun 12, 2026

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease
06:16

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Published on: August 9, 2024

Small Sample Inference for Generalization Error in Classification Using the CUD Bound.

Eric B Laber1, Susan A Murphy

  • 1Department of Statistics, University of Michigan, Ann Arbor, MI 48104.

Uncertainty in Artificial Intelligence : Proceedings of the ... Conference. Conference on Uncertainty in Artificial Intelligence
|May 29, 2010
PubMed
Summary
This summary is machine-generated.

Estimating generalization error for classifiers with small training data is challenging. This study introduces a novel method using a smooth upper bound and bootstrapping to create reliable confidence sets for generalization error, improving classifier performance.

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Related Experiment Videos

Last Updated: Jun 12, 2026

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease
06:16

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Published on: August 9, 2024

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Area of Science:

  • Machine Learning
  • Statistical Learning Theory

Background:

  • Estimating generalization error is critical for classifier performance, especially with limited training data.
  • Existing methods using resampling and distribution assumptions often fail to provide accurate confidence sets due to the non-smooth nature of generalization error.
  • The non-normal distribution of estimated generalization error complicates reliable confidence interval construction.

Purpose of the Study:

  • To develop a robust method for constructing confidence sets for generalization error in machine learning classifiers.
  • To address the limitations of existing techniques that struggle with the inherent non-smoothness of generalization error.
  • To provide a computationally efficient algorithm for parametric additive models.

Main Methods:

  • Constructing a confidence set for generalization error using a smooth upper bound on the deviation between resampled estimates and true error.
  • Bootstrapping this smooth upper bound to form the confidence set.
  • Developing a computationally efficient algorithm for classifiers represented as parametric additive models.

Main Results:

  • The proposed method demonstrates superior performance in constructing confidence sets for generalization error compared to traditional approaches.
  • The technique effectively handles the non-smoothness issue, leading to more reliable confidence intervals.
  • Validated through performance on test and simulated datasets.

Conclusions:

  • The novel bootstrapping approach using a smooth upper bound offers a reliable solution for confidence measures in generalization error estimation.
  • This method improves the accuracy and trustworthiness of confidence sets, particularly for small training sample sizes.
  • The computationally efficient algorithm enhances practical applicability in specific classifier settings.