Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sample Size Calculation01:19

Sample Size Calculation

3.7K
Knowledge of the sample size is the first requirement to conduct random sampling or an experiment. The sample size is the total number of units, observations, or groups (in some cases) used to get the data to estimate a population parameter. As the name suggests, the sample size is that of the sample drawn from the population and differs from the population size.
The sample size for the given experiment or sampling effort is fundamental to any study design. Sample size decides the number of...
3.7K
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

642
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
642
Bootstrapping01:24

Bootstrapping

658
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
658
Classification of Signals01:30

Classification of Signals

645
In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...
645
Prediction Intervals01:03

Prediction Intervals

2.3K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.3K
Sampling Methods: Sample Types01:18

Sampling Methods: Sample Types

336
Sampling materials are classified into three main types: solid, liquid, and gas.
Solid samples include a variety of substances, such as sediments from water bodies, soil, metals, and biological tissues. Two standard methods for extracting sediments from water bodies are grab sampling and piston coring. Grab sampling involves using a device to collect a discrete sediment sample from the bottom of a water body with minimal disturbance. Grab samples do not always represent the entire area due to...
336

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Real-World Associations of KidneyIntelX Risk Stratification With Guideline-Directed Therapy, Kidney Outcomes, and Metabolic Trajectories in Early Diabetic Kidney Disease.

Diabetes, obesity & metabolism·2026
Same author

Evaluating Sycophancy in Frontier Models Using Persona-Driven Challenge.

medRxiv : the preprint server for health sciences·2026
Same author

Large language models are poor clinical administrators: An evaluation of structured queries in real-world electronic health records.

PLOS digital health·2026
Same author

<i>Ad-verse Effects:</i> Pharmaceutical Advertising Shifts Drug Recommendations by Consumer-Facing AI.

medRxiv : the preprint server for health sciences·2026
Same author

Genomic analyses implicate hormonal and metabolic dysregulation in polycystic ovary syndrome.

Nature genetics·2026
Same author

How to meaningfully evaluate AI in clinical medicine.

Nature medicine·2026
Same journal

Enhancing anatomical recognition by surgeons during pelvic lymph node dissection using artificial intelligence.

NPJ digital medicine·2026
Same journal

AFP assistant: a retrieval-augmented generation and large language model-powered multilingual polio chatbot for low-resource language communities.

NPJ digital medicine·2026
Same journal

Structured reasoning failures compromise LLM interpretation of clinical oncology notes.

NPJ digital medicine·2026
Same journal

Translation of frozen sections into FFPE images for skin cancer resection margins using generative AI.

NPJ digital medicine·2026
Same journal

FedFound: a federated foundation model for lifespan brain morphological connectome analysis.

NPJ digital medicine·2026
Same journal

A multimodal instruction dataset and benchmark for ultrasound understanding.

NPJ digital medicine·2026
See all related articles

Related Experiment Video

Updated: Aug 17, 2025

Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.2K

Autoencoders for sample size estimation for fully connected neural network classifiers.

Faris F Gulamali1, Ashwin S Sawant2, Patricia Kovatch2

  • 1Icahn School of Medicine, New York, NY, 10029, USA. faris.gulamali@icahn.mssm.edu.

NPJ Digital Medicine
|December 13, 2022
PubMed
Summary
This summary is machine-generated.

Estimating deep learning sample sizes is challenging. This study introduces a Minimum Converging Sample (MCS) method using autoencoder loss to determine optimal labeled data for computer vision models, improving training efficiency.

More Related Videos

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
09:44

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

5.0K
Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates
08:56

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

2.3K

Related Experiment Videos

Last Updated: Aug 17, 2025

Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.2K
Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
09:44

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

5.0K
Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates
08:56

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

2.3K

Area of Science:

  • Computer Science
  • Machine Learning
  • Artificial Intelligence

Background:

  • Sample size estimation is critical in experimental design but remains understudied for deep learning.
  • Current methods rely on heuristics or prior experience, often leading to inefficient data labeling for supervised learning tasks.

Purpose of the Study:

  • To address the underestimation of sample size requirements in deep learning, particularly for computer vision.
  • To develop a rigorous method for estimating the minimum labeled data needed for effective model training.

Main Methods:

  • Investigated the concept of a Minimum Converging Sample (MCS) representing the smallest dataset for a generalizable representation.
  • Utilized autoencoder loss to estimate MCS for fully connected neural networks in computer vision tasks.
  • Developed a code-free, dataset-agnostic tool for MCS estimation.

Main Results:

  • Found that below the estimated MCS, fully connected networks struggle to differentiate classes.
  • Demonstrated a strong correlation between generalizability and autoencoder loss for sample sizes above the MCS.
  • Successfully provided a practical tool for estimating sample sizes.

Conclusions:

  • Minimum Converging Sample (MCS) estimation using autoencoder loss is a promising approach for guiding data collection and labeling in deep learning.
  • This method can significantly improve the efficiency and effectiveness of training computer vision models.
  • The findings offer a more data-driven strategy for sample size determination in deep learning applications.