

Autoencoders for sample size estimation for fully connected neural network classifiers.

Faris F Gulamali¹, Ashwin S Sawant², Patricia Kovatch²

¹Icahn School of Medicine, New York, NY, 10029, USA. faris.gulamali@icahn.mssm.edu.

NPJ Digital Medicine

December 13, 2022

Summary

This summary is machine-generated.

Estimating sample sizes for deep learning is challenging. This study introduces the Minimum Converging Sample (MCS), estimated from autoencoder loss, to determine how much labeled data a computer vision model needs, which can make data labeling and training more efficient.


Area of Science:

Computer Science
Machine Learning
Artificial Intelligence

Background:

Sample size estimation is critical in experimental design but remains understudied for deep learning.
Current methods rely on heuristics or prior experience, often leading to inefficient data labeling for supervised learning tasks.

Purpose of the Study:

To address the lack of rigorous sample size estimation in deep learning, particularly for computer vision.
To develop a rigorous method for estimating the minimum labeled data needed for effective model training.

Main Methods:

Investigated the Minimum Converging Sample (MCS): the smallest amount of labeled data at which a model reaches a generalizable representation of the data.
Utilized autoencoder loss to estimate MCS for fully connected neural networks in computer vision tasks.
Developed a code-free, dataset-agnostic tool for MCS estimation.
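
The idea of reading an MCS off an autoencoder loss curve can be illustrated with a small sketch. This is not the study's procedure or its tool: it assumes an autoencoder has already been trained at several increasing sample sizes and its held-out reconstruction loss recorded at each, and it applies a simple plateau rule chosen here for illustration. The function name `minimum_converging_sample`, the `rel_tol` threshold, and the loss values are all hypothetical.

```python
from typing import Optional, Sequence


def minimum_converging_sample(
    sizes: Sequence[int],
    losses: Sequence[float],
    rel_tol: float = 0.05,
) -> Optional[int]:
    """Return the smallest sample size from which the loss curve stays flat.

    sizes   -- increasing sample sizes at which an autoencoder was trained
    losses  -- held-out reconstruction loss measured at each size
    rel_tol -- hypothetical threshold: largest relative change between
               consecutive losses that still counts as "converged"
    """
    if len(sizes) != len(losses):
        raise ValueError("sizes and losses must have the same length")
    if len(sizes) < 2:
        return None

    # Relative change between each pair of consecutive loss values.
    changes = [
        abs(losses[i + 1] - losses[i]) / max(abs(losses[i]), 1e-12)
        for i in range(len(losses) - 1)
    ]

    # Walk backwards to find where the trailing flat stretch begins.
    start = len(sizes) - 1
    while start > 0 and changes[start - 1] <= rel_tol:
        start -= 1

    # No flat stretch at the end of the curve: more data is needed.
    if start == len(sizes) - 1:
        return None
    return sizes[start]


# Made-up loss values, for illustration only (not results from the study).
example_sizes = [100, 200, 400, 800, 1600, 3200]
example_losses = [0.90, 0.60, 0.35, 0.20, 0.195, 0.193]
estimate = minimum_converging_sample(example_sizes, example_losses)
```

With these illustrative numbers the loss stops changing by more than 5% from 800 samples onward, so the sketch returns 800. A curve that is still falling at the largest size returns `None`, signalling that the sampled range is too small to locate a plateau.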

Main Results:

Found that below the estimated MCS, fully connected networks struggle to differentiate classes.
Demonstrated a strong correlation between generalizability and autoencoder loss for sample sizes above the MCS.
Provided a practical, code-free tool for estimating sample sizes for fully connected networks.

Conclusions:

Minimum Converging Sample (MCS) estimation using autoencoder loss is a promising approach for guiding data collection and labeling in deep learning.
This method can significantly improve the efficiency and effectiveness of training computer vision models.
The findings offer a more data-driven strategy for sample size determination in deep learning applications.