Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sample Size Calculation01:19

Sample Size Calculation

6.1K
Knowledge of the sample size is the first requirement to conduct random sampling or an experiment. The sample size is the total number of units, observations, or groups (in some cases) used to get the data to estimate a population parameter. As the name suggests, the sample size is that of the sample drawn from the population and differs from the population size.
The sample size for the given experiment or sampling effort is fundamental to any study design. Sample size decides the number of...
6.1K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

8.8K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
8.8K
One-Way ANOVA: Unequal Sample Sizes01:15

One-Way ANOVA: Unequal Sample Sizes

6.5K
One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:
6.5K
One-Way ANOVA: Equal Sample Sizes01:15

One-Way ANOVA: Equal Sample Sizes

3.9K
One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...
3.9K
Multiple Regression01:25

Multiple Regression

3.7K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.7K
Regression Analysis01:11

Regression Analysis

7.6K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
7.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Coastal and inland American alligators (Alligator mississippiensis) diverge along physiological axes in response to a salinity gradient.

Journal of evolutionary biology·2026
Same author

Natural and anthropogenic factors influence flowering synchrony and reproduction of a dominant plant in an inter-Andean scrub.

American journal of botany·2024
Same author

Fish diversity of Colombian Andes-Amazon streams at the end of conflict is a reference for conservation before increased land use.

Ecology and evolution·2024
Same author

Grassland intensification effects cascade to alter multifunctionality of wetlands within metaecosystems.

Nature communications·2023
Same author

Experimental disturbance and productivity gradients drive community diversity in aquatic mesocosms.

Ecology and evolution·2023
Same author

Incorporating distance metrics and temporal trends to refine mixed stock analysis.

Scientific reports·2022
Same journal

Analysis of strength degradation of coal and rock masses and stability of mined areas under long term immersion environment.

PloS one·2026
Same journal

Biogenic Silver-Selenium nanocomposite with anticancer activity and potent efficacy against vancomycin-resistant Staphylococcus aureus.

PloS one·2026
Same journal

Preparation and physicochemical characterization of a biodegradable chitosan/carboxymethyl cellulose hydrogel synthesized in NaOH/urea medium.

PloS one·2026
Same journal

Action-guilt, survivor-guilt, and depression in combat-related PTSD.

PloS one·2026
Same journal

Explainable machine learning for predicting activities of daily living at discharge in stroke patients: A retrospective study using SHAP interpretability.

PloS one·2026
Same journal

Deep learning based two-way feature depiction model for brain tumor detection.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Dec 28, 2025

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities
10:26

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Published on: September 11, 2021

4.3K

A solution to minimum sample size for regressions.

David G Jenkins1, Pedro F Quintana-Ascencio1

  • 1Department of Biology, University of Central Florida, Orlando, Florida, United States of America.

Plos One
|February 22, 2020
PubMed
Summary
This summary is machine-generated.

A minimum sample size (N) of 8 is sufficient for regression analysis with very low variance, but N ≥ 25 is needed for high variance. Information theory indices, not R2, are best for model selection to ensure reproducible research.

More Related Videos

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

6.2K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K

Related Experiment Videos

Last Updated: Dec 28, 2025

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities
10:26

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Published on: September 11, 2021

4.3K
The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

6.2K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K

Area of Science:

  • Biostatistics
  • Reproducible Research
  • Statistical Modeling

Background:

  • Regressions and meta-regressions are vital for estimating patterns and effect sizes across disciplines.
  • Low sample sizes (N) in biological and medical studies raise concerns about research reproducibility.
  • Determining the minimum N for reliable regression model selection is crucial.

Purpose of the Study:

  • To identify the minimum sample size (N) required to accurately discern data patterns using regression models.
  • To compare the effectiveness of information theoretic model selection versus R-squared for determining minimum N.
  • To provide evidence-based recommendations for sample size in regression and meta-regression studies.

Main Methods:

  • Generated synthetic datasets (null, simple linear, quadratic) with varying variances and effect sizes.
  • Employed information theoretic model selection, specifically Akaike Information Criterion weights (AICc wi), to evaluate minimum N.
  • Assessed the utility of coefficient of determination (R2) and adjusted R2 for model selection and minimum N determination.

Main Results:

  • Data patterns were clearly identified at N ≥ 8, even with very low variance, though false positives/negatives occurred below this.
  • Accurate statistical inference required N ≥ 25 in conditions with high data variance.
  • Akaike Information Criterion weights (AICc wi) effectively distinguished between models (e.g., linear vs. null), while R2 and adjusted R2 proved inadequate.

Conclusions:

  • A minimum N of 8 is informative for low-variance data, but N ≥ 25 is necessary for high-variance data in regression.
  • Information theory indices (e.g., AIC) are superior to R2 or adjusted R2 for comparing alternative regression models.
  • Insufficient sample size and reliance on R2 for model selection contribute to poor reproducibility; recommend N ≥ 25 for regression/meta-regression.