Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

P-value

P-value

P-value is one of the most crucial concepts in statistics.
P-value stands for the probability value. P-value is the probability that, if the null hypothesis is true, the results from another randomly selected sample will be as extreme or more extreme as the results obtained from the given sample.
A large P-value calculated from the data indicates to not reject the null hypothesis. But a higher P-value does not mean that the null hypothesis is true. The smaller the P-value, the more...

Statistical Significance

Statistical Significance

Once data is collected from both the experimental and the control groups, a statistical analysis is conducted to find out if there are meaningful differences between the two groups. A statistical analysis determines how likely any difference found is due to chance (and thus not meaningful). In psychology, group differences are considered meaningful, or significant, if the odds that these differences occurred by chance alone are 5 percent or less. Stated another way, if we repeated this...

Decision Making: P-value Method

Decision Making: P-value Method

The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim is also stated. These statements can act as null and alternative hypotheses: a null hypothesis would be a neutral statement while the alternative hypothesis can...

Significance Testing: Overview

Significance Testing: Overview

Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...

Critical Region, Critical Values and Significance Level

Critical Region, Critical Values and Significance Level

The critical region, critical value, and significance level are interdependent concepts crucial in hypothesis testing.
In hypothesis testing, a sample statistic is converted to a test statistic using z, t, or chi-square distribution. A critical region is an area under the curve in probability distributions demarcated by the critical value. When the test statistic falls in this region, it suggests that the null hypothesis must be rejected. As this region contains all those values of the...

Bonferroni Test

Bonferroni Test

The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Modeling Mental Health Case-Mix for Quality Improvement-A Comparison of Statistical and AI Models.

Healthcare (Basel, Switzerland)·2025

Same author

The Effect of Primary Care Visits on Total Patient Care Cost: Evidence From the Veterans Health Administration.

Journal of primary care & community health·2022

Same author

Case-mix for assessing primary care value (CPCV).

Health services management research·2020

Same journal

Methods for incorporating test result information within the high-dimensional propensity score framework: application in UK electronic health record data.

BMC medical research methodology·2026

Same journal

Sparse multi-way DMDC for longitudinal classification in high dimension low sample size data.

BMC medical research methodology·2026

Same journal

Tree-based exploratory identification of predictive biomarkers in non-randomized data.

BMC medical research methodology·2026

Same journal

Comparative evaluation of interrupted time series analytical methods for healthcare quality improvement research: a Monte Carlo simulation study.

BMC medical research methodology·2026

Same journal

Methodological advances in claims-based dementia algorithms: integrating medication and clinical data for medicare populations.

BMC medical research methodology·2026

Same journal

An interpretable XGboost algorithm for predicting 30-day mortality in acute pancreatitis using routine biomarkers.

BMC medical research methodology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 17, 2025

Author Spotlight: Impact of Physical Barriers on Rodent Populations in Farmland Areas

Author Spotlight: Impact of Physical Barriers on Rodent Populations in Farmland Areas

Published on: March 8, 2024

P-values - a chronic conundrum.

¹Department of Veterans Affairs, Office of Productivity, Efficiency and Staffing (OPES, RAPID), Albany, USA. Jian.Gao@va.gov.

BMC Medical Research Methodology

|June 26, 2020

Summary

This summary is machine-generated.

Misunderstanding p-values has serious consequences in research and medicine. This paper clarifies p-value confusion, explains the difference between significance and hypothesis testing, and proposes calibrated p-values as a viable alternative.

Keywords:

Calibrated P-values Hypothesis testing P-values Research reproducibility Significance testing Type I error

More Related Videos

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

Mapping Dysfunctional Protein-Protein Interactions in Disease

Mapping Dysfunctional Protein-Protein Interactions in Disease

Published on: October 24, 2025

Related Experiment Videos

Last Updated: Dec 17, 2025

Author Spotlight: Impact of Physical Barriers on Rodent Populations in Farmland Areas

Author Spotlight: Impact of Physical Barriers on Rodent Populations in Farmland Areas

Published on: March 8, 2024

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

Mapping Dysfunctional Protein-Protein Interactions in Disease

Mapping Dysfunctional Protein-Protein Interactions in Disease

Published on: October 24, 2025

Area of Science:

Statistical methodology in medical research
Hypothesis testing and significance evaluation

Background:

The p-value is frequently misconstrued as the probability of a Type I error, leading to significant issues in research reproducibility and clinical decision-making.
Common statistical education often conflates Fisher's significance testing with Neyman-Pearson's hypothesis testing, obscuring fundamental differences and contributing to widespread confusion.

Discussion:

The p-value quantifies evidence against a null hypothesis, with smaller values indicating stronger evidence, but it is not the probability of a Type I error.
A p-value of 0.05 does not imply a 5% chance of a Type I error; the actual probability of a treatment not working can be substantially higher, at least 28.9%.

Key Insights:

Clarifying the distinction between statistical significance and hypothesis testing is crucial for accurate interpretation of research findings.
The conventional p-value's misinterpretation has detrimental effects on treatment selection and empirical analysis.
A practical alternative is needed to address the limitations of traditional p-value usage.

Outlook:

Implementing calibrated p-values, representing the probability a treatment does not work, is essential for informed medical practice and research.
Future research should focus on developing and disseminating intuitive, mathematically sound educational materials on statistical interpretation.
Adopting calibrated p-values will enhance transparency and reliability in medical research and clinical decision-making.