Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

P-value01:10

P-value

8.4K
P-value is one of the most crucial concepts in statistics.
P-value stands for the probability value.  P-value is the probability that, if the null hypothesis is true, the results from another randomly selected sample will be as extreme or more extreme as the results obtained from the given sample.
A large P-value calculated from the data indicates to  not reject the null hypothesis. But a higher P-value does not mean that the null hypothesis is true. The smaller the P-value, the more...
8.4K
Statistical Significance01:50

Statistical Significance

20.9K
Once data is collected from both the experimental and the control groups, a statistical analysis is conducted to find out if there are meaningful differences between the two groups. A statistical analysis determines how likely any difference found is due to chance (and thus not meaningful). In psychology, group differences are considered meaningful, or significant, if the odds that these differences occurred by chance alone are 5 percent or less. Stated another way, if we repeated this...
20.9K
Decision Making: P-value Method01:09

Decision Making: P-value Method

6.7K
The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim  is also stated. These statements can act as null and alternative hypotheses:  a null hypothesis would be a neutral statement while the alternative hypothesis can...
6.7K
Significance Testing: Overview01:04

Significance Testing: Overview

10.3K
Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...
10.3K
Critical Region, Critical Values and Significance Level01:16

Critical Region, Critical Values and Significance Level

13.1K
The critical region, critical value, and significance level are interdependent concepts crucial in hypothesis testing.
In hypothesis testing, a sample statistic is converted to a test statistic using z, t, or chi-square distribution. A critical region is an area under the curve in  probability distributions demarcated by the critical value. When the test statistic falls in this region, it suggests that the null hypothesis must be rejected. As this region contains all those values of the...
13.1K
Bonferroni Test01:10

Bonferroni Test

3.2K
The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...
3.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Modeling Mental Health Case-Mix for Quality Improvement-A Comparison of Statistical and AI Models.

Healthcare (Basel, Switzerland)·2025
Same author

The Effect of Primary Care Visits on Total Patient Care Cost: Evidence From the Veterans Health Administration.

Journal of primary care & community health·2022
Same author

Case-mix for assessing primary care value (CPCV).

Health services management research·2020
Same journal

Methods for incorporating test result information within the high-dimensional propensity score framework: application in UK electronic health record data.

BMC medical research methodology·2026
Same journal

Sparse multi-way DMDC for longitudinal classification in high dimension low sample size data.

BMC medical research methodology·2026
Same journal

Tree-based exploratory identification of predictive biomarkers in non-randomized data.

BMC medical research methodology·2026
Same journal

Comparative evaluation of interrupted time series analytical methods for healthcare quality improvement research: a Monte Carlo simulation study.

BMC medical research methodology·2026
Same journal

Methodological advances in claims-based dementia algorithms: integrating medication and clinical data for medicare populations.

BMC medical research methodology·2026
Same journal

An interpretable XGboost algorithm for predicting 30-day mortality in acute pancreatitis using routine biomarkers.

BMC medical research methodology·2026
See all related articles

Related Experiment Video

Updated: Dec 17, 2025

Author Spotlight: Impact of Physical Barriers on Rodent Populations in Farmland Areas
03:29

Author Spotlight: Impact of Physical Barriers on Rodent Populations in Farmland Areas

Published on: March 8, 2024

869

P-values - a chronic conundrum.

Jian Gao1

  • 1Department of Veterans Affairs, Office of Productivity, Efficiency and Staffing (OPES, RAPID), Albany, USA. Jian.Gao@va.gov.

BMC Medical Research Methodology
|June 26, 2020
PubMed
Summary
This summary is machine-generated.

Misunderstanding p-values has serious consequences in research and medicine. This paper clarifies p-value confusion, explains the difference between significance and hypothesis testing, and proposes calibrated p-values as a viable alternative.

Keywords:
Calibrated P-valuesHypothesis testingP-valuesResearch reproducibilitySignificance testingType I error

More Related Videos

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research
09:35

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

18.2K
Mapping Dysfunctional Protein-Protein Interactions in Disease
09:39

Mapping Dysfunctional Protein-Protein Interactions in Disease

Published on: October 24, 2025

382

Related Experiment Videos

Last Updated: Dec 17, 2025

Author Spotlight: Impact of Physical Barriers on Rodent Populations in Farmland Areas
03:29

Author Spotlight: Impact of Physical Barriers on Rodent Populations in Farmland Areas

Published on: March 8, 2024

869
A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research
09:35

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

18.2K
Mapping Dysfunctional Protein-Protein Interactions in Disease
09:39

Mapping Dysfunctional Protein-Protein Interactions in Disease

Published on: October 24, 2025

382

Area of Science:

  • Statistical methodology in medical research
  • Hypothesis testing and significance evaluation

Background:

  • The p-value is frequently misconstrued as the probability of a Type I error, leading to significant issues in research reproducibility and clinical decision-making.
  • Common statistical education often conflates Fisher's significance testing with Neyman-Pearson's hypothesis testing, obscuring fundamental differences and contributing to widespread confusion.

Discussion:

  • The p-value quantifies evidence against a null hypothesis, with smaller values indicating stronger evidence, but it is not the probability of a Type I error.
  • A p-value of 0.05 does not imply a 5% chance of a Type I error; the actual probability of a treatment not working can be substantially higher, at least 28.9%.

Key Insights:

  • Clarifying the distinction between statistical significance and hypothesis testing is crucial for accurate interpretation of research findings.
  • The conventional p-value's misinterpretation has detrimental effects on treatment selection and empirical analysis.
  • A practical alternative is needed to address the limitations of traditional p-value usage.

Outlook:

  • Implementing calibrated p-values, representing the probability a treatment does not work, is essential for informed medical practice and research.
  • Future research should focus on developing and disseminating intuitive, mathematically sound educational materials on statistical interpretation.
  • Adopting calibrated p-values will enhance transparency and reliability in medical research and clinical decision-making.