Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Accuracy and Errors in Hypothesis Testing

Accuracy and Errors in Hypothesis Testing

Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5%...

Measures of Intelligence

Measures of Intelligence

Psychologists measure intelligence by using standardized tests that produce a score known as the intelligence quotient or IQ. To understand IQ tests, it's important to recognize the key principles behind their construction: validity, reliability, and standardization.
Validity refers to how well a test measures what it claims to measure. An intelligence test should accurately assess intelligence rather than another characteristic, like anxiety. Criterion validity is one way to evaluate this;...

Statistical Hypothesis Testing

Statistical Hypothesis Testing

Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...

Hypothesis: Accept or Fail to Reject?

Hypothesis: Accept or Fail to Reject?

The outcome of any hypothesis testing leads to rejecting or not rejecting the null hypothesis. This decision is taken based on the analysis of the data, an appropriate test statistic, an appropriate confidence level, the critical values, and P-values. However, when the evidence suggests that the null hypothesis cannot be rejected, is it right to say, 'Accept' the null hypothesis?
There are two ways to indicate that the null hypothesis is not rejected. 'Accept' the null...

Inductive Reasoning

Inductive Reasoning

Inductive reasoning is a form of logical thinking that uses related observations to arrive at a general conclusion. It is uncertain and operates in degrees to which the conclusions are credible. As such, inductive arguments can be weak or strong, rather than valid or invalid, and conclusions can be used to formulate testable, falsifiable hypotheses.
Inductive reasoning is common in descriptive science. A life scientist makes observations and records them. This data can be qualitative or...

Types of Hypothesis Testing

Types of Hypothesis Testing

There are three types of hypothesis tests: right-tailed, left-tailed, and two-tailed.
When the null and alternative hypotheses are stated, it is observed that the null hypothesis is a neutral statement against which the alternative hypothesis is tested. The alternative hypothesis is a claim that instead has a certain direction. If the null hypothesis claims that p = 0.5, the alternative hypothesis would be an opposing statement to this and can be put either p > 0.5, p < 0.5, or p...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A largely univariate framework for understanding multivariate analysis of variance.

Psychological methods·2026

Same author

Patience and Patience Regulation: Development and Psychometric Evaluation of Complementary Scales with Alternate Forms.

Journal of personality assessment·2026

Same author

Assessing the Response Process Validity of the Posttraumatic Growth Inventory Using Cognitive Interviews.

Journal of personality assessment·2026

Same author

Is Doing Good Good Enough? A Motivation, Action, Sacrifice, and Temptation (MAST) View of Moral Praiseworthiness.

Personality & social psychology bulletin·2024

Same author

Consensus, controversy, and chaos in the attribution of characteristics to the morally exceptional.

Journal of personality·2023

Same author

The Statistical Developments and Applications Section at Age 20: A Time to Review the SDA's Purpose and Ideal Types of Papers.

Journal of personality assessment·2023

Same journal

Modeling Individual Language Patterns and Psychological Constructs to Generate AI-Augmented Data for Scalable Psychological Assessment.

Assessment·2026

Same journal

The Psychometric Properties of the Perth Emotion Regulation Competency Inventory (PERCI) in Sexual and Gender Minority Adults: Minority Stress and Resilience Correlates of Positive and Negative Emotion Regulation Difficulties.

Assessment·2026

Same journal

Future Orientation Scale: A Psychometric Evaluation Across Health-Vulnerable Samples.

Assessment·2026

Same journal

Spurious Reliability Increase?: The Number of Response Options in the Likert-Type Scale Influences Only Internal Consistency, Not Criterion Validity.

Assessment·2026

Same journal

Measuring Moral Injury Outcome and Distress in High-Risk Populations in Germany: A Validation Study.

Assessment·2026

Same journal

Establishing Psychometric Validity of the PBSS-20 for Sexual Minority College Students.

Assessment·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 7, 2026

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Using Generative Artificial Intelligence to Advance Hypothesis-Driven Scale Validation: Identifying Criterion

Kyle D Austin¹, Hannah K Crawley¹, William Fleeson¹

¹Wake Forest University, Winston-Salem, NC, USA.

|December 29, 2025

Summary

This summary is machine-generated.

Artificial intelligence (AI) can generate precise validity hypotheses for scale validation, matching expert accuracy. This approach enhances psychological scale development efficiently.

Keywords:

artificial intelligence assessment measurement psychometrics scale development validity

Related Experiment Videos

Last Updated: Jan 7, 2026

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

Psychological Measurement
Artificial Intelligence in Research
Quantitative Psychology

Background:

Scale validation is crucial for psychological research.
Developing precise validity hypotheses is a key step.
Current methods can be time-consuming and require expert input.

Purpose of the Study:

To evaluate artificial intelligence (AI) for hypothesis-driven scale validation.
To assess AI's ability to generate psychologically reasonable validity hypotheses.
To compare AI-generated hypotheses with expert predictions.

Main Methods:

Qualitative assessment of AI suggestions for scale validation criteria.
Quantitative evaluation of AI-generated validity hypotheses using existing data.
Comparison of AI (ChatGPT, Gemini) hypothesis consistency and accuracy against expert predictions across nine scales/subscales.

Main Results:

AI provided useful suggestions for scale validation criteria.
AI-generated hypotheses demonstrated high inter-trial consistency, comparable to expert inter-rater consistency.
AI hypotheses showed strong agreement with expert hypotheses and similar accuracy in predicting validity correlations.

Conclusions:

AI, including ChatGPT and Gemini, can effectively facilitate hypothesis generation for convergent and discriminant validity.
AI offers a time-efficient method for scale validation without compromising psychological or psychometric quality.
AI shows promise in advancing rigorous, hypothesis-driven scale validation practices.