Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Outliers and Influential Points

Outliers and Influential Points

An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...

Detection of Gross Error: The Q Test

Detection of Gross Error: The Q Test

When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...

What Are Outliers?

What Are Outliers?

Outliers are observed data points that are far from the least squares line. They have unusual values and need to be examined carefully. Though an outlier may result from erroneous data, at other times, it may hold valuable information about the population under study and should be included in the data. Hence, it is crucial to examine what causes a data point to be an outlier.
The z score is used to find outliers or unusual values. It should be noted that any values beyond -2 and +2 are...

Statistical Hypothesis Testing

Statistical Hypothesis Testing

Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same authorSame journal

X implies Y - Testing Hypotheses of Direction of Effect Using Configural Frequency Analysis.

Integrative psychological & behavioral science·2026

Same author

Cumulant-Based Approaches for Testing the Assumption of Independent Errors in Non-Gaussian Parallel and Congeneric Measures.

Educational and psychological measurement·2026

Same author

Control of Type 1 and Type 2 Errors in Configural Frequency Analysis.

Journal for person-oriented research·2026

Same author

Does X at Time 1 Cause Y at Time 2? Longitudinal Causal Learning with Hidden Confounders.

Psychometrika·2026

Same author

Conceptual and methodological advances for understanding contextual, identity, and cultural effects in intervention research: The contextually informed research model.

Journal of school psychology·2025

Same author

Right-sizing growth mixture models as multi-group growth and confirmatory factor models.

Behavior research methods·2025

Same journal

The Dynamics of Emotional Regulation in Aversive Social Interactions: A Review of the Rationalization Trap and the Impact of Affect Labeling.

Integrative psychological & behavioral science·2026

Same journal

From Static Ontology to Dynamic Generativity: A Relational-Dynamic Model of "Dao" and DRQ in Moral Education.

Integrative psychological & behavioral science·2026

Same journal

Temporal Experience and Human State Field: Toward a New Framework Bridging Physics and Perception Part I-Basic Arguments, Concepts, and Definitions.

Integrative psychological & behavioral science·2026

Same journal

Beyond Fear: Disgust, Anger, and the Affective Core of Interpersonal Phobias.

Integrative psychological & behavioral science·2026

Same journal

Symbolic Closure and the Neurodevelopment of Social Normativity.

Integrative psychological & behavioral science·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 8, 2025

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences

Published on: September 4, 2019

Moving From Statistical to Hypothesis-driven Outliers.

Alexander von Eye¹, Wolfgang Wiedermann²

¹Michigan State University, 190 Allee du Nouveau Monde, 34000, Montpellier, France. voneye@msu.edu.

Integrative Psychological & Behavioral Science

|April 23, 2025

Summary

This summary is machine-generated.

This study introduces a novel method for outlier analysis in categorical data, defining outliers by their extremity relative to hypotheses. Configural Frequency Analysis (CFA) reveals how unsupervised outlier detection can distort supervised classification results.

Keywords:

CFA Configural frequency analysis Distance outlier Hypothesis outlier Outlier

More Related Videos

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Published on: January 31, 2014

Related Experiment Videos

Last Updated: May 8, 2025

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences

Published on: September 4, 2019

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Published on: January 31, 2014

Area of Science:

Statistics
Data Mining
Categorical Data Analysis

Background:

Traditional outlier analysis relies on data characteristics like distance or correlation.
This approach is applicable to various data types and analysis scales.
A gap exists in defining outliers based on substantive hypotheses.

Purpose of the Study:

To propose a new approach for outlier analysis in categorical data.
To define outliers as data points extreme relative to substantive hypotheses.
To compare standard outlier analysis with Configural Frequency Analysis (CFA).

Main Methods:

Proposed defining outliers based on extremity to substantive hypotheses.
Introduced a two-step outlier analysis: standard analysis and CFA.
Utilized cluster analysis for unsupervised classification and CFA for supervised classification.

Main Results:

Outliers identified via unsupervised classification can distort supervised classification outcomes.
Configural Frequency Analysis (CFA) identifies outliers as cells contradicting a null hypothesis.
The interplay between unsupervised and supervised classification methods was examined.

Conclusions:

A new perspective on outlier definition in categorical data is presented.
Configural Frequency Analysis offers a hypothesis-driven approach to outlier detection.
Understanding the impact of unsupervised outlier identification on supervised methods is crucial.