Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

1.3K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
1.3K
Outliers and Influential Points01:08

Outliers and Influential Points

3.9K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
3.9K
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

4.4K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
4.4K
What Are Outliers?01:12

What Are Outliers?

3.6K
Outliers are observed data points that are far from the least squares line. They have unusual values and need to be examined carefully. Though an outlier may result from erroneous data, at other times, it may hold valuable information about the population under study and should be included in the data. Hence, it is crucial to examine what causes a data point to be an outlier.
The z score is used to find outliers or unusual values. It should be noted that any values beyond -2 and +2 are...
3.6K
Statistical Hypothesis Testing01:16

Statistical Hypothesis Testing

1.8K
Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...
1.8K
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

95
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
95

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same authorSame journal

X implies Y - Testing Hypotheses of Direction of Effect Using Configural Frequency Analysis.

Integrative psychological & behavioral science·2026
Same author

Cumulant-Based Approaches for Testing the Assumption of Independent Errors in Non-Gaussian Parallel and Congeneric Measures.

Educational and psychological measurement·2026
Same author

Control of Type 1 and Type 2 Errors in Configural Frequency Analysis.

Journal for person-oriented research·2026
Same author

Does X at Time 1 Cause Y at Time 2? Longitudinal Causal Learning with Hidden Confounders.

Psychometrika·2026
Same author

Conceptual and methodological advances for understanding contextual, identity, and cultural effects in intervention research: The contextually informed research model.

Journal of school psychology·2025
Same author

Right-sizing growth mixture models as multi-group growth and confirmatory factor models.

Behavior research methods·2025
Same journal

The Dynamics of Emotional Regulation in Aversive Social Interactions: A Review of the Rationalization Trap and the Impact of Affect Labeling.

Integrative psychological & behavioral science·2026
Same journal

From Static Ontology to Dynamic Generativity: A Relational-Dynamic Model of "Dao" and DRQ in Moral Education.

Integrative psychological & behavioral science·2026
Same journal

Temporal Experience and Human State Field: Toward a New Framework Bridging Physics and Perception Part I-Basic Arguments, Concepts, and Definitions.

Integrative psychological & behavioral science·2026
Same journal

Beyond Fear: Disgust, Anger, and the Affective Core of Interpersonal Phobias.

Integrative psychological & behavioral science·2026
Same journal

Symbolic Closure and the Neurodevelopment of Social Normativity.

Integrative psychological & behavioral science·2026
See all related articles

Related Experiment Video

Updated: May 8, 2025

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences
00:08

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences

Published on: September 4, 2019

6.9K

Moving From Statistical to Hypothesis-driven Outliers.

Alexander von Eye1, Wolfgang Wiedermann2

  • 1Michigan State University, 190 Allee du Nouveau Monde, 34000, Montpellier, France. voneye@msu.edu.

Integrative Psychological & Behavioral Science
|April 23, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a novel method for outlier analysis in categorical data, defining outliers by their extremity relative to hypotheses. Configural Frequency Analysis (CFA) reveals how unsupervised outlier detection can distort supervised classification results.

Keywords:
CFAConfigural frequency analysisDistance outlierHypothesis outlierOutlier

More Related Videos

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.2K
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
20:24

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Published on: January 31, 2014

16.4K

Related Experiment Videos

Last Updated: May 8, 2025

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences
00:08

A Cross-Disciplinary and Multi-Modal Experimental Design for Studying Near-Real-Time Authentic Examination Experiences

Published on: September 4, 2019

6.9K
Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.2K
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
20:24

Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study

Published on: January 31, 2014

16.4K

Area of Science:

  • Statistics
  • Data Mining
  • Categorical Data Analysis

Background:

  • Traditional outlier analysis relies on data characteristics like distance or correlation.
  • This approach is applicable to various data types and analysis scales.
  • A gap exists in defining outliers based on substantive hypotheses.

Purpose of the Study:

  • To propose a new approach for outlier analysis in categorical data.
  • To define outliers as data points extreme relative to substantive hypotheses.
  • To compare standard outlier analysis with Configural Frequency Analysis (CFA).

Main Methods:

  • Proposed defining outliers based on extremity to substantive hypotheses.
  • Introduced a two-step outlier analysis: standard analysis and CFA.
  • Utilized cluster analysis for unsupervised classification and CFA for supervised classification.

Main Results:

  • Outliers identified via unsupervised classification can distort supervised classification outcomes.
  • Configural Frequency Analysis (CFA) identifies outliers as cells contradicting a null hypothesis.
  • The interplay between unsupervised and supervised classification methods was examined.

Conclusions:

  • A new perspective on outlier definition in categorical data is presented.
  • Configural Frequency Analysis offers a hypothesis-driven approach to outlier detection.
  • Understanding the impact of unsupervised outlier identification on supervised methods is crucial.