Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Outliers and Influential Points

Outliers and Influential Points

An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...

Detection of Gross Error: The Q Test

Detection of Gross Error: The Q Test

When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...

What Are Outliers?

What Are Outliers?

Outliers are observed data points that are far from the least squares line. They have unusual values and need to be examined carefully. Though an outlier may result from erroneous data, at other times, it may hold valuable information about the population under study and should be included in the data. Hence, it is crucial to examine what causes a data point to be an outlier.
The z score is used to find outliers or unusual values. It should be noted that any values beyond -2 and +2 are...

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Modified Boxplots

Modified Boxplots

A standard box and whisker plot informs us about the spread of the data in a given sample. One can identify the minimum value, maximum value, first quartile value, second quartile or median value, and third quartile.
However, the box plot does not tell the reader about outliers - values that lie far from the center of the data. We can modify the standard box and whisker plot to identify the outliers and visualize the actual spread of the data in a sample.
Initially, we calculate the adjusted...

Midpoint Rule

Midpoint Rule

Approximating areas under curved boundaries is a common problem in applied mathematics, particularly when an exact calculation is difficult or impractical. One effective numerical method for this purpose is the Midpoint Rule, which provides an estimate of the area under a curve by using rectangular approximations over a specified interval.Description of the Midpoint RuleThe Midpoint Rule begins by dividing the given interval into a number of equal subintervals. For each subinterval, the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Policy-Based Active Learning for Efficient Molecular Identification.

Journal of chemical information and modeling·2026

Same author

Multimodal Deep Learning with Routine Clinical Data for Recurrence Risk Stratification in HR<sup>+</sup>/HER2<sup>-</sup> Early Breast Cancer.

Research (Washington, D.C.)·2026

Same author

A systematic comparison of methodologies for the estimation of the serial interval.

Infectious Disease Modelling·2026

Same author

Learning With Partial and Noisy Correspondence in Graph Matching.

IEEE transactions on pattern analysis and machine intelligence·2026

Same author

Few-shot molecular property optimization <i>via</i> a domain-specialized large language model.

Chemical science·2026

Same author

Task-specific pre-training for molecular property prediction.

Briefings in bioinformatics·2026

Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026

Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026

Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026

Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 13, 2026

Automated Midline Shift and Intracranial Pressure Estimation based on Brain CT Images

Automated Midline Shift and Intracranial Pressure Estimation based on Brain CT Images

Published on: April 13, 2013

An Efficient Representation-Based Method for Boundary Point and Outlier Detection.

Xiaojie Li, Jiancheng Lv, Zhang Yi

IEEE Transactions on Neural Networks and Learning Systems

|October 25, 2016

Summary

This summary is machine-generated.

This study introduces an efficient representation-based method to detect boundary points and outliers. The novel "reverse unreachability" metric effectively identifies these valuable data points, regardless of data distribution or dimensionality.

More Related Videos

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Published on: August 30, 2013

A System for Tracking the Dynamics of Social Preference Behavior in Small Rodents

A System for Tracking the Dynamics of Social Preference Behavior in Small Rodents

Published on: November 21, 2019

Related Experiment Videos

Last Updated: Mar 13, 2026

Automated Midline Shift and Intracranial Pressure Estimation based on Brain CT Images

Automated Midline Shift and Intracranial Pressure Estimation based on Brain CT Images

Published on: April 13, 2013

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Published on: August 30, 2013

A System for Tracking the Dynamics of Social Preference Behavior in Small Rodents

A System for Tracking the Dynamics of Social Preference Behavior in Small Rodents

Published on: November 21, 2019

Area of Science:

Data Mining and Machine Learning
Computational Statistics

Background:

Detecting boundary points and outliers is crucial for uncovering valuable patterns in data.
Traditional methods may struggle with diverse data distributions and high-dimensional spaces.

Purpose of the Study:

To present an efficient representation-based method for detecting boundary points and outliers.
To introduce and validate the 'reverse unreachability' metric for identifying these points.

Main Methods:

Utilizes an efficient representation-based approach to analyze data structure.
Calculates 'reverse unreachability' by counting zero and negative components in a point's representation.
Evaluates data points based on their reverse unreachability score to identify boundary points and outliers.

Main Results:

The reverse unreachability metric effectively distinguishes boundary points and outliers from normal observations.
Higher reverse unreachability scores correlate with lower data density and increased likelihood of being a boundary point or outlier.
The method demonstrates superior performance across synthetic and real-world datasets, outperforming related techniques.

Conclusions:

The proposed representation-based method with reverse unreachability is effective and efficient for outlier and boundary point detection.
This approach accurately reflects data characteristics and is robust to data distribution and dimensionality.
The method successfully identifies both boundary points and outliers simultaneously.