Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Outliers and Influential Points01:08

Outliers and Influential Points

6.5K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
6.5K
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

7.2K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
7.2K
What Are Outliers?01:12

What Are Outliers?

5.5K
Outliers are observed data points that are far from the least squares line. They have unusual values and need to be examined carefully. Though an outlier may result from erroneous data, at other times, it may hold valuable information about the population under study and should be included in the data. Hence, it is crucial to examine what causes a data point to be an outlier.
The z score is used to find outliers or unusual values. It should be noted that any values beyond -2 and +2 are...
5.5K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.4K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.4K
Modified Boxplots00:57

Modified Boxplots

11.7K
A standard box and whisker plot informs us about the spread of the data in a given sample. One can identify the minimum value, maximum value, first quartile value, second quartile or median value, and third quartile.
However, the box plot does not tell the reader about outliers - values that lie far from the center of the data. We can modify the standard box and whisker plot to identify the outliers and visualize the actual spread of the data in a sample.
Initially, we calculate the adjusted...
11.7K
Midpoint Rule01:20

Midpoint Rule

117
Approximating areas under curved boundaries is a common problem in applied mathematics, particularly when an exact calculation is difficult or impractical. One effective numerical method for this purpose is the Midpoint Rule, which provides an estimate of the area under a curve by using rectangular approximations over a specified interval.Description of the Midpoint RuleThe Midpoint Rule begins by dividing the given interval into a number of equal subintervals. For each subinterval, the...
117

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Policy-Based Active Learning for Efficient Molecular Identification.

Journal of chemical information and modeling·2026
Same author

Multimodal Deep Learning with Routine Clinical Data for Recurrence Risk Stratification in HR<sup>+</sup>/HER2<sup>-</sup> Early Breast Cancer.

Research (Washington, D.C.)·2026
Same author

A systematic comparison of methodologies for the estimation of the serial interval.

Infectious Disease Modelling·2026
Same author

Learning With Partial and Noisy Correspondence in Graph Matching.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Few-shot molecular property optimization <i>via</i> a domain-specialized large language model.

Chemical science·2026
Same author

Task-specific pre-training for molecular property prediction.

Briefings in bioinformatics·2026
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Mar 13, 2026

Automated Midline Shift and Intracranial Pressure Estimation based on Brain CT Images
14:08

Automated Midline Shift and Intracranial Pressure Estimation based on Brain CT Images

Published on: April 13, 2013

43.7K

An Efficient Representation-Based Method for Boundary Point and Outlier Detection.

Xiaojie Li, Jiancheng Lv, Zhang Yi

    IEEE Transactions on Neural Networks and Learning Systems
    |October 25, 2016
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces an efficient representation-based method to detect boundary points and outliers. The novel "reverse unreachability" metric effectively identifies these valuable data points, regardless of data distribution or dimensionality.

    More Related Videos

    Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns
    13:44

    Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

    Published on: August 30, 2013

    43.8K
    A System for Tracking the Dynamics of Social Preference Behavior in Small Rodents
    08:38

    A System for Tracking the Dynamics of Social Preference Behavior in Small Rodents

    Published on: November 21, 2019

    8.3K

    Related Experiment Videos

    Last Updated: Mar 13, 2026

    Automated Midline Shift and Intracranial Pressure Estimation based on Brain CT Images
    14:08

    Automated Midline Shift and Intracranial Pressure Estimation based on Brain CT Images

    Published on: April 13, 2013

    43.7K
    Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns
    13:44

    Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

    Published on: August 30, 2013

    43.8K
    A System for Tracking the Dynamics of Social Preference Behavior in Small Rodents
    08:38

    A System for Tracking the Dynamics of Social Preference Behavior in Small Rodents

    Published on: November 21, 2019

    8.3K

    Area of Science:

    • Data Mining and Machine Learning
    • Computational Statistics

    Background:

    • Detecting boundary points and outliers is crucial for uncovering valuable patterns in data.
    • Traditional methods may struggle with diverse data distributions and high-dimensional spaces.

    Purpose of the Study:

    • To present an efficient representation-based method for detecting boundary points and outliers.
    • To introduce and validate the 'reverse unreachability' metric for identifying these points.

    Main Methods:

    • Utilizes an efficient representation-based approach to analyze data structure.
    • Calculates 'reverse unreachability' by counting zero and negative components in a point's representation.
    • Evaluates data points based on their reverse unreachability score to identify boundary points and outliers.

    Main Results:

    • The reverse unreachability metric effectively distinguishes boundary points and outliers from normal observations.
    • Higher reverse unreachability scores correlate with lower data density and increased likelihood of being a boundary point or outlier.
    • The method demonstrates superior performance across synthetic and real-world datasets, outperforming related techniques.

    Conclusions:

    • The proposed representation-based method with reverse unreachability is effective and efficient for outlier and boundary point detection.
    • This approach accurately reflects data characteristics and is robust to data distribution and dimensionality.
    • The method successfully identifies both boundary points and outliers simultaneously.