Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.5K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.5K
Outliers and Influential Points01:08

Outliers and Influential Points

6.6K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
6.6K
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

7.2K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
7.2K
What Are Outliers?01:12

What Are Outliers?

5.6K
Outliers are observed data points that are far from the least squares line. They have unusual values and need to be examined carefully. Though an outlier may result from erroneous data, at other times, it may hold valuable information about the population under study and should be included in the data. Hence, it is crucial to examine what causes a data point to be an outlier.
The z score is used to find outliers or unusual values. It should be noted that any values beyond -2 and +2 are...
5.6K
Steps in Outbreak Investigation01:18

Steps in Outbreak Investigation

693
In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:
693
Protein Networks02:26

Protein Networks

4.7K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
4.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

GWO-Optimized BPNN for Abrasion Resistance Prediction of Nano-SiO<sub>2</sub> and Hybrid Fiber Reinforced Geopolymer Gel Concrete.

Gels (Basel, Switzerland)·2026
Same author

Evolutionary and Structural Analysis Reveals the Gradual Establishment and High Conservation of Auxin Pathways from Algae to Land Plants.

Plant physiology·2026
Same author

Advanced nanobiosensors for the detection of neurovascular damage in cerebral infarction: prospects and challenges.

Journal of biological engineering·2026
Same author

Mechanistic Insight into Formation and Release Enhancement of Sulindac-Amino Acid Small-Molecule Hydrogels.

Pharmaceutical research·2026
Same author

Preoperative sleep disturbance and postoperative delirium in elderly joint replacement patients: a prospective cohort study.

BMC surgery·2026
Same author

Cell-o1 : training LLMs to solve single-cell reasoning puzzles with reinforcement learning.

Bioinformatics (Oxford, England)·2026
Same journal

Fair Spatial Indexing: A paradigm for Group Spatial Fairness.

Advances in database technology : proceedings. International Conference on Extending Database Technology·2024
Same journal

3DPro: Querying Complex Three-Dimensional Data with Progressive Compression and Refinement.

Advances in database technology : proceedings. International Conference on Extending Database Technology·2022
Same journal

Publishing Video Data with Indistinguishable Objects.

Advances in database technology : proceedings. International Conference on Extending Database Technology·2020
Same journal

Distributed query-aware quantization for high-dimensional similarity searches.

Advances in database technology : proceedings. International Conference on Extending Database Technology·2018
Same journal

PROX: Approximated Summarization of Data Provenance.

Advances in database technology : proceedings. International Conference on Extending Database Technology·2016
Same journal

Differentially Private Synthesization of Multi-Dimensional Data using Copula Functions.

Advances in database technology : proceedings. International Conference on Extending Database Technology·2014
See all related articles

Related Experiment Video

Updated: Mar 22, 2026

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.9K

Query-Based Outlier Detection in Heterogeneous Information Networks.

Jonathan Kuck1, Honglei Zhuang1, Xifeng Yan2

  • 1Department of Computer Science, University of Illinois at Urbana-Champaign.

Advances in Database Technology : Proceedings. International Conference on Extending Database Technology
|April 12, 2016
PubMed
Summary
This summary is machine-generated.

This study introduces query-based outlier detection for heterogeneous information networks. It enables flexible user-defined outlier queries and efficient processing for uncovering hidden data patterns.

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.8K

Related Experiment Videos

Last Updated: Mar 22, 2026

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.9K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.8K

Area of Science:

  • Data Science
  • Network Analysis
  • Database Systems

Background:

  • Outlier detection is crucial in large datasets but challenging in high-dimensional spaces.
  • Traditional methods struggle with complex relationships and user-specific interests in heterogeneous networks.
  • Existing approaches lack flexibility for users to define and query for specific outlier types.

Purpose of the Study:

  • To introduce the concept of query-based outliers in heterogeneous information networks (HINs).
  • To develop a flexible query language for users to specify outlier detection criteria.
  • To design efficient algorithms for processing these user-defined outlier queries in large HINs.

Main Methods:

  • Introduced a novel concept of query-based outliers tailored for HINs.
  • Designed a specialized query language for flexible user input on outlier characteristics.
  • Developed and evaluated efficient algorithms for mining outliers based on user queries.
  • Defined a robust outlier measure suitable for the complexities of heterogeneous networks.

Main Results:

  • Demonstrated the ability to define and uncover interesting outliers flexibly in large HINs.
  • Showcased the effectiveness of the proposed query-based approach over traditional methods.
  • Validated the efficiency of the developed processing algorithms on real-world datasets.
  • Confirmed that user-specified queries lead to more relevant and actionable outlier discoveries.

Conclusions:

  • Query-based outlier detection offers a powerful and flexible paradigm for analyzing large heterogeneous information networks.
  • The proposed query language and efficient processing methods enable users to effectively discover hidden patterns and anomalies.
  • This approach enhances the utility of outlier detection by aligning it with specific user interests and search spaces.