Enabling network inference methods to handle missing data and outliers.

Abel Folch-Fortuny¹, Alejandro F Villaverde²,³,⁴, Alberto Ferrer⁵

¹Departamento de Estadística e Investigación Operativa Aplicadas y Calidad, Universitat Politècnica de València, Camino de Vera s/n, Valencia, 46022, Spain. abfolfor@upv.es.

BMC Bioinformatics

September 4, 2015

Summary

This summary is machine-generated.

Trimmed scores regression (TSR) handles both missing data and outliers in datasets used for complex network inference. By imputing missing values and correcting outliers, the method enables analysis of previously unusable datasets across various scientific fields.

Area of Science:

Multidisciplinary network inference
Data analysis in biological sciences, chemistry, economics, and sociology

Background:

Complex network inference from data is crucial but challenged by data quality issues.
Existing methodologies often fail to address missing data or outliers effectively.
Proper handling of incomplete and erroneous data is essential for reliable network inference.

Purpose of the Study:

To introduce a novel approach for handling missing data and detecting/correcting outliers in datasets.
To enhance the capability of network inference methods to analyze incomplete and faulty datasets.
To provide a robust data curation step for network inference.

Main Methods:

Development of Trimmed Scores Regression (TSR) utilizing multivariate projection to latent structures.
TSR imputes missing values coherently with the latent data structure.
TSR detects and corrects outlier values through robust estimation.
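
The imputation step can be illustrated with a minimal sketch. This page does not show the authors' implementation, so the code below is only one common formulation of TSR for building a principal component model with missing data, written under stated assumptions: the latent structure is modelled with plain PCA, each column has at least one observed value, and the number of components is chosen by the user. The names `tsr_impute`, `n_components`, `max_iter`, and `tol` are hypothetical. Outlier detection and correction are not covered here.

```python
import numpy as np


def tsr_impute(X, n_components, max_iter=200, tol=1e-9):
    """Fill NaN cells of X by alternating a PCA fit and a trimmed scores regression.

    Illustrative sketch only (assumed formulation, not the authors' code).
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)
    # Start from column means computed on the observed values only.
    filled = np.where(missing, np.nanmean(X, axis=0), X)
    rows_with_gaps = np.where(missing.any(axis=1))[0]

    for _ in range(max_iter):
        previous = filled.copy()
        mean = filled.mean(axis=0)
        centered = filled - mean
        cov = centered.T @ centered / (X.shape[0] - 1)
        # Loadings: eigenvectors of the covariance with the largest eigenvalues.
        _, eigvecs = np.linalg.eigh(cov)
        loadings = eigvecs[:, ::-1][:, :n_components]

        for i in rows_with_gaps:
            mis, obs = missing[i], ~missing[i]
            P_obs = loadings[obs]
            # "Trimmed" scores: projection that uses the observed variables only.
            scores = P_obs.T @ centered[i, obs]
            # Regress the missing variables on the trimmed scores.
            score_cov = P_obs.T @ cov[np.ix_(obs, obs)] @ P_obs
            cross_cov = cov[np.ix_(mis, obs)] @ P_obs
            filled[i, mis] = mean[mis] + cross_cov @ np.linalg.pinv(score_cov) @ scores

        # Stop once the imputed cells no longer change.
        if np.max(np.abs(filled - previous)) < tol:
            break
    return filled


# Synthetic example: rank-one data plus column offsets, with three cells removed.
t = np.linspace(-2.0, 2.0, 30)
truth = np.outer(t, [1.0, 2.0, -1.0]) + np.array([10.0, 20.0, 30.0])
X_missing = truth.copy()
X_missing[[2, 10, 27], [0, 1, 2]] = np.nan
completed = tsr_impute(X_missing, n_components=1)
print("largest imputation error:", np.abs(completed - truth).max())
```

The synthetic data are noise-free and exactly one-dimensional, so the imputed cells should land very close to the removed values; on real data the result depends on the number of components and on how much of the dataset is missing.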

Main Results:

TSR enables network inference on incomplete datasets by imputing missing values.
The method effectively substitutes erroneous data points with accurate estimations.
Demonstrated integration of TSR with the MIDER network inference method.

Conclusions:

The TSR methodology significantly expands the scope of network inference to include previously unmanageable datasets.
Comparative studies confirm TSR's superior performance over alternative missing data imputation techniques.
TSR offers a comprehensive solution for both missing data and outlier issues in network analysis.