Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Outliers and Influential Points01:08

Outliers and Influential Points

6.7K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
6.7K
What Are Outliers?01:12

What Are Outliers?

5.6K
Outliers are observed data points that are far from the least squares line. They have unusual values and need to be examined carefully. Though an outlier may result from erroneous data, at other times, it may hold valuable information about the population under study and should be included in the data. Hence, it is crucial to examine what causes a data point to be an outlier.
The z score is used to find outliers or unusual values. It should be noted that any values beyond -2 and +2 are...
5.6K
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

7.4K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
7.4K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.6K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.6K
Cluster Sampling Method01:20

Cluster Sampling Method

15.6K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
15.6K
Steps in Outbreak Investigation01:18

Steps in Outbreak Investigation

704
In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:
704

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Leveraging mathematical models to predict and control T-cell activation.

PLoS computational biology·2026
Same author

Identification of dynamic models of microbial communities: A workflow addressing identifiability and modeling pitfalls.

PLoS computational biology·2025
Same author

Analysing the Structural Identifiability and Observability of Mechanistic Models of Tumour Growth.

Bioengineering (Basel, Switzerland)·2025
Same author

Correction to "Sequence Control of the Self-Assembly of Elastin-Like Polypeptides into Hydrogels with Bespoke Viscoelastic and Structural Properties".

Biomacromolecules·2025
Same author

Conformal prediction for uncertainty quantification in dynamic biological systems.

PLoS computational biology·2025
Same author

Employing Observability Rank Conditions for Taking into Account Experimental Information a priori.

Bulletin of mathematical biology·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Apr 4, 2026

Statistical Modelling of Cortical Connectivity Using Non-invasive Electroencephalograms
08:51

Statistical Modelling of Cortical Connectivity Using Non-invasive Electroencephalograms

Published on: November 1, 2019

6.2K

Enabling network inference methods to handle missing data and outliers.

Abel Folch-Fortuny1, Alejandro F Villaverde2,3,4, Alberto Ferrer5

  • 1Departamento de Estadística e Investigación Operativa Aplicadas y Calidad, Universitat Politècnica de València, Camino de Vera s/n, Valencia, 46022, Spain. abfolfor@upv.es.

BMC Bioinformatics
|September 4, 2015
PubMed
Summary
This summary is machine-generated.

Trimmed scores regression (TSR) effectively handles missing and outlier data for complex network inference. This method improves data quality, enabling analysis of previously unusable datasets across various scientific fields.

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.4K

Related Experiment Videos

Last Updated: Apr 4, 2026

Statistical Modelling of Cortical Connectivity Using Non-invasive Electroencephalograms
08:51

Statistical Modelling of Cortical Connectivity Using Non-invasive Electroencephalograms

Published on: November 1, 2019

6.2K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.4K

Area of Science:

  • Multidisciplinary network inference
  • Data analysis in biological sciences, chemistry, economics, and sociology

Background:

  • Complex network inference from data is crucial but challenged by data quality issues.
  • Existing methodologies often fail to address missing data or outliers effectively.
  • Proper handling of incomplete and erroneous data is essential for reliable network inference.

Purpose of the Study:

  • To introduce a novel approach for handling missing data and detecting/correcting outliers in datasets.
  • To enhance the capability of network inference methods to analyze incomplete and faulty datasets.
  • To provide a robust data curation step for network inference.

Main Methods:

  • Development of Trimmed Scores Regression (TSR) utilizing multivariate projection to latent structures.
  • TSR imputes missing values coherently with the latent data structure.
  • TSR detects and corrects outlier values through robust estimation.

Main Results:

  • TSR enables network inference on incomplete datasets by imputing missing values.
  • The method effectively substitutes erroneous data points with accurate estimations.
  • Demonstrated integration of TSR with the MIDER network inference method.

Conclusions:

  • The TSR methodology significantly expands the scope of network inference to include previously unmanageable datasets.
  • Comparative studies confirm TSR's superior performance over alternative missing data imputation techniques.
  • TSR offers a comprehensive solution for both missing data and outlier issues in network analysis.