Missing value imputation using least squares techniques in contaminated matrices.

Marisol Garcia-Peña¹, Sergio Arciniegas-Alarcón², Wojtek J Krzanowski³

¹Pontificia Universidad Javeriana, Departamento de Matemáticas, Bogotá, Colombia.

April 28, 2022

Summary

This summary is machine-generated.

This study introduces pre-processing methods that make least-squares matrix imputation robust to outliers. Applying a robust singular value decomposition, or detecting outliers and treating them as missing values, limits the influence of contaminated cells on the imputed values.

Keywords:

Cross-validation; Eigenvalues; Eigenvectors; Genotype-by-environment interaction; Iterative computational scheme; Missing values; Robust singular value decomposition

Area of Science:

Data Science
Statistical Modeling
Matrix Approximation

Background:

Matrix imputation methods using least squares can be sensitive to outliers.
Outliers can degrade imputation quality and cause convergence issues.
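
As a minimal illustration of the sensitivity described above, the sketch below fits a rank-1 least-squares approximation (truncated SVD) to a small synthetic matrix before and after corrupting a single cell. The matrix, the size of the error (50), and the helper name `rank_k_approx` are assumptions made for illustration; none of them come from the study.

```python
import numpy as np

def rank_k_approx(X, k):
    """Best rank-k least-squares approximation of X via truncated SVD."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U[:, :k] * s[:k]) @ Vt[:k, :]

# Hypothetical clean matrix of exact rank 1 (8 rows x 5 columns).
clean = np.outer(np.linspace(1.0, 2.0, 8), np.linspace(1.0, 2.0, 5))

# Same matrix with one gross error added to a single cell.
contaminated = clean.copy()
contaminated[0, 0] += 50.0

# Compare both fits against the clean values, ignoring the corrupted cell.
mask = np.ones_like(clean, dtype=bool)
mask[0, 0] = False
max_err_clean = np.abs(rank_k_approx(clean, 1) - clean)[mask].max()
max_err_contaminated = np.abs(rank_k_approx(contaminated, 1) - clean)[mask].max()

print(f"max error on clean cells, clean fit:        {max_err_clean:.2e}")
print(f"max error on clean cells, contaminated fit: {max_err_contaminated:.2e}")
```

In this toy case the single corrupted cell pulls the leading singular vectors toward itself, so the fit to the remaining clean cells degrades even though those cells are error-free.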

Purpose of the Study:

To develop and evaluate pre-processing strategies for robust data imputation.
To mitigate the impact of outliers on matrix imputation algorithms.

Main Methods:

Explored pre-processing options before applying a mixture of regression and lower rank approximation.
Investigated robust singular value decomposition (SVD) and outlier detection followed by treating outliers as missing data.
Evaluated methods using cross-validation on real-world multi-environment trial data.
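
The second pre-processing option listed above (detect outliers, then treat them as missing) can be sketched roughly as follows. This is not the authors' algorithm: the column-wise median/MAD rule, the 3.5 threshold, and the plain iterative SVD imputation are stand-ins chosen for illustration, and the function names `flag_outliers_mad` and `svd_impute` are hypothetical. The study itself uses a mixture of regression and lower rank approximation, which is not reproduced here.

```python
import numpy as np

def flag_outliers_mad(X, threshold=3.5):
    """Flag cells whose column-wise robust z-score exceeds `threshold`.

    Assumed rule (median and MAD per column); not taken from the study.
    """
    med = np.nanmedian(X, axis=0)
    mad = np.nanmedian(np.abs(X - med), axis=0)
    mad = np.where(mad == 0, np.nan, mad)      # avoid division by zero
    robust_z = 0.6745 * (X - med) / mad
    return np.abs(robust_z) > threshold        # NaN cells compare as False

def svd_impute(X, rank=1, n_iter=200, tol=1e-8):
    """Fill NaN cells by iterating a rank-`rank` SVD reconstruction."""
    missing = np.isnan(X)
    filled = np.where(missing, np.nanmean(X, axis=0), X)  # start at column means
    for _ in range(n_iter):
        U, s, Vt = np.linalg.svd(filled, full_matrices=False)
        low_rank = (U[:, :rank] * s[:rank]) @ Vt[:rank, :]
        updated = np.where(missing, low_rank, X)  # observed cells stay fixed
        change = np.max(np.abs(updated - filled))
        filled = updated
        if change < tol:
            break
    return filled

# Hypothetical data: exact rank-1 matrix, one missing cell, one gross error.
clean = np.outer(np.linspace(1.0, 2.0, 8), np.linspace(1.0, 2.0, 5))
observed = clean.copy()
observed[3, 2] = np.nan
observed[0, 0] += 50.0

# Imputation without pre-processing.
naive = svd_impute(observed, rank=1)

# Pre-processing: flagged cells become missing, then impute as before.
flags = flag_outliers_mad(observed)
preprocessed = np.where(flags, np.nan, observed)
robust = svd_impute(preprocessed, rank=1)

err_without = abs(naive[3, 2] - clean[3, 2])
err_with = abs(robust[3, 2] - clean[3, 2])
print(f"error at missing cell without pre-processing: {err_without:.3g}")
print(f"error at missing cell with pre-processing:    {err_with:.3g}")
```

Because the flagged cell is simply re-imputed along with the genuinely missing one, the imputation step itself does not need to change; only the input matrix does. A real analysis would likely need a detection rule suited to the structure of the data, such as row and column effects in multi-environment trials.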

Main Results:

Proposed pre-processing methods significantly improve imputation quality compared to the original algorithm.
Robust SVD effectively handles outliers and enhances the imputation procedure.
Outlier detection and treatment as missing data also yielded robust imputation results.

Conclusions:

The original imputation method is susceptible to outliers and should be replaced by robust alternatives.
Pre-processing steps are crucial for reliable matrix imputation, especially with suspected data contamination.
The proposed methods help maintain imputation performance on datasets that may contain outliers.