Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.9K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.9K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

2.3K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
2.3K
Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

89
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
89
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

770
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
770
Estimating Population Mean with Unknown Standard Deviation01:22

Estimating Population Mean with Unknown Standard Deviation

8.3K
In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the...
8.3K
Truncation in Survival Analysis01:09

Truncation in Survival Analysis

332
Truncation in survival analysis refers to the exclusion of individuals or events from the dataset based on specific criteria related to the time of the event. This exclusion can happen in two primary forms: left truncation and right truncation.
Left truncation occurs when individuals who experienced the event of interest before a certain time are not included in the study. This is often due to a "delayed entry" into the study where only those who survive until a certain entry point are...
332

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Application of weighted low rank approximations: outlier detection in a data matrix.

BMC research notes·2025
Same author

Missing value imputation in a data matrix using the regularised singular value decomposition.

MethodsX·2023
Same author

Confident interpretation of Bayesian decision tree ensembles for clinical applications.

IEEE transactions on information technology in biomedicine : a publication of the IEEE Engineering in Medicine and Biology Society·2007
Same author

Biclustering models for structured microarray data.

IEEE/ACM transactions on computational biology and bioinformatics·2006
Same journal

Facile synthesis of model polystyrene nanoparticles for nanoplastics research.

MethodsX·2026
Same journal

Effectiveness of a posture education program in high school students: A randomized controlled trial protocol.

MethodsX·2026
Same journal

Development and characterization of silicone-based testosterone propionate implants for sustained androgen delivery in juvenile castrated male pigs.

MethodsX·2026
Same journal

Machine learning assisted multi-criteria decision-making approaches for site selection: A systematic review.

MethodsX·2026
Same journal

A systematic analytical framework for multi-source municipal solid waste characterization for energy recovery.

MethodsX·2026
Same journal

Decision tree and reinforcement learning for contextual electricity consumption forecasting in buildings.

MethodsX·2026
See all related articles

Related Experiment Video

Updated: Sep 25, 2025

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain
07:10

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain

Published on: March 13, 2020

10.2K

Missing value imputation using least squares techniques in contaminated matrices.

Marisol Garcia-Peña1, Sergio Arciniegas-Alarcón2, Wojtek J Krzanowski3

  • 1Pontificia Universidad Javeriana, Departamento de Matemáticas, Bogotá, Colombia.

Methodsx
|April 28, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces robust methods to improve data imputation quality by handling outliers. Pre-processing techniques like robust singular value decomposition prevent data errors from affecting imputation results.

Keywords:
Cross-validationEigenvaluesEigenvectorsGenotype-by-environment interactionIterative computational schemeMissing valuesRobust singular value decomposition

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.4K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.7K

Related Experiment Videos

Last Updated: Sep 25, 2025

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain
07:10

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain

Published on: March 13, 2020

10.2K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.4K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.7K

Area of Science:

  • Data Science
  • Statistical Modeling
  • Matrix Approximation

Background:

  • Matrix imputation methods using least squares can be sensitive to outliers.
  • Outliers can degrade imputation quality and cause convergence issues.

Purpose of the Study:

  • To develop and evaluate pre-processing strategies for robust data imputation.
  • To mitigate the impact of outliers on matrix imputation algorithms.

Main Methods:

  • Explored pre-processing options before applying a mixture of regression and lower rank approximation.
  • Investigated robust singular value decomposition (SVD) and outlier detection followed by treating outliers as missing data.
  • Evaluated methods using cross-validation on real-world multi-environment trial data.

Main Results:

  • Proposed pre-processing methods significantly improve imputation quality compared to the original algorithm.
  • Robust SVD effectively handles outliers and enhances the imputation procedure.
  • Outlier detection and treatment as missing data also yielded robust imputation results.

Conclusions:

  • The original imputation method is susceptible to outliers and should be replaced by robust alternatives.
  • Pre-processing steps are crucial for reliable matrix imputation, especially with suspected data contamination.
  • The proposed methods ensure algorithm performance on datasets with potential outlier contamination.