Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

Estimating Population Mean with Unknown Standard Deviation

Estimating Population Mean with Unknown Standard Deviation

In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the...

Estimating Population Standard Deviation

Estimating Population Standard Deviation

When the population standard deviation is unknown and the sample size is large, the sample standard deviation s is commonly used as a point estimate of σ. However, it can sometimes under or overestimate the population standard deviation. To overcome this drawback, confidence intervals are determined to estimate population parameters and eliminate any calculation bias accurately. However, this only applies to random samples from normally distributed populations. Knowing the sample mean and...

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

Estimating Population Mean with Known Standard Deviation

Estimating Population Mean with Known Standard Deviation

To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate +...

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Application of weighted low rank approximations: outlier detection in a data matrix.

BMC research notes·2025

Same author

Data from steady-state simulation and economic evaluation in the power to methane context for synthetic natural gas production and power generation.

Data in brief·2024

Same author

Missing value imputation using least squares techniques in contaminated matrices.

MethodsX·2022

Same author

Proposal of an open-source computational toolbox for solving PDEs in the context of chemical reaction engineering using FEniCS and complementary components.

Heliyon·2021

Same author

Confident interpretation of Bayesian decision tree ensembles for clinical applications.

IEEE transactions on information technology in biomedicine : a publication of the IEEE Engineering in Medicine and Biology Society·2007

Same author

Biclustering models for structured microarray data.

IEEE/ACM transactions on computational biology and bioinformatics·2006

Same journal

Facile synthesis of model polystyrene nanoparticles for nanoplastics research.

MethodsX·2026

Same journal

Effectiveness of a posture education program in high school students: A randomized controlled trial protocol.

MethodsX·2026

Same journal

Development and characterization of silicone-based testosterone propionate implants for sustained androgen delivery in juvenile castrated male pigs.

MethodsX·2026

Same journal

Machine learning assisted multi-criteria decision-making approaches for site selection: A systematic review.

MethodsX·2026

Same journal

A systematic analytical framework for multi-source municipal solid waste characterization for energy recovery.

MethodsX·2026

Same journal

Decision tree and reinforcement learning for contextual electricity consumption forecasting in buildings.

MethodsX·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 19, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

Missing value imputation in a data matrix using the regularised singular value decomposition.

Sergio Arciniegas-Alarcón¹, Marisol García-Peña², Wojtek J Krzanowski³

¹Universidad de La Sabana, Facultad de Ingeniería, Chía, Colombia.

|August 10, 2023

Summary

This summary is machine-generated.

This study introduces a regularised singular value decomposition (SVD) imputation method to handle missing data in statistical analysis. The enhanced approach offers competitive performance, improving data imputation quality in various scenarios.

Keywords:

Cross-validation Eigenvalues Eigenvectors GabrielEigen imputation system Genotype-by-environment interaction Iterative computational scheme Overfitting

More Related Videos

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain

Published on: March 13, 2020

Proton Transfer and Protein Conformation Dynamics in Photosensitive Proteins by Time-resolved Step-scan Fourier-transform Infrared Spectroscopy

Proton Transfer and Protein Conformation Dynamics in Photosensitive Proteins by Time-resolved Step-scan Fourier-transform Infrared Spectroscopy

Published on: June 27, 2014

Related Experiment Videos

Last Updated: Jul 19, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain

Published on: March 13, 2020

Proton Transfer and Protein Conformation Dynamics in Photosensitive Proteins by Time-resolved Step-scan Fourier-transform Infrared Spectroscopy

Proton Transfer and Protein Conformation Dynamics in Photosensitive Proteins by Time-resolved Step-scan Fourier-transform Infrared Spectroscopy

Published on: June 27, 2014

Area of Science:

Statistics
Data Science
Bioinformatics

Background:

Statistical analyses often require complete data matrices, but missing data is a common challenge in database construction.
Estimating and imputing missing data is a crucial step for maintaining data integrity and enabling robust analysis.

Purpose of the Study:

To propose and evaluate a novel imputation method that enhances data matrix completion.
To improve the quality of missing data imputation by incorporating regularisation into singular value decomposition (SVD).

Main Methods:

A modified imputation technique combining regression with low-rank approximations.
Implementation of a regularised SVD, with the regularisation parameter determined via cross-validation.
Evaluation using ten real-world datasets from multienvironment trials with varying percentages of missing data.

Main Results:

The regularised SVD imputation method demonstrated competitive performance against the classical approach.
The proposed method showed superior results in several tested scenarios, effectively handling missing data.
The inclusion of a penalised criterion in the computational algorithm smoothed eigenvectors and eigenvalues, preventing overfitting.

Conclusions:

The regularised SVD imputation method is a robust and effective technique for addressing missing data in multivariate matrices.
This approach offers improved imputation quality and computational stability, particularly when dealing with complex datasets.
The method's general applicability extends to various fields requiring multivariate data analysis.