Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.4K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.4K
Estimating Population Mean with Unknown Standard Deviation01:22

Estimating Population Mean with Unknown Standard Deviation

8.1K
In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the...
8.1K
Estimating Population Standard Deviation01:26

Estimating Population Standard Deviation

3.0K
When the population standard deviation is unknown and the sample size is large, the sample standard deviation s is commonly used as a point estimate of σ. However, it can sometimes under or overestimate the population standard deviation. To overcome this drawback, confidence intervals are determined to estimate population parameters and eliminate any calculation bias accurately. However, this only applies to random samples from normally distributed populations. Knowing the sample mean and...
3.0K
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

570
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
570
Estimating Population Mean with Known Standard Deviation01:16

Estimating Population Mean with Known Standard Deviation

8.7K
To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate +...
8.7K
Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Application of weighted low rank approximations: outlier detection in a data matrix.

BMC research notes·2025
Same author

Data from steady-state simulation and economic evaluation in the power to methane context for synthetic natural gas production and power generation.

Data in brief·2024
Same author

Missing value imputation using least squares techniques in contaminated matrices.

MethodsX·2022
Same author

Proposal of an open-source computational toolbox for solving PDEs in the context of chemical reaction engineering using FEniCS and complementary components.

Heliyon·2021
Same author

Confident interpretation of Bayesian decision tree ensembles for clinical applications.

IEEE transactions on information technology in biomedicine : a publication of the IEEE Engineering in Medicine and Biology Society·2007
Same author

Biclustering models for structured microarray data.

IEEE/ACM transactions on computational biology and bioinformatics·2006
Same journal

Facile synthesis of model polystyrene nanoparticles for nanoplastics research.

MethodsX·2026
Same journal

Effectiveness of a posture education program in high school students: A randomized controlled trial protocol.

MethodsX·2026
Same journal

Development and characterization of silicone-based testosterone propionate implants for sustained androgen delivery in juvenile castrated male pigs.

MethodsX·2026
Same journal

Machine learning assisted multi-criteria decision-making approaches for site selection: A systematic review.

MethodsX·2026
Same journal

A systematic analytical framework for multi-source municipal solid waste characterization for energy recovery.

MethodsX·2026
Same journal

Decision tree and reinforcement learning for contextual electricity consumption forecasting in buildings.

MethodsX·2026
See all related articles

Related Experiment Video

Updated: Jul 19, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.7K

Missing value imputation in a data matrix using the regularised singular value decomposition.

Sergio Arciniegas-Alarcón1, Marisol García-Peña2, Wojtek J Krzanowski3

  • 1Universidad de La Sabana, Facultad de Ingeniería, Chía, Colombia.

Methodsx
|August 10, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a regularised singular value decomposition (SVD) imputation method to handle missing data in statistical analysis. The enhanced approach offers competitive performance, improving data imputation quality in various scenarios.

Keywords:
Cross-validationEigenvaluesEigenvectorsGabrielEigen imputation systemGenotype-by-environment interactionIterative computational schemeOverfitting

More Related Videos

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain
07:10

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain

Published on: March 13, 2020

9.8K
Proton Transfer and Protein Conformation Dynamics in Photosensitive Proteins by Time-resolved Step-scan Fourier-transform Infrared Spectroscopy
10:03

Proton Transfer and Protein Conformation Dynamics in Photosensitive Proteins by Time-resolved Step-scan Fourier-transform Infrared Spectroscopy

Published on: June 27, 2014

18.0K

Related Experiment Videos

Last Updated: Jul 19, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.7K
Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain
07:10

Untargeted Liquid Chromatography-Mass Spectrometry-Based Metabolomics Analysis of Wheat Grain

Published on: March 13, 2020

9.8K
Proton Transfer and Protein Conformation Dynamics in Photosensitive Proteins by Time-resolved Step-scan Fourier-transform Infrared Spectroscopy
10:03

Proton Transfer and Protein Conformation Dynamics in Photosensitive Proteins by Time-resolved Step-scan Fourier-transform Infrared Spectroscopy

Published on: June 27, 2014

18.0K

Area of Science:

  • Statistics
  • Data Science
  • Bioinformatics

Background:

  • Statistical analyses often require complete data matrices, but missing data is a common challenge in database construction.
  • Estimating and imputing missing data is a crucial step for maintaining data integrity and enabling robust analysis.

Purpose of the Study:

  • To propose and evaluate a novel imputation method that enhances data matrix completion.
  • To improve the quality of missing data imputation by incorporating regularisation into singular value decomposition (SVD).

Main Methods:

  • A modified imputation technique combining regression with low-rank approximations.
  • Implementation of a regularised SVD, with the regularisation parameter determined via cross-validation.
  • Evaluation using ten real-world datasets from multienvironment trials with varying percentages of missing data.

Main Results:

  • The regularised SVD imputation method demonstrated competitive performance against the classical approach.
  • The proposed method showed superior results in several tested scenarios, effectively handling missing data.
  • The inclusion of a penalised criterion in the computational algorithm smoothed eigenvectors and eigenvalues, preventing overfitting.

Conclusions:

  • The regularised SVD imputation method is a robust and effective technique for addressing missing data in multivariate matrices.
  • This approach offers improved imputation quality and computational stability, particularly when dealing with complex datasets.
  • The method's general applicability extends to various fields requiring multivariate data analysis.