Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Ranks

Ranks

Unlike parametric methods, nonparametric statistics are ideal for nominal and ordinal data, requiring fewer assumptions about the population's nature or distribution. This makes nonparametric methods easier to apply and interpret, as they do not depend on parameters like mean or standard deviation. One common approach in nonparametric analysis is to sort data according to a specific criterion. For instance, we might arrange weather data from hottest to coldest days in a month or rank cities...

Wilcoxon Signed-Ranks Test for Median of Single Population

Wilcoxon Signed-Ranks Test for Median of Single Population

The Wilcoxon signed-rank test for the median of a single population is a nonparametric test used to evaluate whether the median of a population differs from a specified value. Unlike parametric tests, it does not require data to follow a normal distribution, making it suitable for non-normal or small samples. The test begins by calculating the difference (d) between each observation and the hypothesized median. The absolute values of these differences are ranked in ascending order, with ties...

The Mantel-Cox Log-Rank Test

The Mantel-Cox Log-Rank Test

The Mantel-Cox log-rank test is a widely used statistical method for comparing the survival distributions of two groups. It tests whether a statistically significant difference exists in survival times between the groups without assuming a specific distribution for the survival data, making it a non-parametric test. This flexibility makes the log-rank test particularly valuable in medical research and other fields where the timing of an event, such as death or disease recurrence, is of...

Ordinal Level of Measurement

Ordinal Level of Measurement

The way a set of data is measured is called its level of measurement. Correct statistical procedures depend on a researcher being familiar with levels of measurement. For analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
Data measured using an ordinal scale are similar to nominal scale data, but there is one major difference. The ordinal scale data can be ordered. An example of ordinal scale data is a list of the top five national parks...

Wilcoxon Rank-Sum Test

Wilcoxon Rank-Sum Test

The Wilcoxon rank-sum test, also known as the Mann-Whitney U test, is a nonparametric test used to determine if there is a significant difference between the distributions of two independent samples. This test is designed specifically for two independent populations and has the following key requirements:

Quartile

Quartile

Quartiles are numbers that separate the data into quarters. Quartiles may or may not be part of the data. To find the quartiles, first, find the median or second quartile. The first quartile, Q1, is the middle value of the lower half of the data, and the third quartile, Q3, is the middle value, or median, of the upper half of the data. To get the idea, consider the same data set:
1; 1; 2; 2; 4; 6; 6.8; 7.2; 8; 8.3; 9; 10; 10; 11.5
The median or second quartile is seven. The lower half of the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Metabolic and Inflammatory Markers of Fatty Liver As Associated Factors of Hepatic Metastatic Spread in Colorectal Cancer.

Cureus·2026

Same author

A phase 3, randomized clinical trial of soticlestat as adjunctive therapy for Lennox-Gastaut syndrome.

Epilepsia·2026

Same author

A Multidimensional Cross-Sectional Impact Assessment of Teleconsultation Services at Emirates Health Services, United Arab Emirates.

Telemedicine journal and e-health : the official journal of the American Telemedicine Association·2026

Same author

Soticlestat as an adjunctive therapy in children and young adults with Dravet syndrome.

Epilepsia·2026

Same author

Entropy and thermal dynamics motivated by ternary nanocomposites and geometric influence of oblique channel.

Scientific reports·2026

Same author

Microstructural and Rheological Properties of Camel and Bovine Milk Fermented with Five Lactic Acid Bacteria Strains.

Foods (Basel, Switzerland)·2026

Same journal

Elastic functional Cox regression model with shape predictors.

Journal of applied statistics·2026

Same journal

An improved two-stage binary relevance method for multilabel classification.

Journal of applied statistics·2026

Same journal

Classification of multivariate functional data with an application to ADHD fMRI data.

Journal of applied statistics·2026

Same journal

Assessing the performance of longitudinal T-lymphocytes as biomarkers of immune recovery in HIV-infected children with or without TB co-infection.

Journal of applied statistics·2026

Same journal

Sparse long-only Markowitz portfolio optimization.

Journal of applied statistics·2026

Same journal

Homogeneity of multinomial populations when data are classified into a large number of groups.

Journal of applied statistics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 17, 2025

RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells

RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells

Published on: February 13, 2013

A novel ranked k-nearest neighbors algorithm for missing data imputation.

Yasir Khan¹, Said Farooq Shah², Syed Muhammad Asim²

¹Government College of Management Sciences Jamrud, Jamrud, KP, Pakistan.

Journal of Applied Statistics

|March 31, 2025

Summary

This summary is machine-generated.

Ranked k Nearest Neighbors imputation improves data accuracy by selecting relevant neighbors, outperforming standard kNN for missing data, especially with small correlations.

Keywords:

62-08 Imputation k nearest neighbors missing data ranked set sampling

More Related Videos

Novel Sequence Discovery by Subtractive Genomics

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

Related Experiment Videos

Last Updated: May 17, 2025

RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells

RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells

Published on: February 13, 2013

Novel Sequence Discovery by Subtractive Genomics

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

Area of Science:

Data Science
Statistics
Machine Learning

Background:

Missing data is a prevalent challenge in data analysis across various scientific domains.
The standard k Nearest Neighbors (kNN) imputation method struggles with accuracy in datasets exhibiting small pairwise correlations or small k values.
Existing imputation techniques often face limitations in handling specific data characteristics.

Purpose of the Study:

To introduce a novel imputation method, Ranked k Nearest Neighbors (Ranked kNN).
To enhance imputation accuracy compared to the traditional kNN method.
To address limitations of standard kNN in datasets with low correlations and small k values.

Main Methods:

The proposed Ranked kNN method adapts the kNN approach by incorporating Ranked Set Sampling (RSS).
RSS is utilized to strategically select the most pertinent neighbors for imputation.
The method was evaluated under Missing Completely at Random (MCAR) and Missing at Random (MAR) mechanisms.

Main Results:

Ranked kNN demonstrated superior imputation accuracy over standard kNN across all tested datasets.
The proposed method consistently yielded lower Mean Squared Imputation Error (MSIE) and Mean Absolute Imputation Error (MAIE) values.
Performance improvements were particularly notable in scenarios with small pairwise correlations and small k values.

Conclusions:

The Ranked kNN imputation method offers a significant advancement for handling missing data.
It provides a promising alternative for datasets with challenging characteristics, such as low correlations and small k.
The method enhances imputation accuracy without introducing additional computational complexity, making it broadly applicable.