Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Ranks01:02

Ranks

209
Unlike parametric methods, nonparametric statistics are ideal for nominal and ordinal data, requiring fewer assumptions about the population's nature or distribution. This makes nonparametric methods easier to apply and interpret, as they do not depend on parameters like mean or standard deviation. One common approach in nonparametric analysis is to sort data according to a specific criterion. For instance, we might arrange weather data from hottest to coldest days in a month or rank cities...
209
Wilcoxon Signed-Ranks Test for Median of Single Population01:14

Wilcoxon Signed-Ranks Test for Median of Single Population

74
The Wilcoxon signed-rank test for the median of a single population is a nonparametric test used to evaluate whether the median of a population differs from a specified value. Unlike parametric tests, it does not require data to follow a normal distribution, making it suitable for non-normal or small samples. The test begins by calculating the difference (d) between each observation and the hypothesized median. The absolute values of these differences are ranked in ascending order, with ties...
74
The Mantel-Cox Log-Rank Test01:19

The Mantel-Cox Log-Rank Test

245
The Mantel-Cox log-rank test is a widely used statistical method for comparing the survival distributions of two groups. It tests whether a statistically significant difference exists in survival times between the groups without assuming a specific distribution for the survival data, making it a non-parametric test. This flexibility makes the log-rank test particularly valuable in medical research and other fields where the timing of an event, such as death or disease recurrence, is of...
245
Ordinal Level of Measurement00:55

Ordinal Level of Measurement

22.6K
The way a set of data is measured is called its level of measurement. Correct statistical procedures depend on a researcher being familiar with levels of measurement. For analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
Data measured using an ordinal scale are similar to nominal scale data, but there is one major difference. The ordinal scale data can be ordered. An example of ordinal scale data is a list of the top five national parks...
22.6K
Wilcoxon Rank-Sum Test01:21

Wilcoxon Rank-Sum Test

125
The Wilcoxon rank-sum test, also known as the Mann-Whitney U test, is a nonparametric test used to determine if there is a significant difference between the distributions of two independent samples. This test is designed specifically for two independent populations and has the following key requirements:
125
Quartile01:15

Quartile

4.0K
Quartiles are numbers that separate the data into quarters. Quartiles may or may not be part of the data. To find the quartiles, first, find the median or second quartile. The first quartile, Q1, is the middle value of the lower half of the data, and the third quartile, Q3, is the middle value, or median, of the upper half of the data. To get the idea, consider the same data set:
1; 1; 2; 2; 4; 6; 6.8; 7.2; 8; 8.3; 9; 10; 10; 11.5
The median or second quartile is seven. The lower half of the...
4.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Metabolic and Inflammatory Markers of Fatty Liver As Associated Factors of Hepatic Metastatic Spread in Colorectal Cancer.

Cureus·2026
Same author

A phase 3, randomized clinical trial of soticlestat as adjunctive therapy for Lennox-Gastaut syndrome.

Epilepsia·2026
Same author

A Multidimensional Cross-Sectional Impact Assessment of Teleconsultation Services at Emirates Health Services, United Arab Emirates.

Telemedicine journal and e-health : the official journal of the American Telemedicine Association·2026
Same author

Soticlestat as an adjunctive therapy in children and young adults with Dravet syndrome.

Epilepsia·2026
Same author

Entropy and thermal dynamics motivated by ternary nanocomposites and geometric influence of oblique channel.

Scientific reports·2026
Same author

Microstructural and Rheological Properties of Camel and Bovine Milk Fermented with Five Lactic Acid Bacteria Strains.

Foods (Basel, Switzerland)·2026
Same journal

Elastic functional Cox regression model with shape predictors.

Journal of applied statistics·2026
Same journal

An improved two-stage binary relevance method for multilabel classification.

Journal of applied statistics·2026
Same journal

Classification of multivariate functional data with an application to ADHD fMRI data.

Journal of applied statistics·2026
Same journal

Assessing the performance of longitudinal T-lymphocytes as biomarkers of immune recovery in HIV-infected children with or without TB co-infection.

Journal of applied statistics·2026
Same journal

Sparse long-only Markowitz portfolio optimization.

Journal of applied statistics·2026
Same journal

Homogeneity of multinomial populations when data are classified into a large number of groups.

Journal of applied statistics·2026
See all related articles

Related Experiment Video

Updated: May 17, 2025

RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells
18:30

RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells

Published on: February 13, 2013

21.8K

A novel ranked k-nearest neighbors algorithm for missing data imputation.

Yasir Khan1, Said Farooq Shah2, Syed Muhammad Asim2

  • 1Government College of Management Sciences Jamrud, Jamrud, KP, Pakistan.

Journal of Applied Statistics
|March 31, 2025
PubMed
Summary
This summary is machine-generated.

Ranked k Nearest Neighbors imputation improves data accuracy by selecting relevant neighbors, outperforming standard kNN for missing data, especially with small correlations.

Keywords:
62-08Imputationk nearest neighborsmissing dataranked set sampling

More Related Videos

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

8.5K
A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing
15:00

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

491

Related Experiment Videos

Last Updated: May 17, 2025

RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells
18:30

RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells

Published on: February 13, 2013

21.8K
Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

8.5K
A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing
15:00

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

491

Area of Science:

  • Data Science
  • Statistics
  • Machine Learning

Background:

  • Missing data is a prevalent challenge in data analysis across various scientific domains.
  • The standard k Nearest Neighbors (kNN) imputation method struggles with accuracy in datasets exhibiting small pairwise correlations or small k values.
  • Existing imputation techniques often face limitations in handling specific data characteristics.

Purpose of the Study:

  • To introduce a novel imputation method, Ranked k Nearest Neighbors (Ranked kNN).
  • To enhance imputation accuracy compared to the traditional kNN method.
  • To address limitations of standard kNN in datasets with low correlations and small k values.

Main Methods:

  • The proposed Ranked kNN method adapts the kNN approach by incorporating Ranked Set Sampling (RSS).
  • RSS is utilized to strategically select the most pertinent neighbors for imputation.
  • The method was evaluated under Missing Completely at Random (MCAR) and Missing at Random (MAR) mechanisms.

Main Results:

  • Ranked kNN demonstrated superior imputation accuracy over standard kNN across all tested datasets.
  • The proposed method consistently yielded lower Mean Squared Imputation Error (MSIE) and Mean Absolute Imputation Error (MAIE) values.
  • Performance improvements were particularly notable in scenarios with small pairwise correlations and small k values.

Conclusions:

  • The Ranked kNN imputation method offers a significant advancement for handling missing data.
  • It provides a promising alternative for datasets with challenging characteristics, such as low correlations and small k.
  • The method enhances imputation accuracy without introducing additional computational complexity, making it broadly applicable.