Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Aggregates Classification01:29

Aggregates Classification

306
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
306
Mean Absolute Deviation01:13

Mean Absolute Deviation

2.6K
The mean absolute deviation is also a measure of the variability of data in a sample. It is the absolute value of the average difference between the data values and the mean.
Let us consider a dataset containing the number of unsold cupcakes in five shops: 10, 15, 8, 7, and 10. Initially, calculate the sample mean. Then calculate the deviation, or the difference, between each data value and the mean. Next, the absolute values of these deviations are added and divided by the sample size to...
2.6K
Weighted Mean00:57

Weighted Mean

4.9K
While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...
4.9K
Prediction Intervals01:03

Prediction Intervals

2.2K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.2K
Wilcoxon Signed-Ranks Test for Matched Pairs01:09

Wilcoxon Signed-Ranks Test for Matched Pairs

94
The Wilcoxon signed-rank test for matched pairs evaluates the null hypothesis by combining the ranks of differences with their signs. It essentially tests whether the median of the differences in a population of matched pairs is zero. Since the test incorporates more information than the sign test, it generally yields more trustable conclusions. This test also does not require the data to follow a normal distribution, but two conditions must be met for it to be applicable: (1) the data must...
94
Classification of Systems-II01:31

Classification of Systems-II

137
Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,
137

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Anlotinib in patients with medullary thyroid carcinoma with negative prognostic factors: A sub-analysis based on the ALTER01031 study.

Frontiers in oncology·2022
Same author

Enhancer reprogramming promotes the activation of cancer-associated fibroblasts and breast cancer metastasis.

Theranostics·2022
Same author

Elevated ITGA5 facilitates hyperactivated mTORC1-mediated progression of laryngeal squamous cell carcinoma via upregulation of EFNB2.

Theranostics·2022
Same author

Combined effects of zearalenone and deoxynivalenol on oxidative stress, hepatotoxicity, apoptosis, and inflammation in zebrafish embryos.

The Science of the total environment·2022
Same author

Ammonium induces oxidative stress, endoplasmic reticulum stress, and apoptosis of hepatocytes in the liver cell line of grass carp (Ctenopharyngodon idella).

Environmental science and pollution research international·2022
Same author

Properties of Grass Carp (<i>Ctenopharyngodon idella</i>) Collagen and Gel for Application in Biomaterials.

Gels (Basel, Switzerland)·2022
Same journal

DARUMA: a gateway to fast and easy prediction of intrinsically disordered regions.

PeerJ. Computer science·2026
Same journal

Alzheimer's disease detection using a quantum deep neural network with Haralick feature extraction and simulated annealing optimization.

PeerJ. Computer science·2026
Same journal

Network anomaly detection using Deep Autoencoder and parallel Artificial Bee Colony algorithm-trained neural network.

PeerJ. Computer science·2026
Same journal

An anomaly detection model for multivariate time series with anomaly perception.

PeerJ. Computer science·2026
Same journal

Retraction: A wormhole attack detection method for tactical wireless sensor networks.

PeerJ. Computer science·2026
Same journal

Evaluation of mental disorder with prioritization of its type by utilizing the bipolar complex fuzzy decision-making approach based on Schweizer-Sklar prioritized aggregation operators.

PeerJ. Computer science·2026
See all related articles

Related Experiment Video

Updated: Jun 12, 2025

Author Spotlight: Bridging Gaps in Anatomy and Establishing a Foundation for Algorithmic Studies
04:25

Author Spotlight: Bridging Gaps in Anatomy and Establishing a Foundation for Algorithmic Studies

Published on: December 15, 2023

2.3K

A pre-averaged pseudo nearest neighbor classifier.

Dapeng Li1

  • 1School of Software Engineering, Jinling Institute of Technology, Nanjing, China.

Peerj. Computer Science
|September 24, 2024
PubMed
Summary
This summary is machine-generated.

The pre-averaged pseudo nearest neighbor classifier (PAPNN) enhances classification accuracy, especially for small datasets with outliers. This method preprocesses data to mitigate outlier impact, improving classification performance.

Keywords:
Pre-averagedPseudo nearest neighborsSmall-size samples

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.9K
Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates
08:56

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

2.1K

Related Experiment Videos

Last Updated: Jun 12, 2025

Author Spotlight: Bridging Gaps in Anatomy and Establishing a Foundation for Algorithmic Studies
04:25

Author Spotlight: Bridging Gaps in Anatomy and Establishing a Foundation for Algorithmic Studies

Published on: December 15, 2023

2.3K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.9K
Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates
08:56

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

2.1K

Area of Science:

  • Machine Learning
  • Data Mining
  • Pattern Recognition

Background:

  • The k-nearest neighbor (KNN) algorithm is a widely used classification technique.
  • KNN's performance degrades with small sample sizes and the presence of outliers.
  • Existing methods struggle to effectively handle noisy data in classification tasks.

Purpose of the Study:

  • To propose a novel classifier, the pre-averaged pseudo nearest neighbor classifier (PAPNN), to improve classification performance.
  • To address the limitations of KNN in small-sized samples with outliers.
  • To reduce the negative impact of outliers on classification accuracy.

Main Methods:

  • The PAPNN rule involves calculating pre-averaged categorical vectors by averaging pairs of training data points within each class.
  • k-pseudo nearest neighbors are identified from these preprocessed vectors for classifying query points.
  • The method preprocesses training data to create robust feature representations.

Main Results:

  • Extensive experiments were conducted on nineteen numerical and three high-dimensional real datasets.
  • PAPNN was compared against twelve other classification methods.
  • The proposed PAPNN rule demonstrated effectiveness in classification tasks, particularly for small samples with outliers.

Conclusions:

  • The PAPNN classifier effectively improves classification performance in challenging datasets.
  • Pre-averaging techniques offer a viable strategy to mitigate outlier influence in KNN.
  • PAPNN shows promise for real-world applications requiring robust classification with limited or noisy data.