Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

2.9K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
2.9K
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

4.8K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
4.8K
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

6.6K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
6.6K
Trimmed Mean01:10

Trimmed Mean

3.1K
While measuring the mean of a data set, care needs to be taken when associating the mean to its central tendency. The same goes for the arithmetic mean, the geometric mean, or the harmonic mean. This is because the presence of a single outlier data value can significantly affect the mean. That is, the mean is sensitive to fluctuations in the data set.
Although certain measures of central tendency are not sensitive to outliers, there are alternative versions of the mean that get around the...
3.1K
Statistical Analysis: Overview01:11

Statistical Analysis: Overview

11.3K
When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
11.3K
Measures of Central Tendency02:16

Measures of Central Tendency

18.3K
The "center" of a data set is also a way of describing location. The two most widely used measures of the "center" of the data are the mean (average) and the median. The words "mean" and "average" are often used interchangeably. The substitution of one word for the other is common practice. The technical term is "arithmetic mean" and "average" is technically a center location. However, in practice among non-statisticians,...
18.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Prevalence of Early Rheumatic Heart Disease Among Asymptomatic Students in Underserved Communities in Ethiopia: Cross-Sectional Observational Study.

JMIR public health and surveillance·2026
Same author

A capacitive-piezoelectric hybrid MEMS microphone with signal fusion for enhancing signal-to-noise ratio.

Microsystems & nanoengineering·2026
Same author

EEG-based decoding of auditory attention to conversations with turn-taking speakers.

Hearing research·2026
Same author

Temporal variation in the acoustic dynamic range is a confounding factor in EEG-based tracking of absolute auditory attention to speech.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Synchronized EEG with two galvanically-separated miniature wireless behind-the-ear EEG sensors.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Effects of speaker and listener sex on auditory attention decoding performance.

Hearing research·2025
Same journal

DARUMA: a gateway to fast and easy prediction of intrinsically disordered regions.

PeerJ. Computer science·2026
Same journal

Alzheimer's disease detection using a quantum deep neural network with Haralick feature extraction and simulated annealing optimization.

PeerJ. Computer science·2026
Same journal

Network anomaly detection using Deep Autoencoder and parallel Artificial Bee Colony algorithm-trained neural network.

PeerJ. Computer science·2026
Same journal

An anomaly detection model for multivariate time series with anomaly perception.

PeerJ. Computer science·2026
Same journal

Retraction: A wormhole attack detection method for tactical wireless sensor networks.

PeerJ. Computer science·2026
Same journal

Evaluation of mental disorder with prioritization of its type by utilizing the bipolar complex fuzzy decision-making approach based on Schweizer-Sklar prioritized aggregation operators.

PeerJ. Computer science·2026
See all related articles

Related Experiment Video

Updated: Nov 5, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.8K

Utility metric for unsupervised feature selection.

Amalia Villa1,2, Abhijith Mundanad Narayanan1,2, Sabine Van Huffel1,2

  • 1STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, Department of Electrical Engineering (ESAT), KU Leuven, Leuven, Belgium.

Peerj. Computer Science
|May 13, 2021
PubMed
Summary
This summary is machine-generated.

A new unsupervised feature selection algorithm, U2FS, offers state-of-the-art performance with reduced computational cost. This ready-to-use method requires no parameter tuning, making feature selection more accessible for high-dimensional data analysis.

Keywords:
Dimensionality reductionKernel methodsManifold learningUnsupervised feature selection

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.0K
Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

1.7K

Related Experiment Videos

Last Updated: Nov 5, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.8K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.0K
Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

1.7K

Area of Science:

  • Data Science
  • Machine Learning
  • Computational Statistics

Background:

  • Feature selection is crucial for dimensionality reduction and data interpretability.
  • Unsupervised feature selectors are necessary for data lacking annotations.
  • Existing unsupervised methods are often complex, computationally expensive, and require parameter tuning.

Purpose of the Study:

  • To propose a publicly available, ready-to-use unsupervised feature selector.
  • To achieve comparable results to state-of-the-art methods with lower computational cost.
  • To eliminate the need for parameter tuning in unsupervised feature selection.

Main Methods:

  • The study proposes the U2FS algorithm, a spectral feature selection method.
  • It involves manifold learning using a radial basis function (RBF) kernel with an alternative parameter estimation for high-dimensional data.
  • Subset selection is performed using a backwards greedy approach with a least-squares utility metric.

Main Results:

  • U2FS successfully selects relevant features in simulation environments.
  • Performance on benchmark datasets is comparable to existing state-of-the-art methods.
  • U2FS demonstrates significantly lower computational time compared to other methods.

Conclusions:

  • U2FS provides an effective and efficient unsupervised feature selection solution.
  • The algorithm is accessible due to its ready-to-use nature and lack of parameter tuning.
  • U2FS advances spectral feature selection techniques for high-dimensional data analysis.