Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Multiple Comparison Tests

Multiple Comparison Tests

Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...

Aggregates Classification

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

Causes of Similarity-Dissimilarity Effect

Causes of Similarity-Dissimilarity Effect

The similarity-dissimilarity effect, a fundamental concept in social psychology, explains how interpersonal similarities and differences influence attraction and social interactions. This effect is supported by three key psychological perspectives: balance theory, social comparison theory, and consensual validation.Balance Theory and Cognitive ConsistencyBalance theory, developed by Fritz Heider, posits that individuals seek cognitive consistency in their relationships. When two people share...

Wilcoxon Signed-Ranks Test for Matched Pairs

Wilcoxon Signed-Ranks Test for Matched Pairs

The Wilcoxon signed-rank test for matched pairs evaluates the null hypothesis by combining the ranks of differences with their signs. It essentially tests whether the median of the differences in a population of matched pairs is zero. Since the test incorporates more information than the sign test, it generally yields more trustable conclusions. This test also does not require the data to follow a normal distribution, but two conditions must be met for it to be applicable: (1) the data must...

How Data are Classified: Categorical Data

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...

Crossover Experiments

Crossover Experiments

Crossover experiments, also called the repeated-measurements design, is a study design in which all experimental units are exposed to all treatments in different periods. Crossover experiments are generally used in psychology, the pharmaceutical industry, agriculture, and medicine.
Crossover designs are performed even with smaller sample sizes since the samples can act as their controls. These are better than simple randomized trials since patients are exposed to all the treatments.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Characterization and probiotic potential of <i>Bifidobacterium longum</i> subsp. <i>infantis</i> JNU311 isolated from infant feces.

Food science and biotechnology·2026

Same author

Whole genome sequence analysis of <i>Bifidobacterium animalis</i> subsp. <i>lactis</i> KL101 and comparative genomics with BB12.

Journal of animal science and technology·2026

Same author

Chalinasterol attenuates ethanol-induced hepatic lipid accumulation by activating β-oxidation.

Biomedical reports·2026

Same author

Lactobacillus johnsonii JNU3402 Ameliorates Age-Related Liver Dysfunction Through Stimulating PGC-1α-Mediated SIRT1 Expression.

BioFactors (Oxford, England)·2025

Same author

Complete genome sequencing of <i>Limosilactobacillus fermentum</i> NS2301G2 isolated from kimchi.

Microbiology resource announcements·2025

Same author

<i>Lactobacillus amylovorus</i> KU4 inhibits adipocyte senescence in aged mice through necdin regulation of p53 activity.

Aging·2025

Same journal

sEEGnal: an automated EEG preprocessing pipeline evaluated against expert-driven preprocessing.

Computers in biology and medicine·2026

Same journal

Corrigendum to "Integrating experimental biology, computational methods, and artificial Intelligence in anticancer drug discovery: Bridging the translational Gap" [Comput. Biol. Med. 213 (2026) 111832].

Computers in biology and medicine·2026

Same journal

Organ dose optimization for a point-of-care forearm X-ray photon-counting CT.

Computers in biology and medicine·2026

Same journal

Physics-guided transformation of breathomic feature spaces into disease-specific representations for respiratory disease classification.

Computers in biology and medicine·2026

Same journal

An AI-driven deep learning pipeline for taxonomic classification and biodiversity assessment of deep-sea environmental DNA.

Computers in biology and medicine·2026

Same journal

Rapid personalisation of cardiovascular models using invasively measured right ventricular pressure.

Computers in biology and medicine·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 5, 2026

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

A new dataset evaluation method based on category overlap.

¹Department of Nanobiomedical Science, Dankook University, Cheonan 330-714, Republic of Korea. sejongoh@dankook.ac.kr

Computers in Biology and Medicine

|January 11, 2011

Summary

This summary is machine-generated.

We introduce a new R-value measure to evaluate dataset quality for classification tasks. High R-values indicate significant category overlap, potentially leading to lower classification accuracy and informing classifier design.

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Related Experiment Videos

Last Updated: Jun 5, 2026

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

Machine Learning
Data Science
Pattern Recognition

Background:

Dataset quality significantly impacts classification accuracy.
Existing methods for dataset evaluation are insufficient.
A robust method is needed to assess dataset characteristics.

Purpose of the Study:

To propose a novel dataset evaluation method using the R-value measure.
To quantify dataset quality based on inter-category overlap.
To provide insights for feature selection and classifier design.

Main Methods:

Developed the R-value measure based on the ratio of overlapping areas among categories.
Analyzed the relationship between R-value and classification accuracy.
Demonstrated the utility of R-value for understanding dataset properties.

Main Results:

A high R-value signifies extensive overlapping areas between categories.
Datasets with high R-values are associated with potentially lower classification accuracy.
The R-value measure effectively characterizes dataset complexity.

Conclusions:

The R-value measure offers a quantitative approach to dataset evaluation.
This method aids in identifying datasets that may challenge classification algorithms.
R-value facilitates informed decisions in feature selection and classifier development.