Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Comparison Tests01:13

Multiple Comparison Tests

Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...
Aggregates Classification01:29

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
Causes of Similarity-Dissimilarity Effect01:26

Causes of Similarity-Dissimilarity Effect

The similarity-dissimilarity effect, a fundamental concept in social psychology, explains how interpersonal similarities and differences influence attraction and social interactions. This effect is supported by three key psychological perspectives: balance theory, social comparison theory, and consensual validation.Balance Theory and Cognitive ConsistencyBalance theory, developed by Fritz Heider, posits that individuals seek cognitive consistency in their relationships. When two people share...
Wilcoxon Signed-Ranks Test for Matched Pairs01:09

Wilcoxon Signed-Ranks Test for Matched Pairs

The Wilcoxon signed-rank test for matched pairs evaluates the null hypothesis by combining the ranks of differences with their signs. It essentially tests whether the median of the differences in a population of matched pairs is zero. Since the test incorporates more information than the sign test, it generally yields more trustable conclusions. This test also does not require the data to follow a normal distribution, but two conditions must be met for it to be applicable: (1) the data must...
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
Crossover Experiments01:16

Crossover Experiments

Crossover experiments, also called the repeated-measurements design, is a study design in which all experimental units are exposed to all treatments in different periods. Crossover experiments are generally used in psychology, the pharmaceutical industry, agriculture, and medicine.
Crossover designs are performed even with smaller sample sizes since the samples can act as their controls. These are better than simple randomized trials since patients are exposed to all the treatments.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Characterization and probiotic potential of <i>Bifidobacterium longum</i> subsp. <i>infantis</i> JNU311 isolated from infant feces.

Food science and biotechnology·2026
Same author

Whole genome sequence analysis of <i>Bifidobacterium animalis</i> subsp. <i>lactis</i> KL101 and comparative genomics with BB12.

Journal of animal science and technology·2026
Same author

Chalinasterol attenuates ethanol-induced hepatic lipid accumulation by activating β-oxidation.

Biomedical reports·2026
Same author

Lactobacillus johnsonii JNU3402 Ameliorates Age-Related Liver Dysfunction Through Stimulating PGC-1α-Mediated SIRT1 Expression.

BioFactors (Oxford, England)·2025
Same author

Complete genome sequencing of <i>Limosilactobacillus fermentum</i> NS2301G2 isolated from kimchi.

Microbiology resource announcements·2025
Same author

<i>Lactobacillus amylovorus</i> KU4 inhibits adipocyte senescence in aged mice through necdin regulation of p53 activity.

Aging·2025
Same journal

sEEGnal: an automated EEG preprocessing pipeline evaluated against expert-driven preprocessing.

Computers in biology and medicine·2026
Same journal

Corrigendum to "Integrating experimental biology, computational methods, and artificial Intelligence in anticancer drug discovery: Bridging the translational Gap" [Comput. Biol. Med. 213 (2026) 111832].

Computers in biology and medicine·2026
Same journal

Organ dose optimization for a point-of-care forearm X-ray photon-counting CT.

Computers in biology and medicine·2026
Same journal

Physics-guided transformation of breathomic feature spaces into disease-specific representations for respiratory disease classification.

Computers in biology and medicine·2026
Same journal

An AI-driven deep learning pipeline for taxonomic classification and biodiversity assessment of deep-sea environmental DNA.

Computers in biology and medicine·2026
Same journal

Rapid personalisation of cardiovascular models using invasively measured right ventricular pressure.

Computers in biology and medicine·2026
See all related articles

Related Experiment Video

Updated: Jun 5, 2026

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

A new dataset evaluation method based on category overlap.

Sejong Oh1

  • 1Department of Nanobiomedical Science, Dankook University, Cheonan 330-714, Republic of Korea. sejongoh@dankook.ac.kr

Computers in Biology and Medicine
|January 11, 2011
PubMed
Summary
This summary is machine-generated.

We introduce a new R-value measure to evaluate dataset quality for classification tasks. High R-values indicate significant category overlap, potentially leading to lower classification accuracy and informing classifier design.

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Related Experiment Videos

Last Updated: Jun 5, 2026

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

  • Machine Learning
  • Data Science
  • Pattern Recognition

Background:

  • Dataset quality significantly impacts classification accuracy.
  • Existing methods for dataset evaluation are insufficient.
  • A robust method is needed to assess dataset characteristics.

Purpose of the Study:

  • To propose a novel dataset evaluation method using the R-value measure.
  • To quantify dataset quality based on inter-category overlap.
  • To provide insights for feature selection and classifier design.

Main Methods:

  • Developed the R-value measure based on the ratio of overlapping areas among categories.
  • Analyzed the relationship between R-value and classification accuracy.
  • Demonstrated the utility of R-value for understanding dataset properties.

Main Results:

  • A high R-value signifies extensive overlapping areas between categories.
  • Datasets with high R-values are associated with potentially lower classification accuracy.
  • The R-value measure effectively characterizes dataset complexity.

Conclusions:

  • The R-value measure offers a quantitative approach to dataset evaluation.
  • This method aids in identifying datasets that may challenge classification algorithms.
  • R-value facilitates informed decisions in feature selection and classifier development.