Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Selected Data About Geographic Locations01:25

Selected Data About Geographic Locations

278
Geographic Information Systems (GIS) rely on two core types of data: spatial data and attribute data.Spatial DataSpatial data defines the physical location of features within a coordinate system, typically expressed in terms of latitude and longitude. It provides precise positioning for elements like roads, rivers, or buildings.Attribute DataAttribute data complements spatial data by adding descriptive information about these features. For example, a road's spatial data includes its start and...
278
Antibiotic Selection00:57

Antibiotic Selection

60.0K
Overview
60.0K
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

984
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
984
Statistical Methods to Analyze Parametric Data: ANOVA01:12

Statistical Methods to Analyze Parametric Data: ANOVA

1.7K
Analysis of Variance, or ANOVA, is a powerful statistical technique used to analyze parametric data, primarily in research and experimental studies. It's designed to compare the means of two or more groups, assisting researchers in identifying any significant differences between these group means. There are two main types of ANOVA based on the complexity of the analysis: one-way and two-way.
One-way ANOVA is applied when a single independent variable or factor is scrutinized. It compares...
1.7K
What is Natural Selection?01:32

What is Natural Selection?

129.6K
Natural selection is an evolutionary process in which individuals with survival-promoting traits reproduce at higher rates. These favorable traits become more common within a population or species. Naturally selected traits initially arise via random genetic mutations. In order for selection to occur, there must be variation within a population, the trait controlling the variation must be heritable, and there must be an evolutionary advantage for variation in the trait.
129.6K
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

44.8K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
44.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Ensuring Fairness in Detecting Mild Cognitive Impairment with MRI.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2025
Same author

Enhancing clinical outcome predictions through effective sample size evaluation in graph-based digital twin modeling.

BioData mining·2025
Same author

Perceptual and technical barriers in sharing and formatting metadata accompanying omics studies.

Cell genomics·2025
Same author

Erratum: A latent transfer learning method for estimating hospital-specific post-acute healthcare demands following SARS-CoV-2 infection.

Patterns (New York, N.Y.)·2025
Same author

AI as an accelerator for defining new problems that transcends boundaries.

BioData mining·2025
Same author

Preoperative anemia is an unsuspecting driver of machine learning prediction of adverse outcomes after lumbar spinal fusion.

The spine journal : official journal of the North American Spine Society·2025
Same journal

Evaluation of temporal preservation in synthetic longitudinal patient data.

Journal of biomedical informatics·2026
Same journal

ARKE: An ontology-driven framework for automated mapping of local radiology procedure terms to the LOINC-RadLex playbook using large language model.

Journal of biomedical informatics·2026
Same journal

A validation-driven training controller for cross-lingual biomedical NER via reinforcement learning-based adaptive loss weighting.

Journal of biomedical informatics·2026
Same journal

ASP-HR: An Adaptive Spatial Perception and Hierarchical Reasoning mechanism for document-level biomedical relation extraction.

Journal of biomedical informatics·2026
Same journal

Beyond Accuracy: Safety-Centered guidelines for the evaluation of LLM-based therapy recommendation systems for chronic multimorbidity patients.

Journal of biomedical informatics·2026
Same journal

DeepEN: A deep reinforcement learning framework for personalized enteral nutrition in critical care.

Journal of biomedical informatics·2026
See all related articles

Related Experiment Video

Updated: Feb 7, 2026

Mass Spectrometry-Guided Genome Mining as a Tool to Uncover Novel Natural Products
11:13

Mass Spectrometry-Guided Genome Mining as a Tool to Uncover Novel Natural Products

Published on: March 12, 2020

11.6K

Benchmarking relief-based feature selection methods for bioinformatics data mining.

Ryan J Urbanowicz1, Randal S Olson1, Peter Schmitt1

  • 1Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA 19104, USA.

Journal of Biomedical Informatics
|July 22, 2018
PubMed
Summary
This summary is machine-generated.

Relief-Based Algorithms (RBAs) offer flexible and efficient feature selection for complex biomedical data. The study introduces MultiSURF, demonstrating reliable performance across diverse problems and data types.

Keywords:
ClassificationEpistasisFeature selectionGenetic heterogeneityRegressionReliefF

More Related Videos

Project-Based Learning Guidelines for Health Sciences Students: An Analysis with Data Mining and Qualitative Techniques
13:44

Project-Based Learning Guidelines for Health Sciences Students: An Analysis with Data Mining and Qualitative Techniques

Published on: December 9, 2022

4.5K
Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

9.2K

Related Experiment Videos

Last Updated: Feb 7, 2026

Mass Spectrometry-Guided Genome Mining as a Tool to Uncover Novel Natural Products
11:13

Mass Spectrometry-Guided Genome Mining as a Tool to Uncover Novel Natural Products

Published on: March 12, 2020

11.6K
Project-Based Learning Guidelines for Health Sciences Students: An Analysis with Data Mining and Qualitative Techniques
13:44

Project-Based Learning Guidelines for Health Sciences Students: An Analysis with Data Mining and Qualitative Techniques

Published on: December 9, 2022

4.5K
Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

9.2K

Area of Science:

  • Biomedical data mining
  • Bioinformatics
  • Machine learning

Background:

  • Modern biomedical data mining demands feature selection methods capable of handling large-scale 'omics' data, noisy conditions, and complex associations like gene-gene interactions.
  • Existing methods often struggle with the adaptability required for diverse data types (genetic variants, gene expression, clinical data) and computational tractability.

Purpose of the Study:

  • To examine and expand Relief-Based Algorithms (RBAs) for robust feature selection in biomedical data mining.
  • To introduce and evaluate a novel RBA, MultiSURF, within the open-source ReBATE framework.
  • To compare the performance of RBAs against other established feature selection methods using a comprehensive genetic simulation study.

Main Methods:

  • Implementation and expansion of Relief-Based Algorithms (RBAs) within the open-source ReBATE (Relief-Based Algorithm Training Environment) framework.
  • A comprehensive genetic simulation study comparing existing RBAs, the proposed MultiSURF algorithm, and other established feature selection methods.
  • Evaluation across diverse problem types, including classification vs. regression, discrete vs. continuous features, missing data, multiple classes, and class imbalance.

Main Results:

  • RBAs demonstrate flexibility, efficiency, and power in differentiating relevant features with univariate, multivariate, epistatic, or heterogeneous associations.
  • Expansions of RBAs confirm efficacy for various data characteristics and problem types, including handling missing data and class imbalance.
  • The study identifies limitations of specific RBAs and highlights MultiSURF as yielding the most reliable feature selection performance across a wide range of problem types, with MultiSURF* excelling at identifying 2-way interactions.

Conclusions:

  • Relief-Based Algorithms are highly effective and adaptable feature selection tools for modern biomedical data mining.
  • The ReBATE framework and the MultiSURF algorithm provide valuable advancements for analyzing complex biological datasets.
  • The findings offer guidance for selecting appropriate feature selection methods based on specific data characteristics and research objectives.