Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Stratified Sampling Method

Stratified Sampling Method

Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a stratified sample, divide the population into groups called strata and then take a...

Frequency-dependent Selection

Frequency-dependent Selection

When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a...

Strategies of Self-Presentation I: Strategic Self-Presentation

Strategies of Self-Presentation I: Strategic Self-Presentation

Strategic self-presentation refers to individuals' intentional efforts to influence how others perceive them. This process is employed in various social and professional settings, such as job interviews, dating, politics, and legal contexts, where individuals seek to shape impressions to gain social or material advantages. While people generally present themselves in ways that align with their authentic characteristics, external factors, such as cognitive load, can hinder their ability to...

Problem-Solving

Problem-Solving

Effective problem-solving consists of two steps: 1. identifying the problem and 2. selecting the appropriate problem-solving strategy (i.e., a plan of action used to find a solution). Humans use four problem-solving strategies:

Heuristics

Heuristics

Heuristics are problem-solving strategies that use mental shortcuts to simplify decision-making. Unlike algorithms, which must be followed precisely to achieve a correct result, heuristics offer a general problem-solving framework. They save time and energy but can sometimes lead to less rational decisions.
People often rely on heuristics when faced with an overload of information, limited time, low importance of the decision, limited information, or when a heuristic readily comes to mind. For...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Comparing Methods to Assess Treatment Effect Heterogeneity in General Parametric Regression Models.

Statistics in medicine·2026

Same author

Overview and Practical Recommendations on Using Shapley Values for Identifying Predictive Biomarkers via CATE Modeling.

Statistics in medicine·2026

Same author

Using Individualized Treatment Effects to Assess Treatment Effect Heterogeneity.

Statistics in medicine·2025

Same author

WATCH: A Workflow to Assess Treatment Effect Heterogeneity in Drug Development for Clinical Trial Sponsors.

Pharmaceutical statistics·2024

Same author

Multi-scale in vivo imaging of tumour development using a germline conditional triple-reporter system.

Research square·2024

Same author

Noninvasive Stratification of Colon Cancer by Multiplex PET Imaging.

Clinical cancer research : an official journal of the American Association for Cancer Research·2024

Same journal

Your Next State-of-the-Art Could Come from Another Domain: A Cross-Domain Analysis of Hierarchical Text Classification.

Machine learning·2026

Same journal

Linear Causal Discovery with Interventional Constraints.

Machine learning·2026

Same journal

Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models.

Machine learning·2025

Same journal

Mining exceptional social behavior on attributed interaction networks.

Machine learning·2025

Same journal

Persistent Laplacian-enhanced algorithm for scarcely labeled data classification.

Machine learning·2025

Same journal

Ensuring medical AI safety: interpretability-driven detection and mitigation of spurious model behavior and associated data.

Machine learning·2025

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 30, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Simple strategies for semi-supervised feature selection.

Konstantinos Sechidis¹, Gavin Brown¹

¹School of Computer Science, University of Manchester, Manchester, M13 9PL UK.

Machine Learning

|January 28, 2020

Summary

This summary is machine-generated.

Simple strategies for semi-supervised feature selection, assuming unlabeled data are all positive or all negative, yield powerful results. These methods, enhanced with domain knowledge, outperform complex algorithms, especially with missing labels.

Keywords:

Feature selection Positive unlabelled Semi-supervised

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

Related Experiment Videos

Last Updated: Dec 30, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

Area of Science:

Machine Learning
Data Science
Computational Statistics

Background:

Semi-supervised learning leverages both labeled and unlabeled data.
Feature selection is crucial for model efficiency and interpretability.
Existing methods often require complex assumptions or extensive labeled data.

Purpose of the Study:

To investigate the efficacy of minimalist, classifier-independent strategies for semi-supervised feature selection.
To develop novel algorithms based on simple assumptions and domain knowledge.
To evaluate performance against complex competing methods, particularly in scenarios with missing-not-at-random labels.

Main Methods:

Theoretical analysis and empirical studies of two simple strategies: assuming unlabeled data are all positive or all negative.
Utilizing hypothesis testing and feature ranking for feature selection.
Developing two novel algorithms, Semi-JMI and Semi-IAMB, by incorporating soft prior domain knowledge.

Main Results:

The simple strategies provide powerful results for feature selection.
The novel algorithms (Semi-JMI, Semi-IAMB) significantly outperform more complex methods.
Exceptional performance was observed in cases where labels are missing-not-at-random.

Conclusions:

Minimalist approaches to semi-supervised feature selection can be surprisingly effective.
These simple strategies can provably recover exact feature selection dynamics, mimicking a fully labeled dataset.
The findings suggest a paradigm shift towards simpler, more robust feature selection techniques.