Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Stratified Sampling Method01:16

Stratified Sampling Method

14.4K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a stratified sample, divide the population into groups called strata and then take a...
14.4K
Frequency-dependent Selection01:21

Frequency-dependent Selection

23.0K
When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.
23.0K
Survival Tree01:19

Survival Tree

339
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
339
Strategies of Self-Presentation I: Strategic Self-Presentation01:12

Strategies of Self-Presentation I: Strategic Self-Presentation

159
Strategic self-presentation refers to individuals' intentional efforts to influence how others perceive them. This process is employed in various social and professional settings, such as job interviews, dating, politics, and legal contexts, where individuals seek to shape impressions to gain social or material advantages. While people generally present themselves in ways that align with their authentic characteristics, external factors, such as cognitive load, can hinder their ability to...
159
Problem-Solving01:29

Problem-Solving

438
Effective problem-solving consists of two steps: 1. identifying the problem and 2. selecting the appropriate problem-solving strategy (i.e., a plan of action used to find a solution). Humans use four problem-solving strategies:
438
Heuristics01:21

Heuristics

580
Heuristics are problem-solving strategies that use mental shortcuts to simplify decision-making. Unlike algorithms, which must be followed precisely to achieve a correct result, heuristics offer a general problem-solving framework. They save time and energy but can sometimes lead to less rational decisions.
People often rely on heuristics when faced with an overload of information, limited time, low importance of the decision, limited information, or when a heuristic readily comes to mind. For...
580

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Comparing Methods to Assess Treatment Effect Heterogeneity in General Parametric Regression Models.

Statistics in medicine·2026
Same author

Overview and Practical Recommendations on Using Shapley Values for Identifying Predictive Biomarkers via CATE Modeling.

Statistics in medicine·2026
Same author

Using Individualized Treatment Effects to Assess Treatment Effect Heterogeneity.

Statistics in medicine·2025
Same author

WATCH: A Workflow to Assess Treatment Effect Heterogeneity in Drug Development for Clinical Trial Sponsors.

Pharmaceutical statistics·2024
Same author

Multi-scale in vivo imaging of tumour development using a germline conditional triple-reporter system.

Research square·2024
Same author

Noninvasive Stratification of Colon Cancer by Multiplex PET Imaging.

Clinical cancer research : an official journal of the American Association for Cancer Research·2024
Same journal

Your Next State-of-the-Art Could Come from Another Domain: A Cross-Domain Analysis of Hierarchical Text Classification.

Machine learning·2026
Same journal

Linear Causal Discovery with Interventional Constraints.

Machine learning·2026
Same journal

Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models.

Machine learning·2025
Same journal

Mining exceptional social behavior on attributed interaction networks.

Machine learning·2025
Same journal

Persistent Laplacian-enhanced algorithm for scarcely labeled data classification.

Machine learning·2025
Same journal

Ensuring medical AI safety: interpretability-driven detection and mitigation of spurious model behavior and associated data.

Machine learning·2025
See all related articles

Related Experiment Video

Updated: Dec 30, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K

Simple strategies for semi-supervised feature selection.

Konstantinos Sechidis1, Gavin Brown1

  • 1School of Computer Science, University of Manchester, Manchester, M13 9PL UK.

Machine Learning
|January 28, 2020
PubMed
Summary
This summary is machine-generated.

Simple strategies for semi-supervised feature selection, assuming unlabeled data are all positive or all negative, yield powerful results. These methods, enhanced with domain knowledge, outperform complex algorithms, especially with missing labels.

Keywords:
Feature selectionPositive unlabelledSemi-supervised

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.3K

Related Experiment Videos

Last Updated: Dec 30, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.3K

Area of Science:

  • Machine Learning
  • Data Science
  • Computational Statistics

Background:

  • Semi-supervised learning leverages both labeled and unlabeled data.
  • Feature selection is crucial for model efficiency and interpretability.
  • Existing methods often require complex assumptions or extensive labeled data.

Purpose of the Study:

  • To investigate the efficacy of minimalist, classifier-independent strategies for semi-supervised feature selection.
  • To develop novel algorithms based on simple assumptions and domain knowledge.
  • To evaluate performance against complex competing methods, particularly in scenarios with missing-not-at-random labels.

Main Methods:

  • Theoretical analysis and empirical studies of two simple strategies: assuming unlabeled data are all positive or all negative.
  • Utilizing hypothesis testing and feature ranking for feature selection.
  • Developing two novel algorithms, Semi-JMI and Semi-IAMB, by incorporating soft prior domain knowledge.

Main Results:

  • The simple strategies provide powerful results for feature selection.
  • The novel algorithms (Semi-JMI, Semi-IAMB) significantly outperform more complex methods.
  • Exceptional performance was observed in cases where labels are missing-not-at-random.

Conclusions:

  • Minimalist approaches to semi-supervised feature selection can be surprisingly effective.
  • These simple strategies can provably recover exact feature selection dynamics, mimicking a fully labeled dataset.
  • The findings suggest a paradigm shift towards simpler, more robust feature selection techniques.