Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.6K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.6K
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

593
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
593
Outliers and Influential Points01:08

Outliers and Influential Points

6.7K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
6.7K
Decision Making: P-value Method01:09

Decision Making: P-value Method

7.2K
The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim  is also stated. These statements can act as null and alternative hypotheses:  a null hypothesis would be a neutral statement while the alternative hypothesis can...
7.2K
Frequency-dependent Selection01:21

Frequency-dependent Selection

24.5K
When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.
24.5K
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

8.9K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
8.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Dissecting the integrated information of cardiovascular and cardiorespiratory systems at rest and during physiological stress.

Physiological measurement·2026
Same author

Autonomic impairment in advanced heart-failure patients revealed by nonlinear heart rate variability measured during exercise.

Scientific reports·2026
Same author

Real-World Data on the Effectiveness of Immunotherapy on Advanced NSCLC: A Retrospective Cohort Study.

Cancers·2026
Same author

EMBC Special Issue: Investigating High-Order Behaviors in Multivariate Cardiovascular Interactions via Nonlinear Prediction and Information-Theoretic Tools.

IEEE transactions on bio-medical engineering·2026
Same author

Direct causality measures unravel complex networks of cardiovascular dynamics and their modifications with postural stress.

PLoS computational biology·2026
Same author

Quantifying coupling and causality in dynamic bivariate systems: a unified framework for time-domain, spectral, and information-theoretic analysis.

Frontiers in network physiology·2026
Same journal

An Ultra-Lightweight Cross-scale Attention Mamba Network for Accurate Skin Lesion Segmentation.

IEEE journal of biomedical and health informatics·2026
Same journal

Explanation-Guided Reconstruction of Missing Clinical Features for Survival Prediction in Pancreatic Cancer.

IEEE journal of biomedical and health informatics·2026
Same journal

stDGCN: A dual-augmentation graph convolutional network for identifying spatial domains with attention mechanism.

IEEE journal of biomedical and health informatics·2026
Same journal

Patient-specific Biomechanical Investigation of Percutaneous Pulmonary Valves: Towards the Integration of Routinely Acquired Clinical Data and Fluid-structure Interaction Simulations.

IEEE journal of biomedical and health informatics·2026
Same journal

Cross-subject fMRI-to-Image with Visual-cortex 2D Representation and Pre-Training.

IEEE journal of biomedical and health informatics·2026
Same journal

PGCASurv: A Prior-Guided Cross-Attention Framework for Dynamic Survival Model with Longitudinal Data.

IEEE journal of biomedical and health informatics·2026
See all related articles

Related Experiment Video

Updated: Apr 2, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

A Principled and Data-efficient Information-theoretic Method for Feature Selection.

Marta Iovino, Ivan Lazic, Chiara Bara

    IEEE Journal of Biomedical and Health Informatics
    |March 31, 2026
    PubMed
    Summary
    This summary is machine-generated.

    This study presents kCMI-FS, a new feature selection method for mixed data types. It effectively identifies important features for better classification and interpretability in biomedical analysis.

    More Related Videos

    A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
    12:18

    A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

    Published on: January 11, 2020

    8.2K
    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
    03:37

    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

    Published on: March 1, 2024

    1.5K

    Related Experiment Videos

    Last Updated: Apr 2, 2026

    Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
    07:35

    Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

    Published on: October 11, 2018

    8.1K
    A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
    12:18

    A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

    Published on: January 11, 2020

    8.2K
    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
    03:37

    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

    Published on: March 1, 2024

    1.5K

    Area of Science:

    • Machine Learning
    • Bioinformatics
    • Data Science

    Background:

    • Traditional feature selection methods may miss complex dependencies in mixed-type data.
    • Handling continuous and discrete variables simultaneously presents a significant challenge in feature selection.
    • Existing approaches can overlook feature redundancy and higher-order dependencies.

    Purpose of the Study:

    • Introduce kCMI-FS, a novel feature selection method designed for mixed-type data.
    • Evaluate the performance of kCMI-FS against existing methods on various datasets.
    • Demonstrate the utility of kCMI-FS in enhancing classification accuracy and model interpretability in biomedical applications.

    Main Methods:

    • Utilized Conditional Mutual Information (CMI) estimated via a k-nearest neighbour (kNN) strategy.
    • Implemented a significance-based forward selection process to identify informative and non-redundant features.
    • Assessed performance on theoretical simulations, synthetic datasets, and biomedical benchmark datasets.

    Main Results:

    • kCMI-FS consistently recovered relevant features in structured scenarios.
    • The method matched or outperformed existing approaches, especially in mixed-variable and high-dimensional settings.
    • Classification experiments showed kCMI-FS achieved strong predictive performance with reduced feature sets, improving interpretability.

    Conclusions:

    • kCMI-FS is a robust feature selection method for mixed-type data, particularly effective in high-dimensional and complex scenarios.
    • The method enhances model interpretability and predictive accuracy in biomedical data analysis.
    • kCMI-FS shows significant potential for supporting early diagnosis and clinical decision-making through effective feature selection.