Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

489
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
489
Statistical Significance01:50

Statistical Significance

21.9K
Once data is collected from both the experimental and the control groups, a statistical analysis is conducted to find out if there are meaningful differences between the two groups. A statistical analysis determines how likely any difference found is due to chance (and thus not meaningful). In psychology, group differences are considered meaningful, or significant, if the odds that these differences occurred by chance alone are 5 percent or less. Stated another way, if we repeated this...
21.9K
Probability in Statistics01:14

Probability in Statistics

23.4K
Probability is the likelihood of an event occurring. The term event is defined as a collection of results of a procedure. An event is a simple event when an outcome cannot be divided into simpler parts.
An example of a simple event is a coin toss. The result of a coin toss is either a head or a tail. Here, head and tail are two simple events. These two simple events make up the sample space. Further, the probability of an event occurring falls within the range of 0 to 1. The probability of an...
23.4K
Introduction to Statistics01:17

Introduction to Statistics

64.0K
The science of statistics involves collecting, analyzing, interpreting, and presenting data. The method of collecting, organizing, and summarizing data is called descriptive statistics. The systematic method of drawing inferences from the sample data and predicting unknown characteristics of a population is called inferential statistics.
In statistics, the collection of individuals or objects under study is called population. The idea of sampling is to select a portion of the larger population...
64.0K
Antibiotic Selection00:57

Antibiotic Selection

60.0K
Overview
60.0K
Theory of Attribution I: Correspondent Inference Theory01:15

Theory of Attribution I: Correspondent Inference Theory

575
Correspondent inference theory, proposed by Jones and Davis in 1965, seeks to explain how individuals infer stable personality traits from observed behaviors. It suggests that people attribute actions to underlying dispositions rather than external circumstances, particularly when the behavior appears intentional and socially significant.Voluntary Behavior and Dispositional AttributionAccording to this theory, individuals are more likely to attribute behavior to personal traits when it appears...
575

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multivariate Optimization of k for k-Nearest-Neighbor Feature Selection With Dichotomous Outcomes: Complex Associations, Class Imbalance, and Application to RNA-Seq in Major Depressive Disorder.

IEEE transactions on computational biology and bioinformatics·2025
Same author

Gene age gap estimate (GAGE) for major depressive disorder: A penalized biological age model using gene expression.

Neurobiology of aging·2025
Same author

Nuclear factor kappa-B cell (NF-κB), interferon regulatory Factor, and glucocorticoid receptor pathway activation in major depressive Disorder: The role of cytomegalovirus infection.

Brain, behavior, and immunity·2024
Same author

Gene Age Gap Estimate (GAGE) for major depressive disorder: a penalized biological age model using gene expression.

bioRxiv : the preprint server for biology·2024
Same author

Exploring the role of neuronal-enriched extracellular vesicle miR-93 and interoception in major depressive disorder.

Translational psychiatry·2024
Same author

ChatGPT and large language models in academia: opportunities and challenges.

BioData mining·2023
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Feb 5, 2026

Author Spotlight: An Accurate and Quantitative Approach to Study Visual Feature Selectivity of the Optokinetic Reflex in Mice
09:28

Author Spotlight: An Accurate and Quantitative Approach to Study Visual Feature Selectivity of the Optokinetic Reflex in Mice

Published on: June 23, 2023

3.8K

STatistical Inference Relief (STIR) feature selection.

Trang T Le1, Ryan J Urbanowicz1, Jason H Moore1

  • 1Department of Biostatistics, Epidemiology and Informatics, Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, USA.

Bioinformatics (Oxford, England)
|September 22, 2018
PubMed
Summary
This summary is machine-generated.

We introduce STatistical Inference Relief (STIR), a novel machine learning approach for feature selection in high-dimensional data. STIR enhances Relief algorithms by incorporating nearest neighbor distance variance, enabling statistical significance testing and improved identification of gene interactions.

More Related Videos

Genotypic Inference of HIV-1 Tropism Using Population-based Sequencing of V3
11:10

Genotypic Inference of HIV-1 Tropism Using Population-based Sequencing of V3

Published on: December 27, 2010

12.8K
Quantitative Optical Microscopy: Measurement of Cellular Biophysical Features with a Standard Optical Microscope
14:09

Quantitative Optical Microscopy: Measurement of Cellular Biophysical Features with a Standard Optical Microscope

Published on: April 7, 2014

16.2K

Related Experiment Videos

Last Updated: Feb 5, 2026

Author Spotlight: An Accurate and Quantitative Approach to Study Visual Feature Selectivity of the Optokinetic Reflex in Mice
09:28

Author Spotlight: An Accurate and Quantitative Approach to Study Visual Feature Selectivity of the Optokinetic Reflex in Mice

Published on: June 23, 2023

3.8K
Genotypic Inference of HIV-1 Tropism Using Population-based Sequencing of V3
11:10

Genotypic Inference of HIV-1 Tropism Using Population-based Sequencing of V3

Published on: December 27, 2010

12.8K
Quantitative Optical Microscopy: Measurement of Cellular Biophysical Features with a Standard Optical Microscope
14:09

Quantitative Optical Microscopy: Measurement of Cellular Biophysical Features with a Standard Optical Microscope

Published on: April 7, 2014

16.2K

Area of Science:

  • Machine Learning
  • Bioinformatics
  • Statistical Genetics

Background:

  • Relief algorithms are effective for feature selection in high-dimensional data, identifying features associated with outcomes possibly due to epistasis or interactions.
  • However, Relief estimators are non-parametric, lacking a formal statistical inference framework to determine the significance of attribute estimates.
  • This necessitates a method to avoid arbitrary thresholds and rigorously select important features, especially in complex biological datasets.

Purpose of the Study:

  • To reconceptualize Relief-based feature selection by developing a new family of STatistical Inference Relief (STIR) estimators.
  • To incorporate sample variance of nearest neighbor distances into attribute importance estimation to enable statistical significance calculation.
  • To provide a statistical inferential formalism for Relief-based scores, including adjustment for multiple testing and application to case-control data.

Main Methods:

  • Developed STatistical Inference Relief (STIR) estimators, a novel family of algorithms building upon Relief.
  • Incorporated sample variance of nearest neighbor distances into the attribute importance estimation process.
  • Developed a pseudo t-test version of Relief-based algorithms for case-control data analysis.

Main Results:

  • Demonstrated the statistical power and type I error control of STIR on simulated data mimicking gene expression patterns, including main and network interaction effects.
  • Compared the performance of STIR using adaptive radius versus fixed-k nearest neighbor constructors.
  • Applied STIR to real RNA-Seq data from a major depressive disorder study, showing its utility in analyzing complex biological data.

Conclusions:

  • STIR provides a statistically rigorous framework for feature selection using Relief-based methods, retaining the ability to identify interactions.
  • The method allows for the calculation of statistical significance and adjustment for multiple testing, overcoming limitations of traditional Relief algorithms.
  • STIR shows promise for applications in genetic association studies and analysis of complex diseases like major depressive disorder.