Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Random Sampling Method01:09

Random Sampling Method

15.1K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
15.1K
Randomized Experiments01:13

Randomized Experiments

9.1K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
9.1K
Wald-Wolfowitz Runs Test I01:17

Wald-Wolfowitz Runs Test I

978
The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.
The test works...
978
Frequency-dependent Selection01:21

Frequency-dependent Selection

24.3K
When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.
24.3K
Random Variables01:09

Random Variables

18.0K
A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
18.0K
Bootstrapping01:24

Bootstrapping

841
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
841

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Transcription factor collaboration enables precise T cell state engineering.

bioRxiv : the preprint server for biology·2026
Same author

Dango: Predicting higher-order genetic interactions.

Cell systems·2026
Same author

HeFS: helper-enhanced feature selection via Pareto-optimized genetic search.

Journal of advanced research·2026
Same author

Constrained Diffusion as a Paradigm for Evolution.

bioRxiv : the preprint server for biology·2026
Same author

Epigenetic memory of colitis promotes tumour growth.

Nature·2026
Same author

An integrated view of the structure and function of the human 4D nucleome.

Nature·2025
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Feb 21, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

RIFS: a randomly restarted incremental feature selection algorithm.

Yuting Ye1, Ruochi Zhang1, Weiwei Zheng1

  • 1College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, Jilin, 130012, China.

Scientific Reports
|October 14, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces Randomly re-started Incremental Feature Selection (RIFS) for big data challenges in biomedical research. RIFS improves classification accuracy and reduces feature numbers, outperforming existing methods for prostate cancer diagnosis.

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

8.1K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K

Related Experiment Videos

Last Updated: Feb 21, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

8.1K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K

Area of Science:

  • Biomedical informatics
  • Machine learning
  • Bioinformatics

Background:

  • The big data era presents computational challenges in machine learning, particularly in biomedical OMIC research.
  • Biomedical research faces a "large p small n" paradigm due to high data production costs and recruitment difficulties.
  • Feature selection is crucial for reducing high-dimensional biomedical data to achieve stable classification or regression models.

Purpose of the Study:

  • To address the limitations of traditional Incremental Feature Selection (IFS) in handling "large p small n" data.
  • To develop a novel feature selection algorithm that can identify effective feature subsets ranked low by standard statistical methods.
  • To improve classification accuracy and reduce the number of features required for biomedical data analysis.

Main Methods:

  • A modified Incremental Feature Selection (IFS) strategy was developed, termed Randomly re-started Incremental Feature Selection (RIFS).
  • RIFS randomly alters the initial feature selection step of IFS to explore diverse feature subsets.
  • The algorithm was evaluated on its ability to achieve high classification performance with a minimal feature set.

Main Results:

  • The proposed RIFS algorithm demonstrated superior classification accuracy compared to existing feature selection methods.
  • RIFS achieved a smaller feature subset size while maintaining or improving classification performance.
  • RIFS outperformed the current methylomic diagnosis model for prostate malignancy, showing higher accuracy and fewer transcriptomic features.

Conclusions:

  • Randomly re-started Incremental Feature Selection (RIFS) offers an effective approach to handle "large p small n" data in biomedical research.
  • RIFS can identify powerful feature combinations that might be overlooked by conventional statistical association algorithms.
  • The RIFS algorithm provides a more accurate and parsimonious diagnostic model for diseases like prostate cancer.