Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Weighted Mean00:57

Weighted Mean

While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...
Upsampling01:22

Upsampling

Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...
Sampling Methods: Overview01:06

Sampling Methods: Overview

A sample refers to a smaller subset representative of a larger population. In analytical chemistry, studying or analyzing an entire population is often impractical or impossible. Therefore, samples are used to draw inferences and generalize the whole population. The sampling method selects individuals or items from a population to create a sample. Standard sampling methods include random, judgemental, systematic, stratified, and cluster sampling. 
In analytical chemistry, the choice of sampling...
Sampling Distribution01:12

Sampling Distribution

Given simple random samples of size n from a given population with a measured characteristic such as mean, proportion, or standard deviation for each sample, the probability distribution of all the measured characteristics is called a sampling distribution. How much the statistic varies from one sample to another is known as the sampling variability of a statistic. You typically measure the sampling variability of a statistic by its standard error. The standard error of the mean is an example...
Downsampling01:20

Downsampling

When considering a sampled sequence with zero values between sampling instants, one can replace it by taking every N-th value of the sequence. At these integer multiples of N, the original and sampled sequences coincide. This process, known as decimation, involves extracting every N-th sample from a sequence, thereby creating a more efficient sequence.
The Fourier transform of the decimated sequence reveals a combination of scaled and shifted versions of the original spectrum. This...
Sampling Methods: Sample Types01:18

Sampling Methods: Sample Types

Sampling materials are classified into three main types: solid, liquid, and gas.
Solid samples include a variety of substances, such as sediments from water bodies, soil, metals, and biological tissues. Two standard methods for extracting sediments from water bodies are grab sampling and piston coring. Grab sampling involves using a device to collect a discrete sediment sample from the bottom of a water body with minimal disturbance. Grab samples do not always represent the entire area due to...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Virus-like and Virus Replicon Particles Targeting Multiple B-Cell Antigens Do Not Protect Against African Swine Fever Virus.

Vaccines·2026
Same author

Risk stratification for renal outcomes in ANCA-associated vasculitides using established scores and histopathological criteria.

Journal of nephrology·2026
Same author

Beyond sequence similarity: ML-powered identification of pHLA off-targets for TCR-mimic antibodies using high throughput binding kinetics.

mAbs·2025
Same author

Prediction of relapses in patients with small vessel vasculitides: a multicenter cohort study on histopathological risk patterns.

Rheumatology international·2025
Same author

Echocardiographic measures read by artificial intelligence enable accurate and rapid prediction of the worsening of heart failure.

European heart journal. Digital health·2025
Same author

Effects of extreme acceleration, microgravity, and deceleration on Bacillus subtilis onboard a suborbital space flight.

NPJ microgravity·2025
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jun 5, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Feature-weighted maximum representative subsampling.

Tony Hauptmann1, Stefan Kramer2

  • 1Institute of Computer Science, Johannes Gutenberg University Mainz, Mainz, Germany. thauptmann@uni-mainz.de.

Scientific Reports
|June 3, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces feature-weighted Maximum Representative Subsampling (FW-MRS) to debias datasets by down-weighting biased features. This method improves data representativeness while preserving generalization performance for downstream tasks.

Keywords:
DebiasingFeature weights

Related Experiment Videos

Last Updated: Jun 5, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

  • Social Sciences
  • Data Science
  • Machine Learning

Background:

  • Debiasing algorithms are crucial for valid conclusions in social science studies.
  • Existing methods using sample weights can introduce bias into representative features when only a subset is biased.

Purpose of the Study:

  • To develop a novel debiasing method that minimizes the impact of highly biased features on sample weight computation.
  • To address the issue of introducing new biases into representative variables during the debiasing process.

Main Methods:

  • Developed feature-weighted Maximum Representative Subsampling (FW-MRS).
  • FW-MRS uses feature weights derived from domain classifier importance to down-weight biased features.
  • Algorithm iteratively removes elements from non-representative samples to align with representative ones.

Main Results:

  • FW-MRS was validated on eight artificially biased tabular datasets.
  • The method showed no statistically significant difference in generalization performance on downstream tasks compared to other methods.
  • Applied FW-MRS to a real-world social science dataset.

Conclusions:

  • FW-MRS effectively debiases datasets by reducing the influence of highly biased features.
  • The approach retains more data instances for downstream tasks without compromising generalization.
  • Offers a promising solution for debiasing complex datasets in social sciences.