Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Weighted Mean

Weighted Mean

While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...

Upsampling

Upsampling

Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...

Sampling Methods: Overview

Sampling Methods: Overview

A sample refers to a smaller subset representative of a larger population. In analytical chemistry, studying or analyzing an entire population is often impractical or impossible. Therefore, samples are used to draw inferences and generalize the whole population. The sampling method selects individuals or items from a population to create a sample. Standard sampling methods include random, judgemental, systematic, stratified, and cluster sampling.
In analytical chemistry, the choice of sampling...

Sampling Distribution

Sampling Distribution

Given simple random samples of size n from a given population with a measured characteristic such as mean, proportion, or standard deviation for each sample, the probability distribution of all the measured characteristics is called a sampling distribution. How much the statistic varies from one sample to another is known as the sampling variability of a statistic. You typically measure the sampling variability of a statistic by its standard error. The standard error of the mean is an example...

Downsampling

Downsampling

When considering a sampled sequence with zero values between sampling instants, one can replace it by taking every N-th value of the sequence. At these integer multiples of N, the original and sampled sequences coincide. This process, known as decimation, involves extracting every N-th sample from a sequence, thereby creating a more efficient sequence.
The Fourier transform of the decimated sequence reveals a combination of scaled and shifted versions of the original spectrum. This...

Sampling Methods: Sample Types

Sampling Methods: Sample Types

Sampling materials are classified into three main types: solid, liquid, and gas.
Solid samples include a variety of substances, such as sediments from water bodies, soil, metals, and biological tissues. Two standard methods for extracting sediments from water bodies are grab sampling and piston coring. Grab sampling involves using a device to collect a discrete sediment sample from the bottom of a water body with minimal disturbance. Grab samples do not always represent the entire area due to...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Virus-like and Virus Replicon Particles Targeting Multiple B-Cell Antigens Do Not Protect Against African Swine Fever Virus.

Vaccines·2026

Same author

Risk stratification for renal outcomes in ANCA-associated vasculitides using established scores and histopathological criteria.

Journal of nephrology·2026

Same author

Beyond sequence similarity: ML-powered identification of pHLA off-targets for TCR-mimic antibodies using high throughput binding kinetics.

mAbs·2025

Same author

Prediction of relapses in patients with small vessel vasculitides: a multicenter cohort study on histopathological risk patterns.

Rheumatology international·2025

Same author

Echocardiographic measures read by artificial intelligence enable accurate and rapid prediction of the worsening of heart failure.

European heart journal. Digital health·2025

Same author

Effects of extreme acceleration, microgravity, and deceleration on Bacillus subtilis onboard a suborbital space flight.

NPJ microgravity·2025

Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026

Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026

Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026

Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026

Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026

Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 5, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Feature-weighted maximum representative subsampling.

Tony Hauptmann¹, Stefan Kramer²

¹Institute of Computer Science, Johannes Gutenberg University Mainz, Mainz, Germany. thauptmann@uni-mainz.de.

Scientific Reports

|June 3, 2026

Summary

This summary is machine-generated.

This study introduces feature-weighted Maximum Representative Subsampling (FW-MRS) to debias datasets by down-weighting biased features. This method improves data representativeness while preserving generalization performance for downstream tasks.

Keywords:

Debiasing Feature weights

Related Experiment Videos

Last Updated: Jun 5, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

Social Sciences
Data Science
Machine Learning

Background:

Debiasing algorithms are crucial for valid conclusions in social science studies.
Existing methods using sample weights can introduce bias into representative features when only a subset is biased.

Purpose of the Study:

To develop a novel debiasing method that minimizes the impact of highly biased features on sample weight computation.
To address the issue of introducing new biases into representative variables during the debiasing process.

Main Methods:

Developed feature-weighted Maximum Representative Subsampling (FW-MRS).
FW-MRS uses feature weights derived from domain classifier importance to down-weight biased features.
Algorithm iteratively removes elements from non-representative samples to align with representative ones.

Main Results:

FW-MRS was validated on eight artificially biased tabular datasets.
The method showed no statistically significant difference in generalization performance on downstream tasks compared to other methods.
Applied FW-MRS to a real-world social science dataset.

Conclusions:

FW-MRS effectively debiases datasets by reducing the influence of highly biased features.
The approach retains more data instances for downstream tasks without compromising generalization.
Offers a promising solution for debiasing complex datasets in social sciences.