Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Sampling Plans

Sampling Plans

Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...

Sampling Distribution

Sampling Distribution

Given simple random samples of size n from a given population with a measured characteristic such as mean, proportion, or standard deviation for each sample, the probability distribution of all the measured characteristics is called a sampling distribution. How much the statistic varies from one sample to another is known as the sampling variability of a statistic. You typically measure the sampling variability of a statistic by its standard error. The standard error of the mean is an example...

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Stratified Sampling Method

Stratified Sampling Method

Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a stratified sample, divide the population into groups called strata and then take a...

Sampling Theorem

Sampling Theorem

In signal processing, the analysis of continuous-time signals, denoted as x(t), often involves sampling techniques to convert these signals into discrete-time signals. This process is essential for digital representation and manipulation. A critical component in sampling is the train of impulses, characterized by the sampling interval and the sampling frequency. The relationship between these parameters and the original signal's properties dictates the success of the sampling process.

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

TACR3 variant confers resilience to aging and Alzheimer's disease.

medRxiv : the preprint server for health sciences·2026

Same author

Feature-weighted maximum representative subsampling.

Scientific reports·2026

Same author

Associations of the Lifestyle for Brain Health (LIBRA) index with cognitive functioning across adulthood: Variation by sex and socioeconomic status in the German National Cohort (NAKO).

Alzheimer's & dementia : the journal of the Alzheimer's Association·2026

Same author

Resting-state brain activity and association with physical activity.

Frontiers in aging neuroscience·2026

Same author

Influence of resilience on autonomic nervous system habituation to repeated stress exposure: Insights from heart rate variability and heart rate response.

Comprehensive psychoneuroendocrinology·2026

Same author

The need to increase support for healthy ageing and longevity research in the EU by establishing a Coordination and Support Programme on Healthy Ageing and Longevity.

Mechanisms of ageing and development·2026

Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026

Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026

Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026

Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026

Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026

Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 9, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Discriminative machine learning for maximal representative subsampling.

Tony Hauptmann¹, Sophie Fellenz², Laksan Nathan²

¹Institute of Computer Science, Johannes Gutenberg University Mainz, Mainz, Germany. thauptmann@uni-mainz.de.

Scientific Reports

|November 28, 2023

Summary

This summary is machine-generated.

Two new machine learning methods, maximum representative subsampling (MRS) and Soft-MRS, reduce bias in social science data. These techniques use representative data to adjust sample weights, improving research accuracy and downstream tasks.

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Related Experiment Videos

Last Updated: Jul 9, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Area of Science:

Social Sciences
Machine Learning
Data Science

Background:

Biased population samples are a significant challenge in social science research.
Existing methods for bias mitigation may not fully address complex sampling issues.

Purpose of the Study:

To introduce two novel positive-unlabeled learning methods, Maximum Representative Subsampling (MRS) and Soft-MRS, for mitigating bias in population samples.
To evaluate the effectiveness of MRS and Soft-MRS in correcting biased datasets and improving downstream analytical tasks.

Main Methods:

Developed two machine learning methods, MRS and Soft-MRS, utilizing auxiliary information from representative datasets.
Trained classifiers to determine sample weights, with MRS iteratively removing instances and Soft-MRS adapting sample weights.
Validated methods on a biased public census dataset and compared performance against existing techniques.

Main Results:

Both MRS and Soft-MRS demonstrated effectiveness in reducing bias in artificially created biased datasets.
Sample weights generated by MRS and Soft-MRS minimized differences and enhanced performance in downstream classification tasks.
MRS is recommended for classification tasks, while Soft-MRS is suitable for tasks where dependent variable bias is critical.

Conclusions:

The proposed MRS and Soft-MRS methods offer a versatile machine learning-based approach to bias reduction in social science research.
These methods provide practical solutions for improving the reliability and generalizability of findings from social science studies.
The study highlights the applicability of these techniques in real-world scenarios, such as analyzing the influence of resilience on voting behavior.