Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sampling Plans01:23

Sampling Plans

187
Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...
187
Sampling Distribution01:12

Sampling Distribution

12.8K
Given simple random samples of size n from a given population with a measured characteristic such as mean, proportion, or standard deviation for each sample, the probability distribution of all the measured characteristics is called a sampling distribution. How much the statistic varies from one sample to another is known as the sampling variability of a statistic. You typically measure the sampling variability of a statistic by its standard error. The standard error of the mean is an example...
12.8K
Cluster Sampling Method01:20

Cluster Sampling Method

11.9K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
11.9K
Stratified Sampling Method01:16

Stratified Sampling Method

12.0K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a stratified sample, divide the population into groups called strata and then take a...
12.0K
Sampling Theorem01:15

Sampling Theorem

347
In signal processing, the analysis of continuous-time signals, denoted as x(t), often involves sampling techniques to convert these signals into discrete-time signals. This process is essential for digital representation and manipulation. A critical component in sampling is the train of impulses, characterized by the sampling interval and the sampling frequency. The relationship between these parameters and the original signal's properties dictates the success of the sampling process.
347
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

1.6K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
1.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

TACR3 variant confers resilience to aging and Alzheimer's disease.

medRxiv : the preprint server for health sciences·2026
Same author

Feature-weighted maximum representative subsampling.

Scientific reports·2026
Same author

Associations of the Lifestyle for Brain Health (LIBRA) index with cognitive functioning across adulthood: Variation by sex and socioeconomic status in the German National Cohort (NAKO).

Alzheimer's & dementia : the journal of the Alzheimer's Association·2026
Same author

Resting-state brain activity and association with physical activity.

Frontiers in aging neuroscience·2026
Same author

Influence of resilience on autonomic nervous system habituation to repeated stress exposure: Insights from heart rate variability and heart rate response.

Comprehensive psychoneuroendocrinology·2026
Same author

The need to increase support for healthy ageing and longevity research in the EU by establishing a Coordination and Support Programme on Healthy Ageing and Longevity.

Mechanisms of ageing and development·2026
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jul 9, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K

Discriminative machine learning for maximal representative subsampling.

Tony Hauptmann1, Sophie Fellenz2, Laksan Nathan2

  • 1Institute of Computer Science, Johannes Gutenberg University Mainz, Mainz, Germany. thauptmann@uni-mainz.de.

Scientific Reports
|November 28, 2023
PubMed
Summary
This summary is machine-generated.

Two new machine learning methods, maximum representative subsampling (MRS) and Soft-MRS, reduce bias in social science data. These techniques use representative data to adjust sample weights, improving research accuracy and downstream tasks.

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

767

Related Experiment Videos

Last Updated: Jul 9, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

767

Area of Science:

  • Social Sciences
  • Machine Learning
  • Data Science

Background:

  • Biased population samples are a significant challenge in social science research.
  • Existing methods for bias mitigation may not fully address complex sampling issues.

Purpose of the Study:

  • To introduce two novel positive-unlabeled learning methods, Maximum Representative Subsampling (MRS) and Soft-MRS, for mitigating bias in population samples.
  • To evaluate the effectiveness of MRS and Soft-MRS in correcting biased datasets and improving downstream analytical tasks.

Main Methods:

  • Developed two machine learning methods, MRS and Soft-MRS, utilizing auxiliary information from representative datasets.
  • Trained classifiers to determine sample weights, with MRS iteratively removing instances and Soft-MRS adapting sample weights.
  • Validated methods on a biased public census dataset and compared performance against existing techniques.

Main Results:

  • Both MRS and Soft-MRS demonstrated effectiveness in reducing bias in artificially created biased datasets.
  • Sample weights generated by MRS and Soft-MRS minimized differences and enhanced performance in downstream classification tasks.
  • MRS is recommended for classification tasks, while Soft-MRS is suitable for tasks where dependent variable bias is critical.

Conclusions:

  • The proposed MRS and Soft-MRS methods offer a versatile machine learning-based approach to bias reduction in social science research.
  • These methods provide practical solutions for improving the reliability and generalizability of findings from social science studies.
  • The study highlights the applicability of these techniques in real-world scenarios, such as analyzing the influence of resilience on voting behavior.