Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

13.8K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
13.8K
Stratified Sampling Method01:16

Stratified Sampling Method

14.3K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a stratified sample, divide the population into groups called strata and then take a...
14.3K
Sampling Plans01:23

Sampling Plans

802
Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...
802
Bootstrapping01:24

Bootstrapping

746
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
746
Aggregates Classification01:29

Aggregates Classification

838
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
838
Systematic Sampling Method01:17

Systematic Sampling Method

12.3K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
Systematic sampling is one of the simplest methods...
12.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A three-factor nomogram predicts the use of invasive mechanical ventilation within 72 h in preterm infants.

Frontiers in medicine·2026
Same author

Membrane protein-targeted and dual-miRNAs-activated DNA quadrilateral sensor for accurate tumor cell identification.

Talanta·2026
Same author

Electrostatic complementarity and interlayer coupling modulation in halogen-modified covalent organic frameworks for enhanced photocatalytic oxygen reduction.

Journal of colloid and interface science·2026
Same author

AI-Based Virtual Standardized Patients Improve Oral Mucosal Disease Training in Fourth-Year Dental Students.

Journal of dental education·2026
Same author

Enhancing Synthesis Efficiency in Microbial 1,5-Pentanediol Production Through Transcriptomics-Informed Metabolic Engineering of <i>Escherichia coli</i>.

Microorganisms·2026
Same author

Combining computer-aided enzyme design and chromosomal integration for plasmid-free biosynthesis of 1,5-pentanediol in Escherichia coli.

Microbial cell factories·2026
Same journal

Erratum: "Highly versatile, two-color setup for high-order harmonic generation using spatial light modulators" [Rev. Sci. Instrum. 95, 073002 (2024)].

The Review of scientific instruments·2026
Same journal

Thermal correction method for accurate performance evaluation of micro-thermoelectric coolers.

The Review of scientific instruments·2026
Same journal

Correcting the energy-dependent asymmetry in low-energy muon spin rotation.

The Review of scientific instruments·2026
Same journal

Fiber-integrated acousto-optic-modulator-based phase-controlled Rydberg atomic electrometer.

The Review of scientific instruments·2026
Same journal

A top-loading point-contact spectroscopy probe with in-situ sample exchange for dilution refrigerators.

The Review of scientific instruments·2026
Same journal

Investigation of plasma characteristics in a developed large-diameter, low-aspect ratio, radio frequency plasma source with a flat spiral antenna.

The Review of scientific instruments·2026
See all related articles

Related Experiment Video

Updated: Dec 20, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K

A cluster-based hybrid sampling approach for imbalanced data classification.

Shou Feng1, Chunhui Zhao1, Ping Fu2

  • 1College of Information and Communication Engineering, Harbin Engineering University, Harbin 150001, China.

The Review of Scientific Instruments
|June 4, 2020
PubMed
Summary
This summary is machine-generated.

A new cluster-based hybrid sampling approach (CUSS) effectively addresses imbalanced datasets in instrumental data classification. CUSS outperforms existing resampling methods, improving minority class identification.

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.2K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.8K

Related Experiment Videos

Last Updated: Dec 20, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.2K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.8K

Area of Science:

  • Machine Learning
  • Data Science
  • Pattern Recognition

Background:

  • Imbalanced datasets pose significant challenges in instrumental data classification.
  • Standard classifiers struggle with minority class identification when overwhelmed by majority instances.

Purpose of the Study:

  • To introduce a novel hybrid sampling approach, Cluster-based Under-sampling and SMOTE (CUSS), for imbalanced dataset classification.
  • To develop a new cluster-based under-sampling technique and a data distribution-based strategy for setting expected instance numbers.

Main Methods:

  • CUSS employs a hybrid data-level resampling strategy combining under-sampling and Synthetic Minority Over-sampling Technique (SMOTE).
  • A novel cluster-based under-sampling method and a dynamic instance number setting strategy are integral to CUSS.
  • The method was evaluated against five other resampling techniques across 15 diverse datasets.

Main Results:

  • CUSS demonstrated superior performance in imbalanced dataset classification compared to other state-of-the-art methods.
  • Experimental results validated the effectiveness of CUSS across datasets with varying sizes and imbalance ratios.
  • The proposed under-sampling and instance number strategies contributed to CUSS's enhanced performance.

Conclusions:

  • The Cluster-based Under-sampling and SMOTE (CUSS) approach offers a robust solution for imbalanced instrumental data classification.
  • CUSS provides a significant advancement over existing hybrid resampling methods.
  • The developed techniques enhance the ability to accurately classify minority classes in imbalanced datasets.