Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sampling Plans01:23

Sampling Plans

282
Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...
282
Sampling Distribution01:12

Sampling Distribution

13.6K
Given simple random samples of size n from a given population with a measured characteristic such as mean, proportion, or standard deviation for each sample, the probability distribution of all the measured characteristics is called a sampling distribution. How much the statistic varies from one sample to another is known as the sampling variability of a statistic. You typically measure the sampling variability of a statistic by its standard error. The standard error of the mean is an example...
13.6K
Sampling Methods: Overview01:06

Sampling Methods: Overview

532
A sample refers to a smaller subset representative of a larger population. In analytical chemistry, studying or analyzing an entire population is often impractical or impossible. Therefore, samples are used to draw inferences and generalize the whole population. The sampling method selects individuals or items from a population to create a sample. Standard sampling methods include random, judgemental, systematic, stratified, and cluster sampling. 
In analytical chemistry, the choice of...
532
Cluster Sampling Method01:20

Cluster Sampling Method

12.8K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
12.8K
Random Sampling Method01:09

Random Sampling Method

12.4K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
12.4K
Sampling Methods: Sample Types01:18

Sampling Methods: Sample Types

443
Sampling materials are classified into three main types: solid, liquid, and gas.
Solid samples include a variety of substances, such as sediments from water bodies, soil, metals, and biological tissues. Two standard methods for extracting sediments from water bodies are grab sampling and piston coring. Grab sampling involves using a device to collect a discrete sediment sample from the bottom of a water body with minimal disturbance. Grab samples do not always represent the entire area due to...
443

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Analysis of changes in the spatiotemporal characteristics of impervious surfaces and their influencing factors in the Central Plains Urban Agglomeration of China from 2000 to 2018.

Heliyon·2023
Same author

Nanoparticle-mediated synergistic anticancer effect of ferroptosis and photodynamic therapy: Novel insights and perspectives.

Asian journal of pharmaceutical sciences·2023
Same author

Review on Processing Methods of Toxic Chinese Materia Medica and the Related Mechanisms of Action.

The American journal of Chinese medicine·2023
Same author

Pro‑angiogenic activity of salvianolate and its potential therapeutic effect against acute cerebral ischemia.

Experimental and therapeutic medicine·2023
Same author

CXCL12-CXCR4/CXCR7 Axis in Cancer: from Mechanisms to Clinical Applications.

International journal of biological sciences·2023
Same author

Boron-Modulated Electronic-Configuration Tuning of Cobalt for Enhanced Nitric Oxide Fixation to Ammonia.

Nano letters·2023
Same journal

Distributionally Robust Feature Selection.

Advances in neural information processing systems·2026
Same journal

On the Identifiability of Hybrid Deep Generative Models: Meta-Learning as a Solution.

Advances in neural information processing systems·2026
Same journal

Unlocking hidden biomolecular conformational landscapes in diffusion models at inference time.

Advances in neural information processing systems·2026
Same journal

JADE: Joint Alignment and Deep Embedding for Multi-Slice Spatial Transcriptomics.

Advances in neural information processing systems·2026
Same journal

Learning to Route: Per-Sample Adaptive Routing for Multimodal Multitask Prediction.

Advances in neural information processing systems·2026
Same journal

Emergence and Evolution of Interpretable Concepts in Diffusion Models.

Advances in neural information processing systems·2026
See all related articles

Related Experiment Video

Updated: Sep 16, 2025

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.3K

Scale-invariant Optimal Sampling for Rare-events Data and Sparse Models.

Jing Wang1, HaiYing Wang1, Hao Helen Zhang2

  • 1Department of Statistics, University of Connecticut, Storrs, CT 06269.

Advances in Neural Information Processing Systems
|July 11, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a scale-invariant optimal subsampling method for massive datasets with rare events. It minimizes prediction error, improving efficiency and addressing issues with inactive features in sparse models.

More Related Videos

A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions
07:34

A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions

Published on: March 25, 2014

10.0K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

914

Related Experiment Videos

Last Updated: Sep 16, 2025

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.3K
A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions
07:34

A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions

Published on: March 25, 2014

10.0K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

914

Area of Science:

  • Statistics
  • Machine Learning
  • Data Science

Background:

  • Subsampling addresses computational challenges in massive datasets, especially those with rare events.
  • Optimal subsampling is crucial to prevent information loss from overly aggressive sampling.
  • Existing methods are sensitive to data scaling and can be negatively impacted by inactive features.

Purpose of the Study:

  • To develop a scale-invariant optimal subsampling method for sparse models with rare events.
  • To minimize prediction error rather than model parameters.
  • To address the challenge of inactive features inflating subsampling probabilities.

Main Methods:

  • Introduced an adaptive lasso estimator for rare-events data, establishing its oracle properties.
  • Derived a scale-invariant optimal subsampling function to minimize prediction error for inverse probability weighted (IPW) adaptive lasso.
  • Proposed a maximum sampled conditional likelihood (MSCL) estimator to enhance efficiency.

Main Results:

  • The proposed scale-invariant subsampling method effectively mitigates information loss and improves estimation efficiency.
  • The adaptive lasso estimator demonstrates oracle properties for rare-events data.
  • Numerical experiments confirm the performance of the developed methods on simulated and real-world data.

Conclusions:

  • The novel scale-invariant subsampling approach provides a robust solution for analyzing massive datasets with rare events and inactive features.
  • The methods enhance prediction accuracy and estimation efficiency in sparse modeling contexts.
  • This work offers theoretical guarantees and practical validation for improved subsampling techniques.