Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Distributions to Estimate Population Parameter

Distributions to Estimate Population Parameter

The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...

Estimating Population Mean with Unknown Standard Deviation

Estimating Population Mean with Unknown Standard Deviation

In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the...

What are Estimates?

What are Estimates?

It isn't easy to measure a parameter such as the mean height or the mean weight of a population. So, we draw samples from the population and calculate the mean height or mean weight of the individuals in the sample. This sample data acts as a representative measure of the population parameter. These sample statistics are known as estimates.
The estimate for the mean of a sample is denoted by ͞x, whereas the mean of the population is designated as μ. Further, parameters such...

Testing a Claim about Population Proportion

Testing a Claim about Population Proportion

A complete procedure for testing a claim about a population proportion is provided here.
There are two methods of testing a claim about a population proportion: (1) Using the sample proportion from the data where a binomial distribution is approximated to the normal distribution and (2) Using the binomial probabilities calculated from the data.
The first method uses normal distribution as an approximation to the binomial distribution. The requirements are as follows: sample size is large...

Estimating Population Mean with Known Standard Deviation

Estimating Population Mean with Known Standard Deviation

To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate +...

Sample Proportion and Population Proportion

Sample Proportion and Population Proportion

Collecting samples or responses from an entire population takes significant time and effort, so a researcher collects responses from only a sample of that population. Suppose a study needs to collect information about a specific mobile application. After sample collection, the researcher analyzes the data and discovers that most individuals in the sample use that specific mobile application. The sample proportion measures the number of individuals in a sample who either use or don't use the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Functional principal component analysis forsparse censored data.

Biometrika·2026

Same author

Impact of Daylight Saving Time on Physical Activity Patterns.

Nature health·2026

Same author

Data Fusion for Partial Identification of Causal Effects.

Advances in neural information processing systems·2026

Same author

A Double Machine Learning Approach for Combining Experimental and Observational Studies.

Observational studies·2026

Same author

Empirical Bound Information-Directed Sampling for Norm-Agnostic Bandits.

Reinforcement learning journal·2026

Same author

Assessing the utility of the Primary Care PTSD Screen for DSM-5 (PC-PTSD-5) as a screening tool among caregivers of hematopoietic stem cell transplantation survivors.

Cancer·2026

Same journal

Towards the Efficient Inference by Incorporating Automated Computational Phenotypes under Covariate Shift.

Proceedings of machine learning research·2026

Same journal

Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video.

Proceedings of machine learning research·2026

Same journal

Perspective: Machine Learning for Health Should Consider Social Drivers of Health.

Proceedings of machine learning research·2026

Same journal

Classifying Phonotrauma Severity from Vocal Fold Images with Soft Ordinal Regression.

Proceedings of machine learning research·2026

Same journal

Does Domain-Specific Retrieval Augmented Generation Help LLMs Answer Consumer Health Questions?

Proceedings of machine learning research·2026

Same journal

Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential.

Proceedings of machine learning research·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 17, 2026

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

Published on: August 16, 2017

Hidden Population Estimation with Indirect Inference and Auxiliary Information.

Justin Weltz¹, Eric Laber^1,2, Alexander Volfovsky^1,3

¹Department of Statistical Science, Duke University, Durham, North Carolina, USA.

Proceedings of Machine Learning Research

|September 22, 2025

Summary

This summary is machine-generated.

Respondent Driven Sampling (RDS) struggles with accurate hidden population size estimation. This study introduces a new method using auxiliary data and indirect inference to reduce bias and improve precision in RDS surveys.

More Related Videos

Topographical Estimation of Visual Population Receptive Fields by fMRI

Topographical Estimation of Visual Population Receptive Fields by fMRI

Published on: February 3, 2015

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

Related Experiment Videos

Last Updated: Jan 17, 2026

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

Published on: August 16, 2017

Topographical Estimation of Visual Population Receptive Fields by fMRI

Topographical Estimation of Visual Population Receptive Fields by fMRI

Published on: February 3, 2015

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

Area of Science:

Social Network Analysis
Statistical Modeling
Public Health Research

Background:

Conventional survey methods face challenges in sampling hidden or stigmatized populations.
Respondent Driven Sampling (RDS) is a key method for reaching these groups, but existing imputation techniques introduce bias.
Accurate estimation of hidden population sizes is crucial for public health interventions.

Purpose of the Study:

To develop an improved statistical method for estimating hidden population sizes using Respondent Driven Sampling (RDS).
To address and reduce estimation bias inherent in current RDS imputation techniques.
To enhance the precision of key RDS-derived metrics, including arrival rates and subgraph characteristics.

Main Methods:

Modeling RDS as a stochastic process on social network graphs.
Leveraging auxiliary participant information and indirect inference for improved imputation.
Developing novel statistical techniques to correct for biased edge imputation in RDS.

Main Results:

The proposed method significantly reduces bias in estimating the study participant arrival rate, sample subgraph, and overall population size.
Improved precision was observed in key estimation parameters compared to existing methods.
Demonstrated successful application in estimating the size of the People Who Inject Drugs (PWID) population in Estonia.

Conclusions:

The novel indirect inference approach offers a more accurate and precise way to analyze RDS data.
This method enhances the reliability of estimates for hidden populations, crucial for targeted health programs.
The findings have direct implications for improving the accuracy of public health surveys in hard-to-reach populations.