Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Estimating Population Standard Deviation

Estimating Population Standard Deviation

When the population standard deviation is unknown and the sample size is large, the sample standard deviation s is commonly used as a point estimate of σ. However, it can sometimes under or overestimate the population standard deviation. To overcome this drawback, confidence intervals are determined to estimate population parameters and eliminate any calculation bias accurately. However, this only applies to random samples from normally distributed populations. Knowing the sample mean and...

Estimating Population Mean with Unknown Standard Deviation

Estimating Population Mean with Unknown Standard Deviation

In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the...

What are Estimates?

What are Estimates?

It isn't easy to measure a parameter such as the mean height or the mean weight of a population. So, we draw samples from the population and calculate the mean height or mean weight of the individuals in the sample. This sample data acts as a representative measure of the population parameter. These sample statistics are known as estimates.
The estimate for the mean of a sample is denoted by ͞x, whereas the mean of the population is designated as μ. Further, parameters such...

Estimating Population Mean with Known Standard Deviation

Estimating Population Mean with Known Standard Deviation

To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate +...

Statistical Significance

Statistical Significance

Once data is collected from both the experimental and the control groups, a statistical analysis is conducted to find out if there are meaningful differences between the two groups. A statistical analysis determines how likely any difference found is due to chance (and thus not meaningful). In psychology, group differences are considered meaningful, or significant, if the odds that these differences occurred by chance alone are 5 percent or less. Stated another way, if we repeated this...

Empirical Method to Interpret Standard Deviation

Empirical Method to Interpret Standard Deviation

The empirical rule, also known as the three-sigma rule, allows a statistician to interpret the standard deviation in a normally distributed dataset. The rule states that 68% of the data lies within one standard deviation from the mean, 95% lies within two standard deviations from the mean, and 99.7% lies within three standard deviations from the mean. Additionally, this rule is also called the 68-95-99.7 rule.
This rule is used widely in statistics to calculate the proportion of data values...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Optimization of Fe(III)-based negative electrodes for lithium-ion batteries: probing electrochemical performance and stability characteristics.

Dalton transactions (Cambridge, England : 2003)·2026

Same author

The IMPACT epilepsy Consortium: Exploring social drivers of health in epilepsy care to advance solution based initiatives.

Epilepsy & behavior : E&B·2026

Same author

Naturalistic Driving Outcomes and Sensorimotor Function in Cognitively Normal Older Adults.

Journal of the American Geriatrics Society·2026

Same author

Multivariate and Online Transfer Learning With Uncertainty Quantification.

Statistics in medicine·2026

Same author

Redox-Active Bis-Catecholaldimine Cu(II)-Salen Complex with Hydroxyl Functionality as Cathode Material in Li-Ion Battery.

ChemPlusChem·2026

Same author

A Minimalist Iron Porphyrin Which Can Catalyze Both Peroxidation and Oxygen Reduction Reaction.

JACS Au·2025

Same journal

Regression Trees and Ensemble for Multivariate Outcomes.

Sankhya. Series B. [Methodological.]·2025

Same journal

Cluster Based Association Measures with Applications.

Sankhya. Series B. [Methodological.]·2025

Same journal

Mediation Analysis using Semi-parametric Shape-Restricted Regression with Applications.

Sankhya. Series B. [Methodological.]·2024

Same journal

A Blockwise Consistency Method for Parameter Estimation of Complex Models.

Sankhya. Series B. [Methodological.]·2021

Same journal

Local linear estimation for spatial random processes with stochastic trend and stationary noise.

Sankhya. Series B. [Methodological.]·2019

Same journal

NONPARAMETRIC BENCHMARK ANALYSIS IN RISK ASSESSMENT: A COMPARATIVE STUDY BY SIMULATION AND DATA ANALYSIS.

Sankhya. Series B. [Methodological.]·2013

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 8, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Word Embeddings as Statistical Estimators.

Neil Dey¹, Matthew Singer¹, Jonathan P Williams²

¹Department of Statistics, North Carolina State University.

Sankhya. Series B. [Methodological.]

|December 19, 2025

Summary

This summary is machine-generated.

This study introduces a statistical framework for word embeddings, interpreting Word2Vec through pointwise mutual information (PMI). A novel missing value estimator offers a statistically sound alternative with comparable performance to Word2Vec.

More Related Videos

Decoding Natural Behavior from Neuroethological Embedding

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

Related Experiment Videos

Last Updated: Jan 8, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Decoding Natural Behavior from Neuroethological Embedding

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

Area of Science:

Natural Language Processing
Statistical Theory
Machine Learning

Background:

Word embeddings are crucial in NLP but lack theoretical understanding.
Current evaluation relies on empirical performance, not rigorous properties.
Formal inference and uncertainty quantification require a theoretical basis.

Purpose of the Study:

To provide a statistical theoretical perspective on word embeddings.
To interpret classical methods like Word2Vec within a formal statistical model.
To develop a novel, statistically tractable alternative to existing word embedding techniques.

Main Methods:

Proposed a copula-based statistical model for text data.
Interpreted Word2Vec as an estimator for theoretical pointwise mutual information (PMI).
Developed a missing value-based estimator, building on prior work.

Main Results:

Demonstrated Word2Vec's connection to estimating theoretical PMI.
The proposed missing value estimator shows comparable estimation error to Word2Vec.
The new estimator outperforms truncation-based methods.
Achieved comparable performance to Word2Vec on an IMDb sentiment analysis task.

Conclusions:

The copula-based model offers a theoretical foundation for word embeddings.
The missing value estimator provides a statistically interpretable and effective alternative.
This work bridges the gap between empirical success and theoretical understanding in word embeddings.