


Word Embeddings as Statistical Estimators

Neil Dey¹, Matthew Singer¹, Jonathan P Williams²

¹Department of Statistics, North Carolina State University.

Sankhya. Series B. [Methodological.]

December 19, 2025

Summary

This summary is machine-generated.

This study introduces a statistical framework for word embeddings, interpreting Word2Vec through pointwise mutual information (PMI). A new missing-value estimator offers a statistically sound alternative with performance comparable to Word2Vec.

Keywords:

word embeddings, natural language processing, statistical theory, machine learning, pointwise mutual information, Word2Vec, missing-value estimator


Area of Science:

Natural Language Processing
Statistical Theory
Machine Learning

Background:

Word embeddings are central to NLP, but they are poorly understood theoretically.
Current evaluation relies on empirical performance rather than on rigorous properties.
Formal inference and uncertainty quantification require a theoretical foundation.

Purpose of the Study:

Provide a statistical-theoretical perspective on word embeddings.
Interpret classical methods such as Word2Vec within a formal statistical model.
Develop a novel, statistically tractable alternative to existing word embedding techniques.

Main Methods:

A copula-based statistical model for text data was proposed.
Word2Vec was interpreted as an estimator of the theoretical pointwise mutual information (PMI).
A missing-value-based estimator was developed, building on earlier work.
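
The PMI quantity named in the methods above can be illustrated with a small sketch. It assumes the standard definition PMI(w, c) = log[p(w, c) / (p(w) p(c))] and estimates it from a symmetric co-occurrence count matrix. The function names `cooccurrence_counts` and `empirical_pmi`, the window size, and the toy corpus are hypothetical. Marking zero-count cells as NaN is only one way to represent the cells a missing-value estimator would have to handle; it is not the authors' implementation.

```python
import numpy as np


def cooccurrence_counts(tokens, window=2):
    """Count symmetric co-occurrences of tokens within a fixed window."""
    vocab = sorted(set(tokens))
    index = {word: i for i, word in enumerate(vocab)}
    counts = np.zeros((len(vocab), len(vocab)))
    for i, word in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                counts[index[word], index[tokens[j]]] += 1
    return vocab, counts


def empirical_pmi(counts):
    """Plug-in PMI estimate; cells with zero counts are marked NaN (missing)."""
    joint = counts / counts.sum()              # estimated p(w, c)
    row = joint.sum(axis=1, keepdims=True)     # estimated p(w)
    col = joint.sum(axis=0, keepdims=True)     # estimated p(c)
    with np.errstate(divide="ignore", invalid="ignore"):
        pmi = np.log(joint / (row * col))
    pmi[counts == 0] = np.nan                  # log(0) is undefined: treat as missing
    return pmi


# Toy corpus, purely illustrative.
tokens = "the cat sat on the mat".split()
vocab, counts = cooccurrence_counts(tokens, window=1)
pmi = empirical_pmi(counts)
```

In this sketch the unobserved pairs stay explicitly missing instead of being set to zero or to negative infinity, which is the distinction a missing-value approach relies on.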

Main Results:

The study demonstrated Word2Vec's connection to estimating the theoretical PMI.
The proposed missing-value estimator shows estimation error comparable to that of Word2Vec.
The new estimator outperforms truncation-based methods.
Performance comparable to Word2Vec was achieved on an IMDb sentiment analysis task.
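
For context on the truncation-based methods mentioned in the results above, one widely used variant is positive PMI (PPMI): negative and undefined PMI values are replaced by zero before a low-rank factorization. The sketch below shows only that baseline, under the assumption that the input PMI matrix uses NaN or negative infinity for unobserved pairs. The name `ppmi_embeddings` and the parameter `dim` are hypothetical, and the summary does not state which truncation variants the authors compared against.

```python
import numpy as np


def ppmi_embeddings(pmi, dim=2):
    """Truncate PMI at zero (PPMI), then embed words with a truncated SVD."""
    # Unobserved cells (NaN or -inf) are replaced by 0, as are negative values.
    ppmi = np.where(np.isfinite(pmi), pmi, 0.0)
    ppmi = np.maximum(ppmi, 0.0)
    u, s, _ = np.linalg.svd(ppmi, full_matrices=False)
    # Scale the leading left singular vectors by the root singular values.
    return u[:, :dim] * np.sqrt(s[:dim])


# Hypothetical 3-word PMI matrix with missing cells on the diagonal.
pmi = np.array([
    [np.nan, 1.0, -0.5],
    [1.0, np.nan, 2.0],
    [-0.5, 2.0, -np.inf],
])
embeddings = ppmi_embeddings(pmi, dim=2)
```

Truncation discards the distinction between a pair that was never observed and a pair with genuinely low association, which is the information a missing-value treatment tries to preserve.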

Conclusions:

The copula-based model offers a theoretical foundation for word embeddings.
The missing-value estimator provides a statistically interpretable and effective alternative.
This work bridges the gap between empirical success and theoretical understanding of word embeddings.