Jove
Visualize
Contáctanos
JoVE
x logofacebook logolinkedin logoyoutube logo
ACERCA DE JoVE
Visión GeneralLiderazgoBlogCentro de Ayuda JoVE
AUTORES
Proceso de PublicaciónConsejo EditorialAlcance y PolíticasRevisión por ParesPreguntas FrecuentesEnviar
BIBLIOTECARIOS
TestimoniosSuscripcionesAccesoRecursosConsejo Asesor de BibliotecasPreguntas Frecuentes
INVESTIGACIÓN
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchivo
EDUCACIÓN
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualCentro de Recursos para ProfesoresSitio de Profesores
Términos y Condiciones de Uso
Política de Privacidad
Políticas

Videos de Conceptos Relacionados

Estimating Population Standard Deviation01:26

Estimating Population Standard Deviation

3.3K
When the population standard deviation is unknown and the sample size is large, the sample standard deviation s is commonly used as a point estimate of σ. However, it can sometimes under or overestimate the population standard deviation. To overcome this drawback, confidence intervals are determined to estimate population parameters and eliminate any calculation bias accurately. However, this only applies to random samples from normally distributed populations. Knowing the sample mean and...
3.3K
Estimating Population Mean with Unknown Standard Deviation01:22

Estimating Population Mean with Unknown Standard Deviation

8.7K
In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the...
8.7K
What are Estimates?01:06

What are Estimates?

8.0K
It isn't easy to measure a parameter such as the mean height or the mean weight of a population. So, we draw samples from the population and calculate the mean height or mean weight of the individuals in the sample. This sample data acts as a representative measure of the population parameter. These sample statistics are known as estimates. 
The estimate for the mean of a sample is denoted by ͞x, whereas the mean of the population is designated as μ. Further, parameters such...
8.0K
Estimating Population Mean with Known Standard Deviation01:16

Estimating Population Mean with Known Standard Deviation

9.6K
To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate +...
9.6K
Statistical Significance01:50

Statistical Significance

21.0K
Once data is collected from both the experimental and the control groups, a statistical analysis is conducted to find out if there are meaningful differences between the two groups. A statistical analysis determines how likely any difference found is due to chance (and thus not meaningful). In psychology, group differences are considered meaningful, or significant, if the odds that these differences occurred by chance alone are 5 percent or less. Stated another way, if we repeated this...
21.0K
Empirical Method to Interpret Standard Deviation01:09

Empirical Method to Interpret Standard Deviation

9.3K
The empirical rule, also known as the three-sigma rule, allows a statistician to interpret the standard deviation in a normally distributed dataset. The rule states that 68% of the data lies within one standard deviation from the mean, 95% lies within two standard deviations from the mean, and 99.7% lies within three standard deviations from the mean. Additionally, this rule is also called the 68-95-99.7 rule.
This rule is used widely in statistics to calculate the proportion of data values...
9.3K

También podría leer

Artículos Relacionados

Artículos vinculados a este trabajo por autores compartidos, revista y gráfico de citas.

Ordenar por
Same author

Optimization of Fe(III)-based negative electrodes for lithium-ion batteries: probing electrochemical performance and stability characteristics.

Dalton transactions (Cambridge, England : 2003)·2026
Same author

The IMPACT epilepsy Consortium: Exploring social drivers of health in epilepsy care to advance solution based initiatives.

Epilepsy & behavior : E&B·2026
Same author

Naturalistic Driving Outcomes and Sensorimotor Function in Cognitively Normal Older Adults.

Journal of the American Geriatrics Society·2026
Same author

Multivariate and Online Transfer Learning With Uncertainty Quantification.

Statistics in medicine·2026
Same author

Redox-Active Bis-Catecholaldimine Cu(II)-Salen Complex with Hydroxyl Functionality as Cathode Material in Li-Ion Battery.

ChemPlusChem·2026
Same author

A Minimalist Iron Porphyrin Which Can Catalyze Both Peroxidation and Oxygen Reduction Reaction.

JACS Au·2025
Same journal

Regression Trees and Ensemble for Multivariate Outcomes.

Sankhya. Series B. [Methodological.]·2025
Same journal

Cluster Based Association Measures with Applications.

Sankhya. Series B. [Methodological.]·2025
Same journal

Mediation Analysis using Semi-parametric Shape-Restricted Regression with Applications.

Sankhya. Series B. [Methodological.]·2024
Same journal

A Blockwise Consistency Method for Parameter Estimation of Complex Models.

Sankhya. Series B. [Methodological.]·2021
Same journal

Local linear estimation for spatial random processes with stochastic trend and stationary noise.

Sankhya. Series B. [Methodological.]·2019
Same journal

NONPARAMETRIC BENCHMARK ANALYSIS IN RISK ASSESSMENT: A COMPARATIVE STUDY BY SIMULATION AND DATA ANALYSIS.

Sankhya. Series B. [Methodological.]·2013
Ver todos los artículos relacionados

Video Experimental Relacionado

Updated: Jan 8, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

986

Word Embeddings como Estimadores Estadísticos

Neil Dey1, Matthew Singer1, Jonathan P Williams2

  • 1Department of Statistics, North Carolina State University.

Sankhya. Series B. [Methodological.]
|December 19, 2025
PubMed
Resumen
Este resumen es generado por máquina.

Este estudio introduce un marco estadístico para incrustaciones de palabras, interpretando Word2Vec a través de la información mutua punto a punto (PMI). Un nuevo estimador de valores faltantes ofrece una alternativa estadísticamente sólida con un rendimiento comparable a Word2Vec.

Palabras clave:
incrustaciones de palabrasprocesamiento del lenguaje naturalteoría estadísticaaprendizaje automáticoinformación mutua punto a puntoWord2Vecestimador de valores faltantes

Más Videos Relacionados

Decoding Natural Behavior from Neuroethological Embedding
08:00

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

556

Videos de Experimentos Relacionados

Last Updated: Jan 8, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

986
Decoding Natural Behavior from Neuroethological Embedding
08:00

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

556

Área de la Ciencia:

  • Procesamiento del Lenguaje Natural
  • Teoría Estadística
  • Aprendizaje Automático

Sus antecedentes:

  • Las incrustaciones de palabras son cruciales en el PNL, pero carecen de comprensión teórica.
  • La evaluación actual se basa en el rendimiento empírico, no en propiedades rigurosas.
  • La inferencia formal y la cuantificación de la incertidumbre requieren una base teórica.

Objetivo del estudio:

  • Proporcionar una perspectiva teórica estadística sobre las incrustaciones de palabras.
  • Interpretar métodos clásicos como Word2Vec dentro de un modelo estadístico formal.
  • Desarrollar una alternativa novedosa y estadísticamente tratable a las técnicas existentes de incrustación de palabras.

Principales métodos:

  • Se propuso un modelo estadístico basado en cópulas para datos de texto.
  • Se interpretó Word2Vec como un estimador de la información mutua punto a punto (PMI) teórica.
  • Se desarrolló un estimador basado en valores faltantes, basándose en trabajos anteriores.

Principales resultados:

  • Demostró la conexión de Word2Vec con la estimación de la PMI teórica.
  • El estimador de valores faltantes propuesto muestra un error de estimación comparable al de Word2Vec.
  • El nuevo estimador supera a los métodos basados en la truncación.
  • Se logró un rendimiento comparable al de Word2Vec en una tarea de análisis de sentimientos de IMDb.

Conclusiones:

  • El modelo basado en cópulas ofrece una base teórica para las incrustaciones de palabras.
  • El estimador de valores faltantes proporciona una alternativa estadísticamente interpretable y eficaz.
  • Este trabajo cierra la brecha entre el éxito empírico y la comprensión teórica en las incrustaciones de palabras.