Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K
Regression Analysis01:11

Regression Analysis

5.7K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.7K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

1.5K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
1.5K
Factorial Design02:01

Factorial Design

13.0K
Factorial Analysis is an experimental design that applies Analysis of Variance (ANOVA) statistical procedures to examine a change in a dependent variable due to more than one independent variable, also known as factors. Changes in worker productivity can be reasoned, for example, to be influenced by salary and other conditions, such as skill level. One way to test this hypothesis is by categorizing salary into three levels (low, moderate, and high) and skills sets into two levels (entry level...
13.0K
Multiple Allele Traits01:49

Multiple Allele Traits

34.1K
The Concept of Multiple Allelism
34.1K
Collisions in Multiple Dimensions: Problem Solving01:06

Collisions in Multiple Dimensions: Problem Solving

3.7K
In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...
3.7K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Integrated analysis for electronic health records with structured and sporadic missingness.

Journal of biomedical informatics·2025
Same author

Phylogenetic association analysis with conditional rank correlation.

Biometrika·2024
Same author

Matrix Reordering for Noisy Disordered Matrices: Optimality and Computationally Efficient Algorithms.

IEEE transactions on information theory·2024
Same author

Estimation and Inference for High-Dimensional Generalized Linear Models with Knowledge Transfer.

Journal of the American Statistical Association·2024
Same author

Optimal Estimation of Genetic Relatedness in High-dimensional Linear Models.

Journal of the American Statistical Association·2024
Same author

Transfer Learning in Large-scale Gaussian Graphical Models with False Discovery Rate Control.

Journal of the American Statistical Association·2023
Same journal

Simplifying debiased inference via automatic differentiation and probabilistic programming.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Principal stratification with U-statistics under principal ignorability.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Causal K-Means Clustering.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Inference of dependency knowledge graph for Electronic Health Records.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Correction to: Inference of dependency knowledge graph for Electronic Health Records.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Harmonized Estimation of Subgroup-Specific Treatment Effects in Randomized Trials: The Use of External Control Data.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
查看所有相关文章

相关实验视频

Updated: Jun 13, 2025

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

19.9K

测试高维多项的应用到文本分析的应用.

T Tony Cai1, Zheng T Ke2, Paxton Turner2

  • 1Department of Statistics and Data Science, University of Pennsylvania, Philadelphia, PA, USA.

Journal of the Royal Statistical Society. Series B, Statistical methodology
|September 16, 2024
PubMed
概括
此摘要是机器生成的。

我们开发了一种新的统计测试,用于比较高维多项分布,这对于文本挖掘和离散分布推断至关重要. 这种测试是高效的,在各种应用中实现最佳检测边界.

关键词:
著作权归因 著作权归因接近性测试测试 接近性测试客户评论 客户评论 客户评论马丁盖尔的中央极限定理最低限度最佳度最小限度最佳度主题模型 主题模型 主题模型

更多相关视频

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.8K

相关实验视频

Last Updated: Jun 13, 2025

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

19.9K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.8K

科学领域:

  • 多变量统计的多变量统计.
  • 计算统计的计算统计.
  • 机器学习 机器学习

背景情况:

  • 对比离散概率分布对于文本挖掘,主题建模和作者归因至关重要.
  • 现有方法通常需要假设参数均性或相同样本大小,限制其适用性.
  • 高维的多项分布带来了独特的挑战,因为维度的诅咒.

研究的目的:

  • 开发一种新的统计测试,用于高维多项分布的K组中概率质量函数的等式.
  • 在零假设下确定拟议的测试统计的非对称属性.
  • 在现实场景中证明测试的最佳性和实际实用性.

主要方法:

  • 为比较多项概率质量函数提出了一个新的测试统计.
  • 测试统计数据的非对称零分布是作为标准正常推导的.
  • 测试实现最佳检测边界的能力在理论上已经确立.
  • 进行模拟研究和现实世界数据集分析.

主要成果:

  • 拟议的测试统计在零假设下具有非对称的标准正常分布.
  • 限制性零分布是无参数的,不需要相同的组大小或组内相同的参数.
  • 测试通过参数空间实现了最佳检测边界.
  • 模拟证实了测试的性能,应用程序显示其在分析客户评论和科学摘要中的实用性.

结论:

  • 本文介绍了一种强大而异常的最佳测试,用于比较高维的多项分布.
  • 该方法为文本挖掘,主题建模和离散分布分析中的应用提供了强大的工具.
  • 该测试的无参数零分布和最佳性使其在没有严格假设的情况下广泛适用.