Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Statistical Analysis: Overview01:11

Statistical Analysis: Overview

6.7K
When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
6.7K
Two-Way ANOVA01:17

Two-Way ANOVA

2.7K
The two-way ANOVA is an extension of the one-way ANOVA. It is a statistical test performed on three or more samples categorized by two factors - a row factor and a column factor. Ronald Fischer mentioned it in 1925 in his book 'Statistical Methods for Researchers.'
The two-way ANOVA analysis initially begins by stating the null hypothesis that there is an interaction effect between the two factors of a dataset. This effect can be visualized using line segments formed by joining the...
2.7K
One-Way ANOVA01:18

One-Way ANOVA

8.0K
One-way ANOVA analyzes more than three samples categorized by one factor. For example, it can compare the average mileage of sports bikes. Here, the data is categorized by one factor - the company. However, one-way ANOVA cannot be used to simultaneously compare the sample mean of three or more samples categorized by two factors. An example of two factors would be sports bikes from different companies driven in different terrains, such as a desert or snowy landscape. Here, two-way ANOVA is used...
8.0K
One-Way ANOVA: Unequal Sample Sizes01:15

One-Way ANOVA: Unequal Sample Sizes

5.8K
One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:
5.8K
Data Collection by Observations01:08

Data Collection by Observations

12.1K
Data collection refers to a systematic way of obtaining, observing, measuring, and analyzing accurate information. Observational studies are one of the most widely used methods of data collection. It involves collecting data by observing the behavior and physical characteristics of a sample without making any modifications to the sample.
An astronomer viewing the motion and brightness of stars in the sky and recording the data is an example of observational data collection. A botanist recording...
12.1K
Statistical Hypothesis Testing01:16

Statistical Hypothesis Testing

2.0K
Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...
2.0K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Stage-stratified prognostic impact of comorbidities on breast cancer-specific survival: A population-based flexible parametric modelling study.

Cancer epidemiology·2026
Same author

Semiparametric accelerated failure time models with time-varying covariates under partly interval censoring.

BMC medical research methodology·2026
Same author

Physical drivers of transport, dispersion and trapping of microplastics in a macrotidal, hyper-turbid fluvial-estuarine system: A modelling approach.

Marine pollution bulletin·2026
Same author

A data-informed multidimensional composite score for stress assessment.

Acta psychologica·2026
Same author

Bayesian uncertainty quantification to identify population level vaccine hesitancy behaviours.

PloS one·2026
Same author

Hydrocortisone versus dexamethasone in cerebral salt-wasting after aneurysmal subarachnoid hemorrhage.

Brain & spine·2026
Same journal

Analysis of strength degradation of coal and rock masses and stability of mined areas under long term immersion environment.

PloS one·2026
Same journal

Biogenic Silver-Selenium nanocomposite with anticancer activity and potent efficacy against vancomycin-resistant Staphylococcus aureus.

PloS one·2026
Same journal

Preparation and physicochemical characterization of a biodegradable chitosan/carboxymethyl cellulose hydrogel synthesized in NaOH/urea medium.

PloS one·2026
Same journal

Action-guilt, survivor-guilt, and depression in combat-related PTSD.

PloS one·2026
Same journal

Explainable machine learning for predicting activities of daily living at discharge in stroke patients: A retrospective study using SHAP interpretability.

PloS one·2026
Same journal

Deep learning based two-way feature depiction model for brain tumor detection.

PloS one·2026
查看所有相关文章

相关实验视频

Updated: Jul 25, 2025

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

10.1K

SMOTE-CD:用于组成数据的SMOTE.

Teo Nguyen1,2, Kerrie Mengersen1,3, Damien Sous4,5

  • 1Laboratoire de Mathématiques et de leurs Applications, Université de Pau et des Pays de l'Adour, E2S UPPA, CNRS, Anglet, France.

PloS one
|June 29, 2023
PubMed
概括
此摘要是机器生成的。

本研究介绍了SMOTE for Compositional Data (SMOTE-CD),这是一种解决构成数据中的类不平衡的新方法. SMOTE-CD在各种指标上改善了模型性能,特别是在现实世界数据集上提高了F1分数.

更多相关视频

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K
Flypub To Study Ethanol Induced Behavioral Disinhibition and Sensitization
08:13

Flypub To Study Ethanol Induced Behavioral Disinhibition and Sensitization

Published on: May 18, 2020

6.6K

相关实验视频

Last Updated: Jul 25, 2025

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

10.1K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K
Flypub To Study Ethanol Induced Behavioral Disinhibition and Sensitization
08:13

Flypub To Study Ethanol Induced Behavioral Disinhibition and Sensitization

Published on: May 18, 2020

6.6K

科学领域:

  • 统计 统计 统计 统计
  • 机器学习 机器学习
  • 数据科学数据科学数据科学

背景情况:

  • 构成数据,表示相对比例,是普遍存在的,但缺乏解决不平衡阶级的解决方案.
  • 现有的方法不能充分处理不平衡的组成数据的独特特征.

研究的目的:

  • 为不平衡的组成数据提出SMOTE (合成少数人过量采样技术) 的调整.
  • 引入组合数据的SMOTE (SMOTE-CD) 并评估其有效性.

主要方法:

  • 通过使用组合数据运算来调整原来的SMOTE算法,开发了SMOTE-CD.
  • 通过对现有数据的线性组合生成合成数据点.
  • 在真实和合成数据集上测试了SMOTE-CD与梯度增强,神经网络和迪里克莱特回归器.

主要成果:

  • SMOTE-CD在准确性,交叉,F1得分,R2得分和RMSE方面显示出性能改善.
  • 过量采样持续增加了F1分数,特别是在真实数据集中.
  • 过量抽样的影响因模型和数据而异;它有时会降低多数类的表现,但在真实数据上产生最好的总体结果.

结论:

  • SMOTE-CD是处理不平衡组合数据的有效技术.
  • 该方法在具有偏斜组成数据的场景中有望改善机器学习模型性能.
  • 有一个Python包,smote-cd,可用于实现SMOTE-CD方法.