Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Stratified Sampling Method01:16

Stratified Sampling Method

13.0K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a stratified sample, divide the population into groups called strata and then take a...
13.0K
Bootstrapping01:24

Bootstrapping

673
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
673
Cluster Sampling Method01:20

Cluster Sampling Method

12.8K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
12.8K
Random Sampling Method01:09

Random Sampling Method

12.5K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
12.5K
Systematic Sampling Method01:17

Systematic Sampling Method

11.2K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
Systematic sampling is one of the simplest methods...
11.2K
Sampling Distribution01:12

Sampling Distribution

13.6K
Given simple random samples of size n from a given population with a measured characteristic such as mean, proportion, or standard deviation for each sample, the probability distribution of all the measured characteristics is called a sampling distribution. How much the statistic varies from one sample to another is known as the sampling variability of a statistic. You typically measure the sampling variability of a statistic by its standard error. The standard error of the mean is an example...
13.6K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Del Nido versus HTK cardioplegia for myocardial protection during adult complex valve surgery: a retrospective study.

BMC cardiovascular disorders·2021
Same author

miR-141-3p regulates saturated fatty acid-induced cardiomyocyte apoptosis through Notch1/PTEN/AKT pathway via targeting PSEN1.

Environmental toxicology·2021
Same author

Fe<sub>3</sub>O<sub>4</sub>@polydopamine nanoparticle-loaded human umbilical cord mesenchymal stem cells improve the cognitive function in Alzheimer's disease mice by promoting hippocampal neurogenesis.

Nanomedicine : nanotechnology, biology, and medicine·2021
Same author

Preparation of Pseudo-typed H5 Avian Influenza Viruses with Calcium Phosphate Transfection Method and Measurement of Antibody Neutralizing Activity.

Journal of visualized experiments : JoVE·2021
Same author

Corrigendum: Xanthomatous Hypophysitis: A Case Report and Comprehensive Literature Review.

Frontiers in endocrinology·2021
Same author

rSjP40 Inhibited the Activity of Collagen Type I Promoter <i>via</i> Ets-1 in HSCs.

Frontiers in cell and developmental biology·2021
Same journal

Poly(bromophenol blue)/CoSn(OH)<sub>6</sub> cubic particles modified pencil graphite electrode for electrochemical determination of diphenhydramine.

Scientific reports·2026
Same journal

Dietary Chlorella, Spirulina, and acidifier modulate jejunal cytokine-related gene expression in broiler chickens.

Scientific reports·2026
Same journal

Perceived physical activity barriers in university students: associations with fatigue and eating behaviours.

Scientific reports·2026
Same journal

Refuge limitation structures habitat use in agricultural landscapes: evidence from Sunda pangolins.

Scientific reports·2026
Same journal

Lightweight stateless transaction verification with outsourced witness updates for UTXO blockchains.

Scientific reports·2026
Same journal

Efficacy of historical context and exogenous features on deep learning for cooling load forecasting in chilled water plants.

Scientific reports·2026
查看所有相关文章

相关实验视频

Updated: Sep 17, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K

一个改进的SMOTE算法通过扩大样本生成空间来增强不平衡数据分类.

Ying Li1,2, Yali Yang1, Peihua Song3,4

  • 1School of Logistics Management and Engineering, Nanning Normal University, Nanning, 530001, Guangxi, China.

Scientific reports
|July 2, 2025
PubMed
概括
此摘要是机器生成的。

数据集中的类不平衡阻碍了模型性能. 拟议的增强合成少数群体过量采样技术 (SMOTE) 产生了更现实的合成样本,提高了对不平衡数据的分类器准确性和稳定性.

关键词:
分类 分类 分类 分类.没有平衡的数据集.过量采样过度采样在SMOTE中使用.

更多相关视频

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

915

相关实验视频

Last Updated: Sep 17, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

915

科学领域:

  • 机器学习 机器学习
  • 数据科学数据科学数据科学
  • 人工智能的人工智能

背景情况:

  • 数据集中的类不平衡显著降低了分类模型的性能.
  • 现有的过量采样技术,如SMOTE,往往无法保持本地数据密度和分布.
  • 需要改进的方法来合成更好地反映原始数据特征的样本,以便进行可靠的分类.

研究的目的:

  • 引入一个增强的SMOTE算法 (ISMOTE),该算法包含用于合成样本生成的本地空间信息.
  • 解决传统SMOTE在处理本地数据分布和密度扭曲方面的局限性.
  • 提高对不平衡数据集的分类模型的稳定性和性能.

主要方法:

  • 提出ISMOTE,它修改了生成合成样品的空间限制.
  • 在两个原始样本之间生成一个基样,并使用欧几里德距离来创建新的样本.
  • 适应性地扩大合成样本生成空间,以更好地保护本地数据分布.

主要成果:

  • 在13个公共数据集上使用7个过量采样算法进行比较分析.
  • 伊斯莫特在二维和三维散射图中展示了更现实的数据分布.
  • 分类器性能显著改善:F1评分 (+13.07%),G平均值 (+16.55%),和AUC (+7.94%).

结论:

  • 伊斯莫特有效地减轻了局部数据分布和密度的扭曲.
  • 该算法显示了对多类不平衡数据集的参数适应性.
  • 伊斯莫特为处理类不平衡提供了一种优越的方法,以提高机器学习模型的性能.