Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Cluster Sampling Method01:20

Cluster Sampling Method

11.6K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
11.6K
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

385
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
385
Random Sampling Method01:09

Random Sampling Method

11.0K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
11.0K
Maxwell-Boltzmann Distribution: Problem Solving01:20

Maxwell-Boltzmann Distribution: Problem Solving

1.4K
Individual molecules in a gas move in random directions, but a gas containing numerous molecules has a predictable distribution of molecular speeds, which is known as the Maxwell-Boltzmann distribution, f(v).
This distribution function f(v) is defined by saying that the expected number N (v1,v2) of particles with speeds between v1 and v2 is given by
1.4K
Distributed Loads: Problem Solving01:21

Distributed Loads: Problem Solving

623
Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...
623
One-Way ANOVA: Equal Sample Sizes01:15

One-Way ANOVA: Equal Sample Sizes

3.2K
One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...
3.2K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Echocardiographic prediction of functional coronary stenosis: global longitudinal strain as a key determinant of quantitative flow ratio.

Internal and emergency medicine·2026
Same author

SLC25A21 promotes ferroptosis by inducing mitochondrial GPX4 deficiency in colorectal cancer.

Cellular and molecular life sciences : CMLS·2026
Same author

A rapidly personalized in-hospital bloodstream infection prediction model: a multicenter retrospective study.

BMC infectious diseases·2026
Same author

Microstructured electrode coupled with electrochemical deposition enrichment laser-induced breakdown spectroscopy for ppb-level sensitive detection of Pb<sup>2+</sup> and Cr<sup>3+</sup> in water.

Talanta·2026
Same author

A novel serum phosphorus to chloride and bicarbonate ratio predicts severe acute kidney injury in critically ill patients: a multicenter cohort study.

Respiratory medicine·2026
Same author

Epigenetic and O-glycosylation regulation of p66Shc mitigates mitochondrial oxidative stress in aortic dissection.

Theranostics·2026
Same journal

Analysis of strength degradation of coal and rock masses and stability of mined areas under long term immersion environment.

PloS one·2026
Same journal

Biogenic Silver-Selenium nanocomposite with anticancer activity and potent efficacy against vancomycin-resistant Staphylococcus aureus.

PloS one·2026
Same journal

Preparation and physicochemical characterization of a biodegradable chitosan/carboxymethyl cellulose hydrogel synthesized in NaOH/urea medium.

PloS one·2026
Same journal

Action-guilt, survivor-guilt, and depression in combat-related PTSD.

PloS one·2026
Same journal

Explainable machine learning for predicting activities of daily living at discharge in stroke patients: A retrospective study using SHAP interpretability.

PloS one·2026
Same journal

Deep learning based two-way feature depiction model for brain tumor detection.

PloS one·2026
查看所有相关文章

相关实验视频

Updated: Jun 4, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.9K

分布式K-Means算法基于一个Spark优化样本.

Yongan Feng1, Jiapeng Zou1, Wanjun Liu1

  • 1Liaoning Technical University, Huludao, China.

PloS one
|December 23, 2024
PubMed
概括
此摘要是机器生成的。

我们开发了SOSK-Means,这是一个针对大数据的优化K-Means算法. 它显著提高了大规模集群任务的计算速度和准确性.

更多相关视频

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.4K
Determination of Aggregate Surface Morphology at the Interfacial Transition Zone ITZ
08:59

Determination of Aggregate Surface Morphology at the Interfacial Transition Zone ITZ

Published on: December 16, 2019

8.1K

相关实验视频

Last Updated: Jun 4, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.9K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.4K
Determination of Aggregate Surface Morphology at the Interfacial Transition Zone ITZ
08:59

Determination of Aggregate Surface Morphology at the Interfacial Transition Zone ITZ

Published on: December 16, 2019

8.1K

科学领域:

  • 数据科学数据科学数据科学
  • 机器学习 机器学习
  • 大数据分析大数据分析

背景情况:

  • 经典的K-Means算法在大量数据集中存在不稳定性和性能问题.
  • 对各种数据挖掘应用程序来说,对大规模数据的高效聚类至关重要.

研究的目的:

  • 介绍SOSK-Means,一个针对Spark优化的增强K-Means算法,以解决大规模数据集上的经典K-Means的局限性.
  • 为了提高K-Means集群大规模数据的计算速度和准确性.

主要方法:

  • 实施了加权跳槽方法,以实现高效的随机抽样和预集群,改善初始中心选择.
  • 使用加权的最大-最小距离与差异来进行增强的距离计算,考虑数据重量和差异.
  • 采用了一种新的距离比较方法和定向环形图 (DAG) 来优化Spark上的计算和分布式处理.

主要成果:

  • 与经典的K-Means相比,SOSK-Means在计算速度方面取得了显著的改进.
  • 该算法保持了高的计算精度,有效地处理大量数据集.
  • 改进的初始中心选择和距离计算有助于提高聚类性能.

结论:

  • 通过Spark优化,SOSK-Means为大规模数据聚类提供了强大而高效的解决方案.
  • 拟议的修改有效地解决了传统K-Means的不稳定性和性能瓶.
  • 这种优化的算法非常适合大数据分析,需要快速准确的集群.