Distributed K-Means algorithm based on a Spark optimization sample

Yongan Feng¹, Jiapeng Zou¹, Wanjun Liu¹

¹Liaoning Technical University, Huludao, China.

December 23, 2024

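Summary

This summary is machine-generated.

We developed SOSK-Means, an optimized K-Means algorithm for big data. It significantly improves computational speed and accuracy on large-scale clustering tasks.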

Scientific fields:

Data science
Machine learning
Big data analytics

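Background:

The classical K-Means algorithm suffers from instability and performance problems on massive datasets.
Efficient clustering of large-scale data is essential for a wide range of data mining applications.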
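The instability mentioned above can be illustrated with a minimal sketch of the classical (Lloyd) K-Means iteration. This is a plain, single-machine illustration and not the SOSK-Means implementation; the function names `kmeans` and `sse` and the toy data are assumptions made for this example.

```python
import math

def kmeans(points, centers, max_iter=100):
    """Classical Lloyd iteration starting from the given initial centers."""
    centers = [tuple(c) for c in centers]
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster
        # (an empty cluster keeps its previous center).
        new_centers = [
            tuple(sum(xs) / len(c) for xs in zip(*c)) if c else centers[j]
            for j, c in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers

def sse(points, centers):
    """Sum of squared distances from each point to its nearest center."""
    return sum(min(math.dist(p, c) for c in centers) ** 2 for p in points)

# Hypothetical toy data: three well-separated pairs of points.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0), (100.0, 0.0), (100.0, 1.0)]

good = kmeans(points, [(0.0, 0.0), (10.0, 0.0), (100.0, 0.0)])
bad = kmeans(points, [(0.0, 0.0), (100.0, 0.0), (100.0, 1.0)])

sse_good = sse(points, good)
sse_bad = sse(points, bad)
```

With the second set of initial centers, two centers start in the same pair and the iteration settles on a much worse solution, so `sse_bad` is far larger than `sse_good`. This sensitivity to the starting centers is the kind of instability that improved initial center selection is meant to reduce.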

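Purpose of the study:

To introduce SOSK-Means, an enhanced K-Means algorithm optimized for Spark, which addresses the limitations of classical K-Means on large-scale datasets.
To improve the computational speed and accuracy of K-Means clustering on large-scale data.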

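Main methods:

A weighted jump-bank approach was implemented for efficient random sampling and pre-clustering, improving the selection of initial centers.
A weighted max-min distance with variance was used for enhanced distance calculation, taking both data weights and variance into account.
A novel distance comparison method and a directed acyclic graph (DAG) were adopted to optimize computation and distributed processing on Spark.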
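For orientation, the classical max-min distance rule for choosing initial centers can be sketched as follows. This is the plain, unweighted rule only: the weights and variance term used by SOSK-Means are not specified in this summary and are not reproduced here. The function name `max_min_centers`, the `first` parameter, and the sample points are assumptions made for this example.

```python
import math

def max_min_centers(points, k, first=0):
    """Pick k initial centers with the classical (unweighted) max-min rule."""
    centers = [points[first]]
    # min_dist[i] is the distance from points[i] to its nearest chosen center.
    min_dist = [math.dist(p, centers[0]) for p in points]
    while len(centers) < k:
        # The next center is the point farthest from every center chosen so far.
        idx = max(range(len(points)), key=min_dist.__getitem__)
        centers.append(points[idx])
        # Refresh the nearest-center distances with the newly added center.
        min_dist = [min(d, math.dist(p, points[idx])) for d, p in zip(min_dist, points)]
    return centers

# Hypothetical toy data: three loose groups of two points each.
points = [(0.0, 0.0), (0.1, 0.2), (5.0, 5.0), (5.1, 4.9), (0.0, 9.0), (0.2, 9.1)]
centers = max_min_centers(points, k=3)
```

On this toy data the rule returns one center from each group, which is why max-min style seeding tends to spread the initial centers apart. A weighted variant would presumably rescale the distances before taking the maximum, but that detail is an assumption rather than something stated in the summary.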

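Main results:

Compared with classical K-Means, SOSK-Means achieved a significant improvement in computational speed.
The algorithm maintained high computational accuracy and handled massive datasets effectively.
The improved initial center selection and distance calculation contributed to better clustering performance.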

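Conclusions:

Through Spark optimization, SOSK-Means provides a robust and efficient solution for large-scale data clustering.
The proposed modifications effectively address the instability and performance bottlenecks of traditional K-Means.
The optimized algorithm is well suited to big data analytics that requires fast and accurate clustering.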