Python code smells detection using conventional machine learning models.

Rana Sandouka¹, Hamoud Aljamaan¹

¹Information and Computer Science Department, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia.

PeerJ. Computer science

June 22, 2023

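Summary

This summary is machine-generated.

This study introduces a new Python dataset for detecting the Large Class and Long Method code smells. The machine learning models varied in performance: random forest performed best for Large Class detection, and decision tree performed best for Long Method detection.

Keywords:

Code smell, Detection, Large class, Long method, Machine learning, Python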

Scientific fields:

Software engineering
Machine learning
Data science

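Background:

Code smells degrade software quality and complicate maintenance.
Existing research on code smell detection mainly uses Java datasets.
Dedicated Python code smell datasets for machine learning are lacking.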

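Study objectives:

Build and introduce a new Python code smell dataset.
Evaluate the performance of baseline machine learning models for detecting the Large Class and Long Method code smells in Python.
Establish a benchmark for future research on Python code smell detection.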

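Main methods:

Develop a Python code smell dataset containing 1,000 samples each for the Large Class and Long Method smells, with 18 extracted source code features.
Investigate six machine learning models as baselines for code smell detection.
Evaluate model performance using accuracy and the Matthews correlation coefficient (MCC).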
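
To make the idea of "extracted source code features" concrete, the sketch below computes a few size metrics per function with Python's standard `ast` module. It is only an illustration: this summary does not list the 18 features or the extraction tooling, so the three metrics, the `method_metrics` helper, and the `SAMPLE` snippet are all assumptions.

```python
import ast

# Hypothetical input: a tiny module with one short and one longer function.
SAMPLE = '''
def short(a):
    return a + 1

def longer(a, b, c):
    total = 0
    for x in a:
        if x > b:
            total += x
    while total > c:
        total -= 1
    return total
'''

# Statement types counted as branching constructs (an illustrative choice).
BRANCH_NODES = (ast.If, ast.For, ast.While, ast.Try)


def method_metrics(source):
    """Return a list of simple size metrics, one dict per function or method."""
    rows = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            rows.append({
                "name": node.name,
                # Physical lines spanned by the definition, including the def line.
                "lines": node.end_lineno - node.lineno + 1,
                # Regular positional parameters only (no keyword-only or *args).
                "parameters": len(node.args.args),
                # Branching statements nested anywhere inside the function.
                "branches": sum(isinstance(n, BRANCH_NODES) for n in ast.walk(node)),
            })
    return rows


if __name__ == "__main__":
    for row in method_metrics(SAMPLE):
        print(row)
```

Rows like these, paired with a smelly or non-smelly label, would form the kind of feature table a conventional classifier can be trained on. `end_lineno` requires Python 3.8 or later.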

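Main results:

The random forest model achieved the highest MCC, 0.77, for Large Class detection.
The decision tree model performed best for Long Method detection, with an MCC of 0.89.
Performance varied across models and code smell types, highlighting the need for tailored approaches.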
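
To help read the MCC values above, here is a minimal sketch of how accuracy and MCC are computed from binary predictions, using the standard definition of MCC. The label vectors are made up for illustration and are not data from the study; returning 0.0 when the denominator is zero is a common convention, assumed here.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for 0/1 labels."""
    tp = tn = fp = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1
        elif truth == 0 and pred == 0:
            tn += 1
        elif truth == 0 and pred == 1:
            fp += 1
        else:
            fn += 1
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of predictions that match the true label."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient, in the range [-1, 1]."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0  # assumed convention when a row or column of the matrix is empty
    return (tp * tn - fp * fn) / denom


if __name__ == "__main__":
    # Made-up labels: 1 = smelly, 0 = not smelly.
    balanced_true = [1, 1, 1, 1, 0, 0, 0, 0]
    balanced_pred = [1, 1, 1, 0, 0, 0, 0, 1]
    counts = confusion_counts(balanced_true, balanced_pred)
    print(accuracy(*counts), mcc(*counts))

    # A classifier that always predicts "not smelly" on imbalanced data.
    skewed_true = [0] * 9 + [1]
    skewed_pred = [0] * 10
    counts = confusion_counts(skewed_true, skewed_pred)
    print(accuracy(*counts), mcc(*counts))
```

In the second case, accuracy is high while MCC is zero, which is why MCC is often reported alongside accuracy for classification tasks.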

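Conclusions:

The new Python dataset supports research on code smell detection for this widely used language.
Specific machine learning models show promise for detecting different types of Python code smells.
Further research can build on these findings to improve automated code quality assessment in Python projects.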