
Updated: Jan 11, 2026

NApy: Efficient Statistics in Python for Large-Scale Heterogeneous Data with Enhanced Support for Missing Data

Fabian Woller¹,², Lis Arend²,³, Christian Fuchsberger²

¹Biomedical Network Science Lab, Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Nürnberger Straße 74, 91052 Erlangen, Germany.

November 9, 2025

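Summary

This summary is machine-generated.

NApy is a new Python package for efficient statistical testing on large datasets with missing values. Compared with existing tools, it significantly improves runtime and memory usage, making real-time data analysis possible.

Keywords:

Python, efficient computation and parallelization, large-scale datasets, missing data, statistical software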

Scientific fields:

Computational biology
Data science
Statistical computing

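Background:

Existing Python libraries struggle to run statistical tests efficiently on large datasets that contain missing values.
Runtime and memory constraints are critical for applications such as interactive biomedical data analysis.
These limitations hinder exploratory data analysis in resource-intensive domains.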

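Aims of the study:

Introduce NApy, a Python package designed for scalable statistical testing.
Address the challenge of handling missing values in large, mixed-type datasets.
Provide an efficient solution for computational tasks in data science and bioinformatics.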

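Main methods:

NApy was developed with Numba and C++ backends.
OpenMP is used for parallelization to improve performance.
The work focuses on optimizing the computation of statistical tests on datasets with missing entries.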
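
To make the problem concrete, here is a minimal sketch of one common way to run a test on data with missing entries: pairwise deletion, where each pair of variables is compared only on the positions observed in both. This is an illustrative pure-Python example, not NApy's API or implementation. The function names `pairwise_pearson` and `correlation_matrix` and the toy data are assumptions made for this sketch, and the code has none of the Numba, C++, or OpenMP acceleration described above.

```python
import math
from itertools import combinations


def pairwise_pearson(x, y):
    """Pearson r using only positions where both x and y are observed.

    Missing values are encoded as NaN (an assumption of this sketch).
    """
    pairs = [(a, b) for a, b in zip(x, y)
             if not (math.isnan(a) or math.isnan(b))]
    n = len(pairs)
    if n < 2:
        return math.nan  # too few complete pairs to estimate a correlation
    mean_x = sum(a for a, _ in pairs) / n
    mean_y = sum(b for _, b in pairs) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in pairs)
    sxx = sum((a - mean_x) ** 2 for a, _ in pairs)
    syy = sum((b - mean_y) ** 2 for _, b in pairs)
    if sxx == 0 or syy == 0:
        return math.nan  # a constant variable has no defined correlation
    return sxy / math.sqrt(sxx * syy)


def correlation_matrix(variables):
    """All-pairs correlation; `variables` is a list of equal-length lists."""
    k = len(variables)
    # Diagonal is set to 1.0 by convention in this sketch.
    result = [[1.0] * k for _ in range(k)]
    for i, j in combinations(range(k), 2):
        r = pairwise_pearson(variables[i], variables[j])
        result[i][j] = result[j][i] = r
    return result


# Hypothetical toy data: three variables, five samples, NaN marks a gap.
data = [
    [1.0, 2.0, math.nan, 4.0, 5.0],
    [2.0, 4.0, 6.0, math.nan, 10.0],
    [5.0, math.nan, 3.0, 2.0, 1.0],
]
matrix = correlation_matrix(data)
```

The loop over all variable pairs grows quadratically with the number of variables, and each pair needs its own missing-value mask. That combination is what makes runtime and memory the bottleneck on large datasets, and it is the kind of workload that compiled backends and parallelization are meant to speed up.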

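Main results:

NApy shows significant improvements in runtime and memory consumption.
It substantially outperforms existing tools and naive parallelization approaches.
It enables efficient on-the-fly statistical analysis for interactive applications.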

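Conclusions:

NApy offers a scalable and efficient solution for statistical testing on data with missing values.
The package enables real-time data analysis in interactive environments.
NApy is publicly available, which facilitates its adoption in research and industry.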