Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Detection of Gross Error: The Q Test

Detection of Gross Error: The Q Test

When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Causes of Similarity-Dissimilarity Effect

Causes of Similarity-Dissimilarity Effect

The similarity-dissimilarity effect, a fundamental concept in social psychology, explains how interpersonal similarities and differences influence attraction and social interactions. This effect is supported by three key psychological perspectives: balance theory, social comparison theory, and consensual validation.Balance Theory and Cognitive ConsistencyBalance theory, developed by Fritz Heider, posits that individuals seek cognitive consistency in their relationships. When two people share...

Difference from Background: Limit of Detection

Difference from Background: Limit of Detection

The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...

Types of Errors: Detection and Minimization

Types of Errors: Detection and Minimization

Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or...

Expected Frequencies in Goodness-of-Fit Tests

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Prognostic value of peri-operative circulating tumour DNA levels estimated by cell-free DNA methylation in patients with resectable colorectal liver metastases.

EBioMedicine·2026

Same author

Signpost Testing to Navigate the Parameter Space of the Gaussian Graphical Model With High-Dimensional Data.

Biometrical journal. Biometrische Zeitschrift·2026

Same author

Informative Co-Data Learning for High-Dimensional Horseshoe Regression.

Biometrical journal. Biometrische Zeitschrift·2025

Same author

Sparse Canonical Correlation Analysis for Multiple Measurements With Latent Trajectories.

Biometrical journal. Biometrische Zeitschrift·2025

Same author

Leveraging external information by guided adaptive shrinkage to improve variable selection in high-dimensional regression settings.

The international journal of biostatistics·2025

Same author

Alternatives to default shrinkage methods can improve prediction accuracy, calibration, and coverage: A methods comparison study.

Statistical methods in medical research·2025

Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026

Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026

Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026

Same journal

Beyond Fixed Thresholds: Optimizing Summaries of Wearable Device Data via Piecewise Linearization of Quantile Functions.

Statistics in medicine·2026

Same journal

A Causal Framework for Evaluating the Total Effect of Strategies Aiming to Expand Screening and to Improve Outcomes.

Statistics in medicine·2026

Same journal

Causal Effects on Nonterminal Event Time With Application to Antibiotic Usage and Future Resistance.

Statistics in medicine·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jan 14, 2026

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

在记录链接中错误发现估计.

Kayané Robach^1,2, Michel H Hof^1,2, Mark A van de Wiel^1,2

¹Department of Epidemiology and Data Science, Amsterdam UMC Location Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.

Statistics in medicine

|October 17, 2025

概括

此摘要是机器生成的。

本研究引入了一种新方法,通过使用合成数据来估计记录链接 (RL) 中的错误发现比例 (FDP). 这种方法提高了链接数据集的可靠性,这对于研究中准确的数据分析至关重要.

关键词:

错误发现比例错误发现比例错误链接错误链接记录链接记录链接

更多相关视频

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

An Integrated Workflow of Identification and Quantification on FDR Control-Based Untargeted Metabolome

An Integrated Workflow of Identification and Quantification on FDR Control-Based Untargeted Metabolome

Published on: September 20, 2022

相关实验视频

Last Updated: Jan 14, 2026

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

An Integrated Workflow of Identification and Quantification on FDR Control-Based Untargeted Metabolome

An Integrated Workflow of Identification and Quantification on FDR Control-Based Untargeted Metabolome

Published on: September 20, 2022

科学领域:

数据科学数据科学数据科学
生物统计学生物统计学
流行病学流行病学

背景情况:

整合多样化的数据集提供了研究优势,但由于隐私和各种收集方法,缺乏独特的标识符.
记录链接 (RL) 算法使用识别变量概率地链接记录,但不完美的匹配需要评估错误发现.
错误发现比例 (FDP) 对于在后续分析中验证链接数据可靠性至关重要.

研究的目的:

为两个重叠的数据集引入一种用于估计RL中的FDP的新方法.
提供一种可靠的方法来评估和提高各种RL技术和环境中链接数据的质量.
强调在医疗记录分析中考虑联系错误的重要性.

主要方法:

一种新的FDP估计方法,使用从实证分布与真实数据一起生成的合成数据.
合成记录,无法与真实实体联系起来,量化错误链接的对.
该方法适用于所有RL技术,特别是在具有差别区分变量的复杂场景中.

主要成果:

拟议的方法有效地估计了RL中的FDP,从而可以评估和改进链接数据的可靠性.
使用已建立的RL算法和基准数据集评估性能.
在荷兰围产阶段登记册中成功应用于连接兄弟姐妹,证实了其实际实用性.

结论:

开发的方法提供了一种可靠的方法来估计RL中的FDP,提高数据可靠性.
准确的FDP估计对于来自链接数据集的可靠研究结果至关重要.
考虑链接错误是必不可少的,特别是在敏感的医疗数据研究中,例如母婴动态.