Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

6.9K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
6.9K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.5K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.5K
Causes of Similarity-Dissimilarity Effect01:26

Causes of Similarity-Dissimilarity Effect

253
The similarity-dissimilarity effect, a fundamental concept in social psychology, explains how interpersonal similarities and differences influence attraction and social interactions. This effect is supported by three key psychological perspectives: balance theory, social comparison theory, and consensual validation.Balance Theory and Cognitive ConsistencyBalance theory, developed by Fritz Heider, posits that individuals seek cognitive consistency in their relationships. When two people share...
253
Difference from Background: Limit of Detection01:05

Difference from Background: Limit of Detection

8.0K
The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...
8.0K
Types of Errors: Detection and Minimization01:12

Types of Errors: Detection and Minimization

9.7K
Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or...
9.7K
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

7.2K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
7.2K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Prognostic value of peri-operative circulating tumour DNA levels estimated by cell-free DNA methylation in patients with resectable colorectal liver metastases.

EBioMedicine·2026
Same author

Signpost Testing to Navigate the Parameter Space of the Gaussian Graphical Model With High-Dimensional Data.

Biometrical journal. Biometrische Zeitschrift·2026
Same author

Informative Co-Data Learning for High-Dimensional Horseshoe Regression.

Biometrical journal. Biometrische Zeitschrift·2025
Same author

Sparse Canonical Correlation Analysis for Multiple Measurements With Latent Trajectories.

Biometrical journal. Biometrische Zeitschrift·2025
Same author

Leveraging external information by guided adaptive shrinkage to improve variable selection in high-dimensional regression settings.

The international journal of biostatistics·2025
Same author

Alternatives to default shrinkage methods can improve prediction accuracy, calibration, and coverage: A methods comparison study.

Statistical methods in medical research·2025
Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026
Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026
Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026
Same journal

Beyond Fixed Thresholds: Optimizing Summaries of Wearable Device Data via Piecewise Linearization of Quantile Functions.

Statistics in medicine·2026
Same journal

A Causal Framework for Evaluating the Total Effect of Strategies Aiming to Expand Screening and to Improve Outcomes.

Statistics in medicine·2026
Same journal

Causal Effects on Nonterminal Event Time With Application to Antibiotic Usage and Future Resistance.

Statistics in medicine·2026
查看所有相关文章

相关实验视频

Updated: Jan 14, 2026

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.7K

在记录链接中错误发现估计.

Kayané Robach1,2, Michel H Hof1,2, Mark A van de Wiel1,2

  • 1Department of Epidemiology and Data Science, Amsterdam UMC Location Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.

Statistics in medicine
|October 17, 2025
PubMed
概括
此摘要是机器生成的。

本研究引入了一种新方法,通过使用合成数据来估计记录链接 (RL) 中的错误发现比例 (FDP). 这种方法提高了链接数据集的可靠性,这对于研究中准确的数据分析至关重要.

关键词:
错误发现比例 错误发现比例错误链接 错误链接记录链接记录链接

更多相关视频

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.9K
An Integrated Workflow of Identification and Quantification on FDR Control-Based Untargeted Metabolome
05:35

An Integrated Workflow of Identification and Quantification on FDR Control-Based Untargeted Metabolome

Published on: September 20, 2022

4.2K

相关实验视频

Last Updated: Jan 14, 2026

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.7K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.9K
An Integrated Workflow of Identification and Quantification on FDR Control-Based Untargeted Metabolome
05:35

An Integrated Workflow of Identification and Quantification on FDR Control-Based Untargeted Metabolome

Published on: September 20, 2022

4.2K

科学领域:

  • 数据科学数据科学数据科学
  • 生物统计学 生物统计学
  • 流行病学 流行病学

背景情况:

  • 整合多样化的数据集提供了研究优势,但由于隐私和各种收集方法,缺乏独特的标识符.
  • 记录链接 (RL) 算法使用识别变量概率地链接记录,但不完美的匹配需要评估错误发现.
  • 错误发现比例 (FDP) 对于在后续分析中验证链接数据可靠性至关重要.

研究的目的:

  • 为两个重叠的数据集引入一种用于估计RL中的FDP的新方法.
  • 提供一种可靠的方法来评估和提高各种RL技术和环境中链接数据的质量.
  • 强调在医疗记录分析中考虑联系错误的重要性.

主要方法:

  • 一种新的FDP估计方法,使用从实证分布与真实数据一起生成的合成数据.
  • 合成记录,无法与真实实体联系起来,量化错误链接的对.
  • 该方法适用于所有RL技术,特别是在具有差别区分变量的复杂场景中.

主要成果:

  • 拟议的方法有效地估计了RL中的FDP,从而可以评估和改进链接数据的可靠性.
  • 使用已建立的RL算法和基准数据集评估性能.
  • 在荷兰围产阶段登记册中成功应用于连接兄弟姐妹,证实了其实际实用性.

结论:

  • 开发的方法提供了一种可靠的方法来估计RL中的FDP,提高数据可靠性.
  • 准确的FDP估计对于来自链接数据集的可靠研究结果至关重要.
  • 考虑链接错误是必不可少的,特别是在敏感的医疗数据研究中,例如母婴动态.