Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Errors In Hypothesis Tests01:14

Errors In Hypothesis Tests

When performing a hypothesis test, there are four possible outcomes depending on the actual truth (or falseness) of the null hypothesis and the decision to reject or not.
Accuracy and Errors in Hypothesis Testing01:13

Accuracy and Errors in Hypothesis Testing

Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5% chance...
Genome Copying Errors02:46

Genome Copying Errors

DNA replication is a well-evolved process that copies millions of base pairs with high fidelity during each cell division. Occasionally a wrong base or a long stretch of wrong bases may get added to the daughter strands. If the errors are left unchecked, cells might accumulate several mutations that might endanger their  survival. Therefore, the copying errors are checked and repaired at three levels.
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
Random and Systematic Errors01:20

Random and Systematic Errors

Scientists always try their best to record measurements with the utmost accuracy and precision. However, sometimes errors do occur. These errors can be random or systematic. Random errors are observed due to the inconsistency or fluctuation in the measurement process, or variations in the quantity itself that is being measured. Such errors fluctuate from being greater than or less than the true value in repeated measurements. Consider a scientist measuring the length of an earthworm using a...
Random and Systematic Errors01:20

Random and Systematic Errors

Scientists always try their best to record measurements with the utmost accuracy and precision. However, sometimes errors do occur. These errors can be random or systematic. Random errors are observed due to the inconsistency or fluctuation in the measurement process, or variations in the quantity itself that is being measured. Such errors fluctuate from being greater than or less than the true value in repeated measurements. Consider a scientist measuring the length of an earthworm using a...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Detection of cancer recurrence from Thai-English electronic medical records using sentence embeddings.

BMJ health & care informatics·2026
Same author

Developing a Thai User Interface Terminology for Systematized Nomenclature of Medicine Clinical Terms Implementation in Primary Care: Cross-Sectional Content Coverage Analysis.

JMIR medical informatics·2026
Same author

Finding digital health governance mechanism to support country's health systems: Thailand case study.

Oxford open digital health·2025
Same author

Community of Interoperability Labs: Pragmatic Approach to Achieving Interoperability.

Studies in health technology and informatics·2024
Same author

Design of a Vaccine Passport Validation System Using Blockchain-based Architecture: Development Study.

JMIR public health and surveillance·2022
Same author

Global Infectious Disease Surveillance and Case Tracking System for COVID-19: Development Study.

JMIR medical informatics·2020
Same journal

Sensitivity Analyses of a Scoring System for a Contraception Decision Aid.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Improving electronic health record processing of large language models via retrieval-augmented generation: A case study on dietary supplements.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Developing a User-Centered Mobile Application Prototype: Bridging Lower-Limb Fracture Care from Skilled Nursing Facility and Back to the Community.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Automating Adjudication of Cardiovascular Events Using Large Language Models.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Predictive Factors and State-Level Barriers to Postpartum Birth Control Usage in the United States: Insights from PRAMS Phase 8.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
See all related articles

Related Experiment Video

Updated: Jun 28, 2026

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering
09:43

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Published on: November 22, 2019

Creating a large database test bed with typographical errors for record linkage evaluation.

Nawanan Theera-Ampornpunt1, Boonchai Kijsanayotin, Stuart M Speedie

  • 1Health Informatics, University of Minnesota, Minneapolis, Minnesota, USA.

AMIA ... Annual Symposium Proceedings. AMIA Symposium
|November 13, 2008
PubMed
Summary
This summary is machine-generated.

Researchers developed a large, realistic database for testing record linkage algorithms. This dataset accurately reflects real-world data, including common data entry errors, ensuring reliable algorithm evaluation.

More Related Videos

Generation of Comprehensive Thoracic Oncology Database - Tool for Translational Research
11:18

Generation of Comprehensive Thoracic Oncology Database - Tool for Translational Research

Published on: January 22, 2011

Related Experiment Videos

Last Updated: Jun 28, 2026

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering
09:43

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Published on: November 22, 2019

Generation of Comprehensive Thoracic Oncology Database - Tool for Translational Research
11:18

Generation of Comprehensive Thoracic Oncology Database - Tool for Translational Research

Published on: January 22, 2011

Area of Science:

  • Computer Science
  • Data Science
  • Bioinformatics

Background:

  • Accurate record linkage is crucial for data analysis and integration.
  • Existing test beds may not fully represent real-world data complexities.
  • Evaluating record linkage algorithms necessitates representative datasets.

Purpose of the Study:

  • To create a large-scale, representative database for evaluating record linkage algorithms.
  • To simulate real-world data characteristics, including demographic distributions and typographical errors.
  • To provide a reliable test bed for assessing the performance of various record linkage methods.

Main Methods:

  • Constructed a large database mirroring typical population demographics.
  • Introduced common typographical errors characteristic of data entry.
  • Ensured the database reflects real-world data complexity for algorithm testing.

Main Results:

  • Successfully created a large database representative of real-world data.
  • The database incorporates realistic demographic distributions and common data entry errors.
  • The developed test bed is suitable for high-confidence evaluation of record linkage algorithms.

Conclusions:

  • The created database serves as a valuable and reliable test bed for record linkage algorithm evaluation.
  • This resource enables more accurate assessment of algorithm performance in realistic scenarios.
  • Facilitates advancements in data linkage methodologies through robust testing.