Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Types of Errors: Detection and Minimization01:12

Types of Errors: Detection and Minimization

Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or...
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
Genome Copying Errors02:46

Genome Copying Errors

DNA replication is a well-evolved process that copies millions of base pairs with high fidelity during each cell division. Occasionally a wrong base or a long stretch of wrong bases may get added to the daughter strands. If the errors are left unchecked, cells might accumulate several mutations that might endanger their  survival. Therefore, the copying errors are checked and repaired at three levels.
Ligand Binding and Linkage00:49

Ligand Binding and Linkage

Allosteric proteins have more than one ligand binding site; the binding of a ligand to any of these sites influences the binding of ligands to the other sites. When a protein is allosteric, its binding sites are called coupled or linked.  In the case of enzymes, the site that binds to the substrate is known as the active site and the other site is known as the regulatory site. When a ligand binds to the regulatory site, this leads to conformational changes in the protein that can influence the...
Ligand Binding and Linkage00:49

Ligand Binding and Linkage

Allosteric proteins have more than one ligand binding site; the binding of a ligand to any of these sites influences the binding of ligands to the other sites. When a protein is allosteric, its binding sites are called coupled or linked.  In the case of enzymes, the site that binds to the substrate is known as the active site and the other site is known as the regulatory site. When a ligand binds to the regulatory site, this leads to conformational changes in the protein that can influence the...
Mismatch Repair01:36

Mismatch Repair

Overview

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Antiseptic Cleansing to Reduce Vertical Transmission of Pathogens to Neonates: The NeoVT-AMR Randomized Clinical Trial.

JAMA network open·2026
Same author

CMV viraemia is associated with mortality among children with HIV starting antiretroviral therapy in sub-Saharan Africa.

Clinical infectious diseases : an official publication of the Infectious Diseases Society of America·2026
Same author

Intravenous Artesunate in Artemisinin-Resistant Severe Malaria in Uganda.

The New England journal of medicine·2026
Same author

Multicenter Validation of Clinical Sepsis Phenotypes.

JAMA network open·2026
Same author

Impact of co-amoxiclav versus amoxicillin on the gastrointestinal microbiota in sub-Saharan children hospitalized with pneumonia.

npj antimicrobials and resistance·2026
Same author

How to design a ROCI (Response Over Continuous Intervention) randomised trial: guidance and a case study.

BMC medical research methodology·2026
Same journal

Established machine learning matches tabular foundation models in clinical predictions.

BMC medical informatics and decision making·2026
Same journal

Explainable AI machine learning framework for chronic kidney disease prediction utilizing electronic health records.

BMC medical informatics and decision making·2026
Same journal

Interpretable SHAP-based machine learning framework for patient satisfaction prediction: a case study in Thammasat University Hospital.

BMC medical informatics and decision making·2026
Same journal

Automated generation of structured breast ultrasound reports using BreastViT and ChatGPT.

BMC medical informatics and decision making·2026
Same journal

Shared decision-making and medication adherence among community adults with chronic diseases: a cross-sectional study in Hubei Province, China.

BMC medical informatics and decision making·2026
Same journal

Classification of periapical radiographic findings for root canal therapy decision support using deep neural networks.

BMC medical informatics and decision making·2026
See all related articles

Related Experiment Video

Updated: Jun 4, 2026

Generating Strictly Controlled Stimuli for Figure Recognition Experiments
05:39

Generating Strictly Controlled Stimuli for Figure Recognition Experiments

Published on: March 18, 2019

An efficient record linkage scheme using graphical analysis for identifier error detection.

John M Finney1, A Sarah Walker, Tim E A Peto

  • 1NIHR Biomedical Research Centre, John Radcliffe Hospital, Oxford, UK.

BMC Medical Informatics and Decision Making
|February 3, 2011
PubMed
Summary
This summary is machine-generated.

This study presents a fast, two-step algorithm for linking millions of health records, achieving high accuracy in patient record identification. The method efficiently clusters large datasets, improving healthcare data integration.

Related Experiment Videos

Last Updated: Jun 4, 2026

Generating Strictly Controlled Stimuli for Figure Recognition Experiments
05:39

Generating Strictly Controlled Stimuli for Figure Recognition Experiments

Published on: March 18, 2019

Area of Science:

  • Health Informatics
  • Data Science
  • Computational Epidemiology

Background:

  • Accurate patient record linkage is crucial for healthcare delivery, epidemiology, and business intelligence.
  • Linking large datasets is challenging due to incomplete and error-prone identifiers, such as National Health Service (NHS) numbers.

Purpose of the Study:

  • To develop and evaluate a novel two-step algorithm for efficient and accurate large-scale record linkage.
  • To address the challenges of integrating disparate health information systems.

Main Methods:

  • A two-step record linkage algorithm was implemented, starting with high-cardinality identifier matching.
  • A graph-based algorithm was used for partitioning clusters and detecting erroneous identifiers.

Main Results:

  • The algorithm successfully clustered over 250 million health records from five data sources in approximately 30 minutes.
  • Generated 3.6 million clusters with 99.8% high likelihood of containing records from a single patient.
  • The method demonstrated computational efficiency for large-scale data linkage.

Conclusions:

  • The described technique provides a simple, fast, and highly efficient two-step method for initial large-scale record linkage.
  • The approach is particularly suitable for health records within the UK's National Health Service (NHS).
  • A potential limitation is the reliance on exact matching for initial cluster formation, which may impact databases with low identifier quality.