Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

11.9K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.9K
Language Development01:22

Language Development

450
Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...
450
Stereotype Content Model02:16

Stereotype Content Model

14.9K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
14.9K
Masking and Demasking Agents01:19

Masking and Demasking Agents

2.7K
EDTA titrations may necessitate masking and demasking agents to temporarily protect a particular metal ion in a mixture from the EDTA reaction. These agents facilitate the sequential analysis of the metal ions by forming stable complexes with some—but not all—metal ions during certain steps.
There are many masking agents, such as cyanide, fluoride, triethanolamine, thiourea, and 2,3-bis(sulfanyl)propan-1-ol (formerly 2,3-dimercapto-1-propanol), with the masking agent chosen based on...
2.7K
Deindividuation00:57

Deindividuation

27.3K
Deindividuation is a form of social influence on an individual’s behavior such that the individual engages in unusual or non-normal behavior while in a group setting. Why? Because in these group settings, the individual no longer sees themselves as an individual anymore, disinhibiting their behavior and personal restraint.
27.3K
Language01:16

Language

425
Language is a unique communication system that uses words and systematic rules to organize and transmit information. Unlike other forms of communication, which may involve postures, movements, odors, or vocalizations, language relies on symbols and grammar. This makes human communication distinct from that of other species, who also communicate but do not use language in the same way humans do.
Corballis and Suddendorf (2007) and Tomasello and Rakoczy (2003) highlight the role of language in...
425

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Trends in Invasive Interventions and Risk Factors for Early Critical Events in Multiple System Atrophy.

Cerebellum (London, England)·2025
Same author

Evaluation of the Japanese-English Mapping in the Adverse Event Terminology for Medical Devices.

Studies in health technology and informatics·2025
Same author

Developing and Assessing the Technical Term Extraction Tools for Teaching Clinical Trial Protocols.

Studies in health technology and informatics·2025
Same author

A Pilot Study of Educational System Review to Keep the Record Against a Problem Employee in a Clinical Trial Datacenter.

Studies in health technology and informatics·2025
Same author

Developing artificial intelligence tools for institutional review board pre-review: A pilot study on ChatGPT's accuracy and reproducibility.

PLOS digital health·2025
Same author

Multicenter phase II trial of trastuzumab and docetaxel for HER2-positive salivary gland cancer.

Japanese journal of clinical oncology·2025
Same journal

The Essential Components and Critical Conditions for Success in a Learning Health System in Oncology.

Studies in health technology and informatics·2026
Same journal

Use of Artificial Intelligence in Screening for Adolescent Idiopathic Scoliosis: A Scoping Review.

Studies in health technology and informatics·2026
Same journal

Movement Related Biomechanics in Adolescent Idiopathic Scoliosis: A Review of Reviews.

Studies in health technology and informatics·2026
Same journal

The Impact of Surgical Correction of Adolescent Idiopathic Scoliosis Using Posterior Spinal Fusion on Selected Radiological Parameters and Respiratory Function.

Studies in health technology and informatics·2026
Same journal

Acute Effect of Physio-logic® Exercises on Muscle Tone and Stiffness in Adolescent Idiopathic Scoliosis Patients: A Preliminary Study.

Studies in health technology and informatics·2026
Same journal

Effects of Integrated Music and Occupational Therapy on Motor and Autonomic Function in Children with Neurogenic Scoliosis.

Studies in health technology and informatics·2026
See all related articles

Related Experiment Video

Updated: Sep 12, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

682

Comparing the Accuracy of Deidentification in Japanese Text Using Large Language Models.

Ayako Yagahara1, Haluna Mori1, Naoki Nishimoto2

  • 1Department of Radiological Technology, Hokkaido University of Science.

Studies in Health Technology and Informatics
|August 8, 2025
PubMed
Summary
This summary is machine-generated.

This study evaluated large language models (LLMs) for extracting personal information. BERT demonstrated the highest accuracy in identifying names and locations, making it ideal for deidentification tools.

Keywords:
BERTChatGPTDeidentification

More Related Videos

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application
05:56

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Published on: April 14, 2023

2.6K
Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.3K

Related Experiment Videos

Last Updated: Sep 12, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

682
Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application
05:56

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Published on: April 14, 2023

2.6K
Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.3K

Area of Science:

  • Natural Language Processing
  • Data Deidentification
  • Machine Learning

Background:

  • Accurate extraction of personal information is crucial for data privacy.
  • Large Language Models (LLMs) show potential for automated information extraction.
  • Developing effective deidentification tools is essential for handling sensitive data.

Purpose of the Study:

  • To evaluate the accuracy of different LLMs in extracting personal, facility, and place names.
  • To assess the feasibility of using LLMs for developing an in-house deidentification tool.
  • To compare the performance of BERT, GPT3.5, and GPT4o-mini for named entity recognition.

Main Methods:

  • Three LLMs (BERT, GPT3.5, GPT4o-mini) were employed for information extraction.
  • A pilot study analyzed 20 Japanese newspaper articles.
  • Extraction accuracy was measured using F1-scores for personal names, facility names, and place names.

Main Results:

  • BERT achieved the highest overall F1-score of 0.94.
  • BERT demonstrated superior performance in extracting personal names (0.99), facility names (0.89), and place names (0.93).
  • GPT3.5 and GPT4o-mini showed lower accuracy compared to BERT.

Conclusions:

  • BERT is the most effective LLM for the automatic extraction of personal information among the models tested.
  • The findings support the use of BERT for developing robust deidentification tools for clinical data.
  • Further research can explore BERT's application in diverse datasets for enhanced data privacy.