Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mouse Models of Cancer Study02:43

Mouse Models of Cancer Study

Mice have long served as models for studying human biology and pathology because of their phylogenetic and physiological similarity with humans. They are also easy to maintain and breed in the laboratory, and hence, many inbred strains are now available for research. Studies on mice have contributed immeasurably to our understanding of cancer biology.
The development of transgenic, knockout, and knock-in mice has led to an exponential increase in their use as model organisms in research,...
Classification of Illness01:17

Classification of Illness

The meaning of illness is individualized to each person who experiences an alteration in health. In contrast, disease is a medical term indicating a pathological change in the structure and function of the body or mind. It is a condition that has specific symptoms and boundaries.
An illness is a response to a disease in which the person's level of functioning is changed compared with a previous level. The general classification of illness includes acute and chronic.
Acute illness is severe and...
Mouse Models of Cancer Study02:43

Mouse Models of Cancer Study

Mice have long served as models for studying human biology and pathology because of their phylogenetic and physiological similarity with humans. They are also easy to maintain and breed in the laboratory, and hence, many inbred strains are now available for research. Studies on mice have contributed immeasurably to our understanding of cancer biology.
The development of transgenic, knockout, and knock-in mice has led to an exponential increase in their use as model organisms in research,...
Cancer Survival Analysis01:21

Cancer Survival Analysis

Cancer survival analysis focuses on quantifying and interpreting the time from a key starting point, such as diagnosis or the initiation of treatment, to a specific endpoint, such as remission or death. This analysis provides critical insights into treatment effectiveness and factors that influence patient outcomes, helping to shape clinical decisions and guide prognostic evaluations. A cornerstone of oncology research, survival analysis tackles the challenges of skewed, non-normally...
Statistical Software for Data Analysis and Clinical Trials01:12

Statistical Software for Data Analysis and Clinical Trials

Statistical software is pivotal in data analysis and clinical trials by providing tools to analyze data, draw conclusions, and make predictions. These software packages range from simple data management applications to complex analytical platforms, supporting various statistical tests, models, and simulation techniques. Their significance lies in their ability to handle vast amounts of data with precision and efficiency, enabling researchers to validate hypotheses, identify trends, and make...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cost-effectiveness and health impact of gender-neutral and single-dose HPV vaccination in Hong Kong: a modeling analysis.

The Lancet regional health. Western Pacific·2026
Same author

Efficacy and effectiveness of long-acting monoclonal antibodies and vaccines against RSV: a systematic review and meta-analysis of studies from 2018 to 2025.

EClinicalMedicine·2026
Same author

Patient Preferences for Lipid-Lowering Agents in Hypercholesterolemia: A Discrete Choice Experiment.

Value in health regional issues·2026
Same author

Type 2 Diabetes Is Associated With Increased Complications and Mortality After Hip Fracture in Older Adults Aged 60 Years or Older.

Diabetes, obesity & metabolism·2026
Same author

Disruption of CTCF binding by germline non-coding variants in <i>CDKN2B</i> suppress <i>CDKN2A</i> expression and predispose to melanoma.

medRxiv : the preprint server for health sciences·2026
Same author

Global Sentiment Toward Health AI at the Dawn of the ChatGPT Era: Empirical Analysis of Twitter (X) Discourse.

Journal of medical Internet research·2026
Same journal

Enhancing anatomical recognition by surgeons during pelvic lymph node dissection using artificial intelligence.

NPJ digital medicine·2026
Same journal

AFP assistant: a retrieval-augmented generation and large language model-powered multilingual polio chatbot for low-resource language communities.

NPJ digital medicine·2026
Same journal

Structured reasoning failures compromise LLM interpretation of clinical oncology notes.

NPJ digital medicine·2026
Same journal

Translation of frozen sections into FFPE images for skin cancer resection margins using generative AI.

NPJ digital medicine·2026
Same journal

FedFound: a federated foundation model for lifespan brain morphological connectome analysis.

NPJ digital medicine·2026
Same journal

A multimodal instruction dataset and benchmark for ultrasound understanding.

NPJ digital medicine·2026
See all related articles

Related Experiment Video

Updated: May 8, 2026

Computer-Aided Three-Dimensional Visualization in the Treatment of Locally Advanced Thyroid Cancer
03:55

Computer-Aided Three-Dimensional Visualization in the Treatment of Locally Advanced Thyroid Cancer

Published on: June 9, 2023

472

Developing a named entity framework for thyroid cancer staging and risk level classification using large language

Matrix M H Fung1, Eric H M Tang2,3, Tingting Wu2

  • 1Division of Endocrine Surgery, Department of Surgery, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China.

NPJ Digital Medicine
|March 2, 2025
PubMed
Summary
This summary is machine-generated.

We created a framework using Large Language Models (LLMs) to extract cancer staging and risk information from thyroid cancer clinical notes. This approach efficiently and accurately classifies well-differentiated thyroid cancer patients.

More Related Videos

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.7K
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

475

Related Experiment Videos

Last Updated: May 8, 2026

Computer-Aided Three-Dimensional Visualization in the Treatment of Locally Advanced Thyroid Cancer
03:55

Computer-Aided Three-Dimensional Visualization in the Treatment of Locally Advanced Thyroid Cancer

Published on: June 9, 2023

472
A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.7K
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

475

Area of Science:

  • Oncology
  • Medical Informatics
  • Artificial Intelligence

Background:

  • Classifying well-differentiated thyroid cancer staging and risk is crucial for patient management.
  • Extracting this information from clinical notes can be challenging due to semi-structured data.
  • Existing methods may lack efficiency and accuracy in processing large datasets.

Purpose of the Study:

  • To develop a named entity (NE) framework for information extraction from The Cancer Genome Atlas-Thyroid Cancer (TCGA-THCA) database.
  • To evaluate Large Language Models (LLMs) for classifying American Joint Committee on Cancer (AJCC) staging and American Thyroid Association (ATA) risk categories.
  • To optimize the efficiency and accuracy of classifying thyroid cancer stage and risk.

Main Methods:

  • Developed an NE framework including annotation guidelines, ground truth labeling, prompting strategies, and evaluation codes.
  • Utilized four LLMs (Mistral-7B-Instruct, Llama-3.1-8B-Instruct, Gemma-2-9B-Instruct, Qwen2.5-7B-Instruct) for offline information extraction.
  • Employed an ensemble-like majority-vote strategy for classification, validated on TCGA-THCA pathology notes and pseudo-clinical cases.

Main Results:

  • The NE framework was developed and validated using 50 and 289 TCGA-THCA notes, respectively.
  • An ensemble strategy achieved satisfactory performance in classifying AJCC staging and ATA risk categories.
  • The developed framework and classifier demonstrated optimized efficiency and accuracy.

Conclusions:

  • The proposed NE framework and LLM-based ensemble classifier effectively extract and classify critical clinical information for thyroid cancer.
  • This approach enhances the accuracy and efficiency of determining AJCC staging and ATA risk categories.
  • The study provides a valuable tool for analyzing large-scale thyroid cancer data.