Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

11.5K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.5K
Classification of Illness01:17

Classification of Illness

7.6K
The meaning of illness is individualized to each person who experiences an alteration in health. In contrast, disease is a medical term indicating a pathological change in the structure and function of the body or mind. It is a condition that has specific symptoms and boundaries.
An illness is a response to a disease in which the person's level of functioning is changed compared with a previous level. The general classification of illness includes acute and chronic.
Acute illness is severe...
7.6K
Classification of Systems-II01:31

Classification of Systems-II

171
Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,
171
Classification of Systems-I01:26

Classification of Systems-I

211
Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
211
Classification of Neurotransmitters01:30

Classification of Neurotransmitters

3.0K
Neurotransmitters play a crucial role in the communication between neurons in the autonomic nervous system. Neurons in the autonomic nervous system can be cholinergic or adrenergic depending on the neurotransmitters synthesized. Cholinergic neurons use acetylcholine as their primary neurotransmitter. This includes all the preganglionic fibers of the sympathetic and pre- and postganglionic fibers of the parasympathetic nervous systems. In addition, neurons of the somatic nervous system also use...
3.0K
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

604
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
604

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Methodological innovations to advance substance use disorder research: Proceedings of a NIDA workshop on target trial emulation and translational testing of digital health tools.

Journal of substance use and addiction treatment·2026
Same author

Challenges in AI Based Tumor Board Case Summarization and Recommendations.

Research square·2026
Same author

Automated Identification of Surgical Site Infections From Electronic Medical Records: Retrospective Observational Predictive Modeling Study.

JMIR perioperative medicine·2026
Same author

Exploring Phosphatidylethanol Cutoffs for Self-Reported Unhealthy Alcohol Use: An International Multi-Site Analysis.

Alcohol, clinical & experimental research·2026
Same author

End-to-end extraction of temporal information from psychiatric discharge summaries.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

Don't stop the heart: a performance analysis of large language models and potassium dosing.

medRxiv : the preprint server for health sciences·2026
Same journal

Digital divide in clinical and operational artificial intelligence adoption and implementation stages: US hospital diffusion patterns and AI deserts.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Extending the fundamental theorem of biomedical informatics: a proposal and illustrative examples.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Human factors methods for designing safe health information technology: what do the experts think?

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Equity-by-design for socially assistive robots as digital health tools.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Orchestrator multi-agent clinical decision support system for secondary headache diagnosis in primary care.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

CUI-Curate: a GraphRAG-based framework for automated clinical concept curation for NLP applications.

Journal of the American Medical Informatics Association : JAMIA·2026
See all related articles

Related Experiment Video

Updated: Jul 16, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

798

Improving model transferability for clinical note section classification models using continued pretraining.

Weipeng Zhou1, Meliha Yetisgen1, Majid Afshar2

  • 1Department of Biomedical Informatics and Medical Education, School of Medicine, University of Washington-Seattle, Seattle, WA, United States.

Journal of the American Medical Informatics Association : JAMIA
|September 19, 2023
PubMed
Summary
This summary is machine-generated.

Continued pretraining enhances clinical note section classification transferability. This method, combined with a small amount of in-domain data, significantly improves model performance across different institutions.

Keywords:
continued pretrainingnatural language processingsection classificationtext classificationtransfer learning

More Related Videos

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.5K
Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application
05:56

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Published on: April 14, 2023

2.5K

Related Experiment Videos

Last Updated: Jul 16, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

798
Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.5K
Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application
05:56

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Published on: April 14, 2023

2.5K

Area of Science:

  • Natural Language Processing (NLP)
  • Machine Learning in Healthcare
  • Clinical Informatics

Background:

  • Accurate classification of clinical note sections (e.g., Subjective, Objective, Assessment, Plan - SOAP) is crucial for downstream NLP tasks like social determinants of health and temporal information extraction.
  • Clinical note section classification models often exhibit poor transferability across different healthcare institutions, leading to significant accuracy degradation.
  • Existing methods struggle with cross-domain generalization, even with advanced neural network architectures.

Purpose of the Study:

  • To develop and evaluate methods for classifying clinical note sections within the SOAP framework that demonstrate improved transferability across institutions.
  • To investigate the impact of continued pretraining strategies on the cross-domain performance of clinical note section classification models.

Main Methods:

  • Baseline models were fine-tuned using BERT-based architectures.
  • Continued pretraining techniques, including domain-adaptive pretraining and task-adaptive pretraining, were employed to enhance model transferability.
  • The effect of incorporating in-domain annotated samples during fine-tuning was analyzed across varying sample sizes.
  • The performance improvement from continued pretraining was quantified in terms of the equivalent number of in-domain annotated samples.

Main Results:

  • Continued pretraining alone did not significantly improve model performance.
  • When combined with a small number of in-domain annotated samples, continued pretraining substantially improved the F1 score from 0.756 to 0.808 (averaged across 3 datasets).
  • This performance gain was found to be equivalent to adding approximately 35 in-domain annotated samples.

Conclusions:

  • Continued pretraining is an effective strategy for enhancing the transferability of clinical note section classification models, particularly in cross-domain scenarios.
  • The benefits of continued pretraining are most pronounced when augmented with even a limited amount of in-domain labeled data.
  • This approach offers a viable method to improve the generalizability of NLP models in healthcare settings without requiring extensive in-domain data annotation.