Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: May 2, 2026

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.6K

Fine-Tuning and Benchmarking Transformer Models for Multiclass Classification of Clinical Research Papers:

Fangwen Zhou1, Cynthia Lokker1, Rick Parrish1

  • 1Department of Health Research Methods, Evidence, and Impact, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada.

JMIR AI
|April 30, 2026
PubMed
Summary

Related Concept Videos

Transformers with Off-Nominal Turns Ratios01:25

Transformers with Off-Nominal Turns Ratios

739
In scenarios involving parallel transformers with disparate ratings, developing per-unit models requires accommodating off-nominal turns ratios. This situation arises when the selected base voltages are not proportional to the transformer’s voltage ratings. Consider a transformer where the rated voltages are related by the term a. If the chosen voltage bases satisfy a relationship involving term b, term c is defined as the ratio of these bases. This ratio is then substituted into the...
739

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Understanding Transformer-Based Classifications of Medical Text Using a Large Language Model for the Attribution of Feature Importance: Proof-of-Concept Algorithm Development and Validation Study.

JMIR medical informatics·2026
Same author

Zero-shot interpretable biomedical literature appraisal with generative large language models.

JAMIA open·2026
Same author

What You May Have Missed in 2025.

Annals of internal medicine·2026
Same author

Attitudes of medical and life sciences university students and postdoctoral fellows toward AI chatbots in education: an international cross-sectional survey.

Scientific reports·2026
Same author

Evaluation of the Burden of Bone Fractures in People Living With Haemophilia: A Registry-Based Matched Cohort Study.

Haemophilia : the official journal of the World Federation of Hemophilia·2026
Same author

GRADE Guidance: Update on Developing Good Practice Statements in Guidelines.

Annals of internal medicine·2026
Same journal

Supporting Radiology Resident Education and Clinical Decision-Making With Large Language Models: Comparative Study of Reasoning Models DeepSeek-R1 and ChatGPT-o1.

JMIR AI·2026
Same journal

Patient Perceptions on the Use of Artificial Intelligence in Creating Clinical Research Documents: Survey Study.

JMIR AI·2026
Same journal

Application of Language Models for the Analysis of Adverse Drug Events in Pharmaceutical Research and Development: Scoping Review.

JMIR AI·2026
Same journal

Correction: Deep Learning for Age Estimation and Sex Prediction Using Mandibular-Cropped Cephalometric Images: Comparative Model Development and Validation Study.

JMIR AI·2026
Same journal

AI-Assisted Systematic Literature Review of the Economic Burden of Pneumococcal Disease: Development and Validation Study.

JMIR AI·2026
Same journal

Knowledge-Augmented Large Language Model for Multimodal Electronic Health Record-Based Risk Prediction: Development and Validation Study.

JMIR AI·2026
See all related articles
This summary is machine-generated.

Fine-tuned transformer models, particularly BioBERT, excel at multiclass classification of clinical literature. Optimal hyperparameter tuning is key for robust performance in evidence synthesis and knowledge translation.

Area of Science:

  • Natural Language Processing
  • Machine Learning in Healthcare
  • Biomedical Informatics

Background:

  • The rapid growth of digital health information necessitates efficient text classification.
  • Multiclass classification of study types is crucial for evidence synthesis but remains under-explored.
  • Transformer models offer promise for enhancing knowledge translation workflows.

Purpose of the Study:

  • To fine-tune and evaluate domain-specific transformer models for multiclass classification of clinical literature.
  • To categorize papers into original studies, reviews, guidelines, and nonexperimental studies.
  • To identify optimal model configurations for accurate literature classification.

Main Methods:

  • Fine-tuned seven transformer models on the McMaster PLUS dataset (162,380 papers).
Keywords:
classificationdeep learninginformation sciencemedical informaticsnatural language processing

Related Experiment Videos

Last Updated: May 2, 2026

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.6K
  • Utilized a comprehensive grid search (1890 configurations) for hyperparameter optimization (class weight, learning rate, batch size, etc.).
  • Assessed models using 10 metrics, including AUROC, F1-score, and MCC, with external validation on the Clinical Hedges dataset.
  • Main Results:

    • Top models achieved macro AUROC ≥0.99, F1-score ≥0.89, and MCC ≥0.88.
    • BioBERT-based models demonstrated superior calibration, especially for original studies and reviews.
    • Models struggled with nonexperimental and guideline studies, likely due to class imbalance and heterogeneity.

    Conclusions:

    • Fine-tuned transformer models, especially BioBERT, are effective for multiclass clinical literature classification.
    • Hyperparameter optimization is critical for achieving robust model performance.
    • Future work should explore methods to address class imbalance for improved classification of all study types.