Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Applying AI to Support Categorization of Heterogeneous Epidemiological Datasets.

Julia Sasse1, Guillaume Fabre2, Isabel Fortier2

  • 1ZB MED - Information Centre for Life Sciences, Cologne, Germany, https://ror.org/0259fwx54.

Studies in Health Technology and Informatics
|May 17, 2025
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cohort profile: Swiss personalized health network cohort consortium.

European journal of epidemiology·2026
Same author

FAIRification on Trial? A Provenance and Reproducible Metadata Build Pipeline for Clinical Research.

Studies in health technology and informatics·2026
Same author

Prospective harmonisation of four international randomised controlled trials in Canada, China, India and South Africa: the Healthy Life Trajectories Initiative.

BMJ open·2025
Same author

NFDI4Health Workflow and Service for Synthetic Data Generation, Assessment and Risk Management.

Studies in health technology and informatics·2024
Same author

The German Central Health Study Hub - A Service to Find and Publish Clinical, Public Health and Epidemiolocal Studies and Associated Documents.

Studies in health technology and informatics·2024
Same author

Privacy Risk Assessment for Synthetic Longitudinal Health Data.

Studies in health technology and informatics·2024
Same journal

The Essential Components and Critical Conditions for Success in a Learning Health System in Oncology.

Studies in health technology and informatics·2026
Same journal

Use of Artificial Intelligence in Screening for Adolescent Idiopathic Scoliosis: A Scoping Review.

Studies in health technology and informatics·2026
Same journal

Movement Related Biomechanics in Adolescent Idiopathic Scoliosis: A Review of Reviews.

Studies in health technology and informatics·2026
Same journal

The Impact of Surgical Correction of Adolescent Idiopathic Scoliosis Using Posterior Spinal Fusion on Selected Radiological Parameters and Respiratory Function.

Studies in health technology and informatics·2026
Same journal

Acute Effect of Physio-logic® Exercises on Muscle Tone and Stiffness in Adolescent Idiopathic Scoliosis Patients: A Preliminary Study.

Studies in health technology and informatics·2026
Same journal

Effects of Integrated Music and Occupational Therapy on Motor and Autonomic Function in Children with Neurogenic Scoliosis.

Studies in health technology and informatics·2026
See all related articles

An AI tool enhances health data reuse by automatically classifying study variables. This improves data findability and interoperability, accelerating research and reducing curator workload.

Area of Science:

  • Health Informatics
  • Data Science
  • Biomedical Research

Background:

  • The increasing importance of Findable, Accessible, Interoperable, and Reusable (FAIR) data in research.
  • NFDI4Health aims to improve health data findability, reusability, and interoperability for epidemiological, clinical, and public health studies.
  • Existing platforms like the German Central Health Study Hub and Maelstrom Catalog use standardized categorization for data reuse.

Purpose of the Study:

  • To present an AI solution for automatic classification and annotation of health study variables.
  • To integrate this AI solution into the NFDI4Health Metadata Annotation Workbench.
  • To enhance data findability and reuse through accelerated and improved variable categorization.

Main Methods:

  • Development of a BioBERT-based text classifier for automatic classification and annotation.
Keywords:
BERTFAIR dataMachine learninginteroperabilitymetadata annotationtext classification

Related Experiment Videos

  • Integration of the AI model into the NFDI4Health Metadata Annotation Workbench service.
  • Evaluation of the model's performance using a weighted F1-score.
  • Main Results:

    • The BioBERT-based text classifier achieved a weighted F1-score exceeding 92%.
    • The AI solution demonstrated improved annotation performance, especially for non-expert users.
    • Accelerated categorization of study variables was observed, enhancing data findability and reuse.

    Conclusions:

    • The AI solution significantly accelerates the categorization of study variables, boosting data findability and reuse.
    • Further development of AI approaches is expected to reduce curatorial workload.
    • The AI tool promotes the creation of semantically annotated, interoperable data catalogs, advancing FAIR data principles in health research.