Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

MALDI-TOF Mass Spectrometry01:19

MALDI-TOF Mass Spectrometry

6.5K
Mass spectrometry is a powerful characterization technique that can identify and separate a wide variety of compounds ranging from chemical to biological entities, based on their mass-to-charge ratio (m/z). The instruments that allow this detection, known as mass spectrometers, have three components: an ion source, a mass analyzer, and a detector. These spectrometers differ based on the nature of their ion source and analyzers.Matrix-assisted laser desorption ionization (MALDI) is a commonly...
6.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Structured reasoning failures compromise LLM interpretation of clinical oncology notes.

NPJ digital medicine·2026
Same author

Fine-Tuning, Retrieval-Augmented Generation, and Hybrid Large Language Models for Postoperative Decision Support: A Comparative Analysis.

Journal of medical Internet research·2026
Same author

Federated target trial emulation for time-to-event outcomes via POLARIS: Pooled-equivalent One-shot Likelihood Aggregation for Real-world Inference in Survival.

Research square·2026
Same author

Optimizing Retrieval-Augmented Generation (RAG) in clinical medicine: methods and performance evaluation.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

From "negative" trial to positive clinical impact: mitigating eligibility criteria-induced temporal selection bias in emulated clinical trials.

npj health systems·2026
Same author

AI-Generated Avatar Videos for Postoperative Patient Education Among Health Care Workers: Pilot Randomized Controlled Trial.

JMIR perioperative medicine·2026
Same journal

A hybrid approach for diabetic retinopathy stages classification using spatial and textural features.

Health informatics journal·2026
Same journal

Integrating platform usage into the comprehensive model of information seeking: Health information seeking on WeChat among Chinese young adults.

Health informatics journal·2026
Same journal

The impact of telehealth on patient-centered communication during the COVID-19 pandemic.

Health informatics journal·2026
Same journal

Evaluating the quality and reliability of short videos about tongue cancer on TikTok: A cross-sectional study.

Health informatics journal·2026
Same journal

Needs assessment and development of an EMR-integrated AI system to enhance nursing handover: NurSync.

Health informatics journal·2026
Same journal

Editorial: AI and robotics for the smart hospitals of the future.

Health informatics journal·2026
See all related articles

Related Experiment Video

Updated: Jan 18, 2026

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.3K

Recognizing software names in biomedical literature using machine learning.

Qiang Wei, Yaoyun Zhang, Muhammad Amith1

  • 1The University of Texas Health Science Center at Houston, USA.

Health Informatics Journal
|October 1, 2019
PubMed
Summary
This summary is machine-generated.

This study developed machine learning models to automatically identify biomedical software from research papers, creating a large software catalog. These natural language processing methods offer a scalable solution for indexing scientific software.

Keywords:
biomedical literaturebiomedical softwarebiomedical software indexnamed entity recognitionnatural language processing

More Related Videos

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.4K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K

Related Experiment Videos

Last Updated: Jan 18, 2026

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.3K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.4K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K

Area of Science:

  • Biomedical Informatics
  • Computational Biology
  • Natural Language Processing

Background:

  • Biomedical research relies heavily on software tools, but existing repositories are difficult to scale due to manual curation.
  • There is a need for automated methods to index and catalog software used in the biomedical domain.

Purpose of the Study:

  • To develop and evaluate machine learning-based named entity recognition systems for automatically identifying biomedical software from literature.
  • To create a comprehensive biomedical software catalog using automated methods.

Main Methods:

  • Manually annotated a corpus of 1,120 MEDLINE abstracts and titles for software names.
  • Developed named entity recognition systems using domain knowledge features and unsupervised word embeddings.
  • Evaluated system performance using F-measure with inexact matching criteria.

Main Results:

  • The best system achieved an F-measure of 91.79% for software recognition in titles and 86.35% in titles and abstracts.
  • A biomedical software catalog with 19,557 entries was successfully created using the developed system.

Conclusions:

  • Natural language processing methods are feasible for automatically building a high-quality software index from biomedical literature.
  • Automated software cataloging can overcome the scalability limitations of manual curation, supporting biomedical research.