Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

Ribosome Profiling

Ribosome Profiling

Ribosome profiling or ribo-sequencing is a deep sequencing technique that produces a snapshot of active translation in a cell. It selectively sequences the mRNAs protected by ribosomes to get an insight into a cell’s translation landscape at any given point in time.
Applications of ribosome profiling
Ribosome profiling has many applications, including in vivo monitoring of translation inside a particular organ or tissue type and quantifying new protein synthesis levels.
The technique...

Scaling

Scaling

In designing and analyzing filters, resonant circuits, or circuit analysis at large, working with standard element values like 1 ohm, 1 henry, or 1 farad can be convenient before scaling these values to more realistic figures. This approach is widely utilized by not employing realistic element values in numerous examples and problems; it simplifies mastering circuit analysis through convenient component values. The complexity of calculations is thereby reduced, with the understanding that...

Language and Cognition

Language and Cognition

Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.

Aggregates Classification

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Improving Retrieval-Augmented Generation without Taxonomy-based Error Categorization.

Proceedings of the conference. Association for Computational Linguistics. Meeting·2026

Same author

Hard to Halt: Automation Bias in Agent-Driven Sequencing Prior Authorization Workflows.

medRxiv : the preprint server for health sciences·2026

Same author

Unsupervised characterization of 100,272 EHR patients identifies high-risk groups and comorbidities linked to premature aging.

NPJ digital medicine·2026

Same author

TimeX: Phenotype Onset Extraction from Clinical Narratives.

npj health systems·2026

Same author

Completeness of Common Data Elements for Breast Cancer Clinical Trials in Observational Databases.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026

Same author

Wrong-side imaging orders: automated detection using electronic health record data - a retrospective cohort study.

BMJ open quality·2026

Same journal

Evaluation of temporal preservation in synthetic longitudinal patient data.

Journal of biomedical informatics·2026

Same journal

ARKE: An ontology-driven framework for automated mapping of local radiology procedure terms to the LOINC-RadLex playbook using large language model.

Journal of biomedical informatics·2026

Same journal

A validation-driven training controller for cross-lingual biomedical NER via reinforcement learning-based adaptive loss weighting.

Journal of biomedical informatics·2026

Same journal

ASP-HR: An Adaptive Spatial Perception and Hierarchical Reasoning mechanism for document-level biomedical relation extraction.

Journal of biomedical informatics·2026

Same journal

Beyond Accuracy: Safety-Centered guidelines for the evaluation of LLM-based therapy recommendation systems for chronic multimorbidity patients.

Journal of biomedical informatics·2026

Same journal

DeepEN: A deep reinforcement learning framework for personalized enteral nutrition in critical care.

Journal of biomedical informatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 12, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Scalable scientific interest profiling using large language models.

Yilun Liang¹, Gongbo Zhang², Edward Sun³

¹Tandon School of Engineering, New York University, Brooklyn, NY, USA.

Journal of Biomedical Informatics

|November 2, 2025

Summary

This summary is machine-generated.

Large Language Models (LLMs) can automate scientific interest profiling. Profiles generated using Medical Subject Headings (MeSH) terms are more readable, though human-written profiles offer more novel concepts.

Keywords:

Kullback-Leibler Divergence Large Language Models Natural Language Generation Researcher Profiling

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

A User-friendly and Powerful R Analysis of Large-scale Datasets

A User-friendly and Powerful R Analysis of Large-scale Datasets

Published on: November 4, 2025

Related Experiment Videos

Last Updated: Jan 12, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

A User-friendly and Powerful R Analysis of Large-scale Datasets

A User-friendly and Powerful R Analysis of Large-scale Datasets

Published on: November 4, 2025

Area of Science:

Biomedical informatics
Artificial intelligence in research

Background:

Scientific research profiles are crucial for talent discovery and collaboration.
Existing profiles are often outdated, necessitating automated and scalable solutions.
Large Language Models (LLMs) offer a potential solution for dynamic profile generation.

Purpose of the Study:

To design and evaluate LLM-based methods for generating scientific interest profiles.
To compare machine-generated profiles with researchers' self-summarized interests.
To assess the performance of profiles generated from PubMed abstracts versus Medical Subject Headings (MeSH) terms.

Main Methods:

Two LLM-based methods were developed: one summarizing researcher abstracts and another using MeSH terms.
GPT-4o-mini was used to generate summaries for 595 researchers from Columbia University Irving Medical Center.
Automated metrics (ROUGE-L, BLEU, METEOR, BERTScore, KL Divergence) and manual evaluations were employed for comparison.

Main Results:

Automated metrics showed low lexical overlap but moderate semantic similarity (BERTScore F1: ~0.55) between machine-generated and human-written profiles.
Manually paraphrased summaries achieved higher similarity (F1: 0.851).
MeSH-based profiles demonstrated superior readability (93.44% favorable ratings) and were preferred in 67.86% of manual reviews, despite differences in keyword usage and factual accuracy compared to human-written profiles.

Conclusions:

LLMs show promise for scalable automation of scientific interest profiling.
MeSH-based LLM-generated profiles offer better readability than abstract-based ones.
While LLMs can generate semantically similar profiles, human-written summaries tend to introduce more novel concepts.