Survival Tree
Reinforcement Schedules
Observational Learning
Language Development
Reinforcement
Language and Cognition
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Updated: Jul 4, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
Published on: December 6, 2024
Hieu Tran1,2, Zonghai Yao1,2, Hong Yu1,2,3
1Center for Healthcare Organization and Implementation Research, VA Bedford Health Care.
Reinforcement learning for large language models (LLMs) is improved by TEMPO, a novel critic-free algorithm. TEMPO enhances policy optimization by using a prefix tree to better assign credit for token-level rewards, boosting model performance.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: