Updated: Jun 3, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
Published on: December 6, 2024
Juncheol Ahn¹, Yubin Son¹, Daemin Kim¹
¹ System Software Laboratory, Department of Computer Engineering, Keimyung University, Daegu 42601, Republic of Korea
We developed a runtime-adaptive scheduler for large language models (LLMs) in IoT-edge-cloud settings. This dynamic scheduling significantly reduces GPU idle time and improves throughput for LLM inference.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: