Effects of feedback
Load-frequency control
Reinforcement
Feedback control systems
Confirmation Biases
Law of Effect
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Updated: May 21, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
Published on: December 6, 2024
Zafaryab Haider1, Md Hafizur Rahman2, Vijay Devabhaktuni3
1Department of Electrical and Computer Engineering (ECE), University of Maine, Orono, ME, USA. zafaryab.haider@maine.edu.
A new framework called COBRA addresses security risks in training Large Language Models (LLMs) using Reinforcement Learning from Human Feedback (RLHF). COBRA effectively filters out malicious human feedback, improving LLM performance and safety in real-world applications.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: