Home
Archives
Tags

RLHF

Posted on 2023-04-09 | In Science

Train an agent 讓 agent 具有人的判斷。instead of train yes or no.

這是 RL on HF?

# GPT # HuggingFace # LLM # prompt

XYZ Is All You Need Rationale

Paper Study By Amazon Li Mu and Zhu

Allen Lu (from John Doe)

© 2026 Allen Lu (from John Doe)

Powered by Jekyll

Theme - NexT.Muse