
RLHF

Posted on 2023-04-09 | In Science

Train an agent so that it has human-like judgment, instead of training it to give yes-or-no answers.

Is this RL on HF?

# GPT # HuggingFace # LLM # prompt
Allen Lu (from John Doe)

