NexT


  • Home

  • Archives

  • Tags

RLHF

Posted on 2023-04-09 | In Language

Train an agent 讓 agent 具有人的判斷。instead of train yes or no.

這是 RL on HF?

# GPT # LLM # HuggingFace # prompt
Attention Is All You Need
Paper Study By Amazon Li Mu and Zhu
Allen Lu (from John Doe)

Allen Lu (from John Doe)

243 posts
18 categories
140 tags
RSS
© 2024 Allen Lu (from John Doe)
Powered by Jekyll
Theme - NexT.Muse