Lynx Li, May 12, 2025

RLHF -- GRPO

Last updated on November 23, 2025

ongoing


#Deep Learning #Intelligent Systems #AIGC
https://lynx-li.github.io/2025/05/12/llms/rlhf/grpo/
