Lynx Li, May 12, 2025

RLHF -- From Zero to PPO: Code Edition

Last updated on November 23, 2025


1 A Simple Reinforcement Learning Example

ongoing

2 The PPO Implementation in OpenRLHF

ongoing


https://lynx-li.github.io/2025/05/12/llms/rlhf/ppo_from_start_code/