Lynx Li Blog
Lynx Li May 8, 2025 am

Rethinking R1-like Rule-based RL

Last updated on November 23, 2025 pm

Introduction


Research Blogs
#LLM #Reasoning
https://lynx-li.github.io/2025/05/08/research/rule_based_rl/