Lynx Li Blog
Home
Archives
Categories
Tags
About
54 posts in total
2025
05-12
RLHF -- From Zero to PPO 代码篇
05-12
RLHF -- From Zero to PPO 理论篇
05-12
最初的sin/cos编码
05-12
Why model.enable_input_require_grads()?
05-08
Rethinking R1-like Rule-based RL
05-08
05:矩阵分解
2024
01-07
02:信号的分析方法
01-07
03:调制模拟信号
01-06
01:信号与系统的基础
01-06
01:通信的基本概念
1
2
3
4
5
6
Search
×
Keyword
Blog works best with JavaScript enabled