Lynx Li Blog
  • Home
  • Archives
  • Categories
  • Tags
  • About

54 posts in total


2025

05-12
RLHF -- From Zero to PPO 代码篇
05-12
RLHF -- From Zero to PPO 理论篇
05-12
最初的sin/cos编码
05-12
Why model.enable_input_require_grads()?
05-08
Rethinking R1-like Rule-based RL
05-08
05:矩阵分解

2024

01-07
02:信号的分析方法
01-07
03:调制模拟信号
01-06
01:信号与系统的基础
01-06
01:通信的基本概念
123456

Search

Hexo Fluid