Lynx Li Blog

Efficient PyTorch Implementation of MoE with Aux loss and Token drop

1 Preliminaries Mixture-of-Experts is an essential architecture choice when building LLMs. Since the prevalence of DeepSeekV3, companies will consider whether to use MoE structure before LLM pretrain

2025-08-03

AI Infra

#Deep Learning #AI Infra

04-GPU Programming 101

This is a lecture note of the course CSE 234 - Data Systems for ML - LE [A00].From UC SanDiegoProf. Zhang HaoWinter, 2025Link: https://podcast.ucsd.edu/watch/wi25/cse234_a00/1

2025-06-29

AI Infra > ML System

#Deep Learning #AI Infra #ML Systems

03-Optimization on Operator and Matrix Multiplication

This is a lecture note of the course CSE 234 - Data Systems for ML - LE [A00].From UC SanDiegoProf. Zhang HaoWinter, 2025Link: https://podcast.ucsd.edu/watch/wi25/cse234_a00/1

2025-06-28

AI Infra > ML System

#Deep Learning #AI Infra #ML Systems

02-Behind ML Framework

This is a lecture note of the course CSE 234 - Data Systems for ML - LE [A00].From UC SanDiegoProf. Zhang HaoWinter, 2025Link: https://podcast.ucsd.edu/watch/wi25/cse234_a00/1

2025-06-23

AI Infra > ML System

#Deep Learning #AI Infra #ML Systems

01-Introduction

This is a lecture note of the course CSE 234 - Data Systems for ML - LE [A00].From UC SanDiegoProf. Zhang HaoWinter, 2025Link: https://podcast.ucsd.edu/watch/wi25/cse234_a00/1

2025-06-23

AI Infra > ML System

#Deep Learning #AI Infra #ML Systems

03-Flow Matching and Conditional Flow Matchings

The series of tutorial is based on Flow Matching Guide and CodearXiv: 2412.06264Thank you, META Flow Matching Problem Instead of learning the likelihood of the target like the

2025-06-09

Visual Generation > Flow Matching

#Deep Learning #Generative Model #Flow Matching

02-Flow model, Everything Before Flow Matching

The series of tutorial is based on Flow Matching Guide and CodearXiv: 2412.06264Thank you, META Flow Models Before flow matching, flow models are hard to train, it entails sol

2025-06-08

Visual Generation > Flow Matching

#Deep Learning #Generative Model #Flow Matching

01-Overview of Flow Matching

The series of tutorial is based on Flow Matching Guide and CodearXiv: 2412.06264Thank you, META 1 The Definition of the Velocity ODE Let’s consider a particle moving in the sp

2025-06-08

Visual Generation > Flow Matching

#Deep Learning #Generative Model #Flow Matching

AI Research--A Year of Ramblings

The Beginning of Everything Today marks exactly one year since I became an amateur AI researcher. I still remember how disheartened I was with my circumstances back then. I was originally supposed to

2025-05-12

Life Moments

#Emotional #AI researcher #Independent Researcher

Why model.enable_input_require_grads()?

What happens when using LoRA? It starts with the error RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn when you have part of the parameter

2025-05-12

LLM > Troubleshooting

#Deep Learning #AI