[GRPO] add chunked grpo streaming over vocab by kashif · Pull Request #1160 · linkedin/Liger-Kernel

kashif · 2026-03-23T14:49:36Z

Summary

This PR fixes the chunked GRPO loss to compute only selected-token log-probs by streaming over the vocab dimension. This reduces peak memory for the fused-linear chunked path and preserves the existing high-level fused-linear API.

We also fixes the luspo reduction in the chunked path to match TRL exactly, and tightens the torch.compile boundary so we only compile the pure loss computation instead of compiling through the closure that calls torch.autograd.grad.

So now the two implementations are correctly implementing the trade-offs of their design.

Testing Done

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

kashif · 2026-03-23T16:03:47Z

cc @vaibhavjindal for your review

kashif added 4 commits March 23, 2026 15:48

add chunked grpo streaming over vocab

3a84ebc

remove luspo changes

12f00a1

luspo is not valid for token level

11f5710

ignore luspo for token level

0248d72

kashif added 2 commits March 24, 2026 10:23

luspo fix

bc068d8

use pytorch version only

e25f787

kashif force-pushed the chunked_grpo_streaming_origin_main branch from 0e54614 to e25f787 Compare March 24, 2026 21:56

Renamed _selective_logprob_forward_torch → _selective_logprob_forward

846dc1f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GRPO] add chunked grpo streaming over vocab#1160

[GRPO] add chunked grpo streaming over vocab#1160
kashif wants to merge 7 commits intolinkedin:mainfrom
kashif:chunked_grpo_streaming_origin_main

kashif commented Mar 23, 2026 •

edited

Loading

Uh oh!

kashif commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

kashif commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing Done

Uh oh!

kashif commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

kashif commented Mar 23, 2026 •

edited

Loading