generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.6k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Pass tools as None to
apply_chat_template when it is an empty list
#5380
opened Mar 26, 2026 by
rabinadk1
Loading…
2 of 5 tasks
Use datasets Json dtype to prevent insertion of None values
#5376
opened Mar 26, 2026 by
albertvillanova
Loading…
[vllm-serve] Add extra_llm_kwargs for passing additional arguments to vllm.LLM()
#5367
opened Mar 25, 2026 by
jonahsamost
Loading…
1 of 5 tasks
Remove deprecated
TRACKIO_SPACE_ID env var from all scripts
#5365
opened Mar 24, 2026 by
sergiopaniego
Loading…
5 tasks
Enforce PR template for first-time contributors and document AI usage policy
#5356
opened Mar 24, 2026 by
qgallouedec
Loading…
3 of 8 tasks
Add chunked LM head for memory-efficient log-prob computation for AsyncGRPOTrainer
#5349
opened Mar 23, 2026 by
AmineDiro
Loading…
[Test] Fix *test_training_vlm_multi_image* by skipping vision params in assertion
#5341
opened Mar 22, 2026 by
YangKai0616
Loading…
Fix Liger kernel crash with device_map="auto" on multi-GPU in GRPOTrainer
#5340
opened Mar 22, 2026 by
YangKai0616
Loading…
Support multimodal tool responses in
environment_factory for VLM training
#5323
opened Mar 20, 2026 by
sergiopaniego
Loading…
5 tasks
(4/5) async grpo break out of generation loop (is_done)
#5321
opened Mar 20, 2026 by
AmineDiro
Loading…
(1/5) Add callback to sync weights before training begins
#5319
opened Mar 20, 2026 by
AmineDiro
Loading…
(2/5) Refactor RolloutCompletion in Async Rollout Worker
#5318
opened Mar 20, 2026 by
AmineDiro
Loading…
Expand the list of attention implementations compatible with packing
#5316
opened Mar 19, 2026 by
mariosasko
Loading…
1 of 5 tasks
fix: fix a bug in vLLM weight synchronization when
vllm_enable_sleep_mode=True
🩹 for patch
#5313
opened Mar 19, 2026 by
muupan
Loading…
1 of 5 tasks
Show conversations instead of decoded text in the completions table
#5309
opened Mar 19, 2026 by
qgallouedec
Loading…
Add support for logging extra columns in reward functions and update related tests
#5308
opened Mar 19, 2026 by
qgallouedec
Loading…
fix: skip ref adapter when peft config uses target_parameters
#5292
opened Mar 16, 2026 by
gambletan
Loading…
3 tasks
Introduce backend rollout-completions interface and decouple OpenEnv helper from vLLM internals
#5256
opened Mar 10, 2026 by
rycerzes
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.