[Fix][Generator] Use custom_chat_template in each step retokenization#233
Conversation
Code Review
This pull request correctly applies the custom_chat_template in the re-tokenization step for consistency. While the change is good in isolation, it highlights a critical bug in the agent_loop logic. The initial prompt length is calculated using the default chat template, but the token IDs from which the prompt is later extracted are generated using the custom chat template. This mismatch will lead to incorrect prompt data. I've added a critical review comment with a detailed explanation of the issue and a suggested fix.
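To make the mismatch concrete, here is a minimal, self-contained sketch of the failure mode the review describes. The `apply_chat_template` helper below is a toy stand-in (not the repo's or Hugging Face's actual API), and the extra marker token emitted by the "custom" template is a hypothetical illustration:

```python
# Toy stand-in for tokenizer.apply_chat_template: renders messages into a
# token list. The hypothetical "custom" template emits one extra marker
# token, which is enough to break length bookkeeping done with the default.
def apply_chat_template(messages, template):
    tokens = []
    for m in messages:
        tokens += list(f"<{m['role']}>") + list(m["content"])
    if template == "custom":
        tokens += ["<gen>"]  # extra token only the custom template produces
    return tokens

chat = [{"role": "user", "content": "hi"}]
custom_chat_template = "custom"

# Buggy pattern: prompt length measured with the default template...
prompt_len_default = len(apply_chat_template(chat, "default"))
# ...while the ids that get sliced later were built with the custom template.
ids = apply_chat_template(chat, custom_chat_template)
# ids[:prompt_len_default] would silently drop the custom template's tokens.

# Fixed pattern: measure the prompt length with the same template used
# everywhere else in the codepath.
prompt_len = len(apply_chat_template(chat, custom_chat_template))
assert prompt_len == len(ids)  # lengths agree once the templates match
```

The point is not the toy tokenizer itself but the invariant: every call that produces or measures prompt tokens in one codepath must use the same chat template, or slice offsets computed from one rendering will be applied to another.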
/gemini review
Code Review
This pull request addresses an inconsistency by ensuring custom_chat_template is used during the re-tokenization of chat history. The changes in agent_loop and _get_next_input_ids_by_retokenizing_chat_history correctly apply the custom template when needed. The fix is sound and improves code consistency. I have provided one suggestion to enhance the code style in agent_loop.
…#233) Prior to this PR, the re-tokenize chat history codepath did not use `self.custom_chat_template` in some places (it was only applied in the final tokenization). Although currently only Qwen3 enters this codepath and is unaffected, applying the `custom_chat_template` everywhere keeps the behavior consistent.
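The shape of the fix can be sketched as follows. The method name `_get_next_input_ids_by_retokenizing_chat_history` comes from the PR discussion above; the tokenizer here is a toy stand-in with a hypothetical `"think-markers"` custom template, not the real Hugging Face API:

```python
class ToyTokenizer:
    """Toy stand-in for a chat-templated tokenizer (not a real library API)."""

    def apply_chat_template(self, messages, chat_template=None,
                            add_generation_prompt=False):
        # Render each message as pseudo-tokens; a custom template may emit
        # extra markers, which is exactly what must stay consistent across
        # every call site in the re-tokenization codepath.
        tokens = []
        for m in messages:
            tokens += [f"<{m['role']}>", m["content"]]
        if chat_template == "think-markers":  # hypothetical custom template
            tokens.append("<think>")
        if add_generation_prompt:
            tokens.append("<assistant>")
        return tokens


class Generator:
    def __init__(self, tokenizer, custom_chat_template=None):
        self.tokenizer = tokenizer
        self.custom_chat_template = custom_chat_template

    def _get_next_input_ids_by_retokenizing_chat_history(self, messages):
        # The fix: pass the same custom template here that the final
        # tokenization uses, instead of falling back to the default.
        return self.tokenizer.apply_chat_template(
            messages,
            chat_template=self.custom_chat_template,
            add_generation_prompt=True,
        )
```

With this, the tokens produced at each step of the agent loop match the ones the final tokenization would produce, so no call site sees a different rendering of the same history.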