[skyrl-train] assert that the policy loss type is regular/dual clip for tis by erictang000 · Pull Request #546 · NovaSky-AI/SkyRL

erictang000 · 2025-10-21T19:27:33Z

TIS is currently only enabled for policy loss types that use the ppo_policy_loss code path

gemini-code-assist

Code Review

This pull request adds a validation check to ensure that Truncated Importance Sampling (TIS) is only used with compatible policy loss types, which is a good safeguard. My feedback suggests a small improvement to make this validation more robust by using a ValueError instead of an assert, aligning with the existing validation patterns in the file.

gemini-code-assist · 2025-10-21T19:28:16Z

skyrl-train/skyrl_train/utils/utils.py

            raise ValueError(
                "Gneration with `trainer.algorithm.use_tis` needs to be batched with only single turn generation"
            )
+        assert cfg.trainer.algorithm.policy_loss_type in ["regular", "dual_clip"], "TIS is only implemented for regular and dual_clip policy loss types"


For configuration validation, it's better to raise a ValueError instead of using assert. assert statements can be disabled when Python is run with the -O (optimize) flag, which would cause this important validation to be skipped. Using ValueError ensures the check is always performed and is consistent with other checks in this function.

Suggested change

assert cfg.trainer.algorithm.policy_loss_type in ["regular", "dual_clip"], "TIS is only implemented for regular and dual_clip policy loss types"

if cfg.trainer.algorithm.policy_loss_type not in ["regular", "dual_clip"]:

raise ValueError(

f"TIS is only implemented for 'regular' and 'dual_clip' policy loss types, but got '{cfg.trainer.algorithm.policy_loss_type}'"

)

…or tis (NovaSky-AI#546) TIS is currently only enabled for policy loss types that use the `ppo_policy_loss` code path

assert that the policy loss type is regular/dual clip for tis

dad7ae2

erictang000 requested a review from SumanthRH October 21, 2025 19:27

gemini-code-assist bot reviewed Oct 21, 2025

View reviewed changes

SumanthRH approved these changes Oct 21, 2025

View reviewed changes

x

2e65c96

erictang000 merged commit b714003 into NovaSky-AI:main Oct 21, 2025
3 checks passed

erictang000 deleted the tis_check branch October 21, 2025 19:33

atemaguer pushed a commit to atemaguer/SkyRL that referenced this pull request Oct 24, 2025

[skyrl-train] assert that the policy loss type is regular/dual clip f…

01c6f6d

…or tis (NovaSky-AI#546) TIS is currently only enabled for policy loss types that use the `ppo_policy_loss` code path

li-boxuan pushed a commit to li-boxuan/SkyRL that referenced this pull request Nov 23, 2025

[skyrl-train] assert that the policy loss type is regular/dual clip f…

29e0b82

…or tis (NovaSky-AI#546) TIS is currently only enabled for policy loss types that use the `ppo_policy_loss` code path

dzorlu pushed a commit to fleet-ai/SkyRL that referenced this pull request Feb 4, 2026

[skyrl-train] assert that the policy loss type is regular/dual clip f…

f1871ca

…or tis (NovaSky-AI#546) TIS is currently only enabled for policy loss types that use the `ppo_policy_loss` code path

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[skyrl-train] assert that the policy loss type is regular/dual clip for tis#546

[skyrl-train] assert that the policy loss type is regular/dual clip for tis#546
erictang000 merged 2 commits intoNovaSky-AI:mainfrom
erictang000:tis_check

erictang000 commented Oct 21, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Oct 21, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

erictang000 commented Oct 21, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants