[tx] Fix loss function config keys and add validation by pcmoritz · Pull Request #1159 · NovaSky-AI/SkyRL

pcmoritz · 2026-02-16T20:22:18Z

This fixes the loss function keys and brings them in line with the upstream keys in https://tinker-docs.thinkingmachines.ai/losses

gemini-code-assist

Code Review

This pull request correctly refactors the loss function configuration keys, replacing the ambiguous clip_ratio with the more explicit clip_low_threshold and clip_high_threshold. The changes are consistently applied across the backend implementation, loss functions, and tests. This significantly improves the clarity and maintainability of the configuration handling, and as the description notes, brings it in line with the documentation. The test updates are thorough and correctly validate the new behavior. This is a high-quality and beneficial change.

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 4 additional findings.

pcmoritz · 2026-02-16T21:45:18Z

/gemini review

gemini-code-assist

Code Review

This pull request correctly updates the loss function configuration keys for PPO, replacing clip_ratio with clip_low_threshold and clip_high_threshold. The changes are applied consistently across the API, backend, and tests. The addition of a Pydantic validator to enforce correct loss_fn_config keys is a great improvement for API robustness. The new tests effectively cover the changes. I have one suggestion to improve maintainability.

gemini-code-assist · 2026-02-16T21:47:50Z

skyrl-tx/tx/tinker/api.py

+        allowed_keys_by_loss_fn = {
+            "cross_entropy": set(),
+            "importance_sampling": set(),
+            "ppo": {"clip_low_threshold", "clip_high_threshold"},
+        }


The allowed_keys_by_loss_fn dictionary is defined inside the validate_loss_fn_config_keys method. This means it will be recreated every time an instance of ForwardBackwardInput is validated. For better performance and to follow the convention of defining constants at the class or module level, consider moving this dictionary to be a class attribute of ForwardBackwardInput (e.g., _ALLOWED_KEYS_BY_LOSS_FN). This will avoid re-creating the dictionary on each validation call.

pcmoritz · 2026-02-16T22:20:27Z

/gemini review

pcmoritz · 2026-02-17T00:29:27Z

/gemini review

gemini-code-assist

Code Review

This pull request successfully refactors the loss function configuration by replacing the ambiguous clip_ratio with explicit clip_low_threshold and clip_high_threshold keys for the PPO loss function. It also introduces robust validation at the API level to ensure only valid configuration keys are used for each loss function, which is a great improvement for API clarity and correctness. The changes are consistently applied across the API, backend, and loss function implementations, and are well-supported by new and updated tests.

gemini-code-assist · 2026-02-17T00:32:23Z

skyrl-tx/tests/tinker/test_api_validation.py

+
+
+def test_forward_backward_input_rejects_invalid_ppo_loss_fn_config_keys():
+    with pytest.raises(ValidationError, match="Invalid loss_fn_config keys"):


The match string for pytest.raises is a bit broad. Making it more specific to the error message for the 'ppo' loss function will make this test more robust against future changes to other validation error messages.

Suggested change

with pytest.raises(ValidationError, match="Invalid loss_fn_config keys"):

with pytest.raises(ValidationError, match="Invalid loss_fn_config keys for loss_fn='ppo'"):

See #1159  --- <a href="https://app.devin.ai/review/novasky-ai/skyrl/pull/1168" target="_blank"> <picture> <source media="(prefers-color-scheme: dark)" srcset="https://static.devin.ai/assets/gh-open-in-devin-review-dark.svg?v=1"> <img src="https://static.devin.ai/assets/gh-open-in-devin-review-light.svg?v=1" alt="Open with Devin"> </picture> </a>

[tx] Fix loss function config keys

4a1628f

gemini-code-assist bot reviewed Feb 16, 2026

View reviewed changes

devin-ai-integration bot reviewed Feb 16, 2026

View reviewed changes

add validation

dccd8a7

pcmoritz changed the title ~~[tx] Fix loss function config keys~~ [tx] Fix loss function config keys and add validation Feb 16, 2026

pcmoritz added the tx label Feb 16, 2026

gemini-code-assist bot reviewed Feb 16, 2026

View reviewed changes

update

e9de29c

This comment was marked as resolved.

Sign in to view

use valid config for the test

5fed91c

gemini-code-assist bot reviewed Feb 17, 2026

View reviewed changes

pcmoritz merged commit 6896471 into NovaSky-AI:main Feb 17, 2026
1 of 2 checks passed

pcmoritz added a commit to pcmoritz/SkyRL that referenced this pull request Feb 17, 2026

Port NovaSky-AI#1159 to skyrl folder

3a9503d

pcmoritz mentioned this pull request Feb 17, 2026

Port https://github.com/NovaSky-AI/SkyRL/pull/1159 to skyrl folder #1168

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[tx] Fix loss function config keys and add validation#1159

[tx] Fix loss function config keys and add validation#1159
pcmoritz merged 4 commits intoNovaSky-AI:mainfrom
pcmoritz:tx-fix-loss-fn-config

pcmoritz commented Feb 16, 2026 •

edited by devin-ai-integration bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

devin-ai-integration bot left a comment

Uh oh!

pcmoritz commented Feb 16, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Feb 16, 2026

Uh oh!

pcmoritz commented Feb 16, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

pcmoritz commented Feb 17, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Feb 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant



		def test_forward_backward_input_rejects_invalid_ppo_loss_fn_config_keys():
		with pytest.raises(ValidationError, match="Invalid loss_fn_config keys"):

Conversation

pcmoritz commented Feb 16, 2026 • edited by devin-ai-integration bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

✅ Devin Review: No Issues Found

Uh oh!

pcmoritz commented Feb 16, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

pcmoritz commented Feb 16, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

pcmoritz commented Feb 17, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

pcmoritz commented Feb 16, 2026 •

edited by devin-ai-integration bot

Loading