[skyrl-train] add option to specify ref model path #623
erictang000 merged 2 commits into NovaSky-AI:main
Conversation
Code Review
This pull request introduces a valuable enhancement by allowing the reference model path to be specified independently from the policy model path. The changes are well-implemented across the configuration, documentation, and trainer logic, ensuring backward compatibility by defaulting the reference model path to the policy model's path. My review includes a minor suggestion to improve the clarity of the documentation. Overall, this is a good change that increases the flexibility of the training setup.
    fsdp_size: -1
    sequence_parallel_size: 1

- ``ref.model.path``: Path to the reference model. Defaults to the policy model path, but can be separately set (i.e. for on policy distillation, the reference model can be a different model than the policy model).
The term "on policy distillation" is a bit ambiguous and could be a typo. While PPO is an on-policy algorithm, using a separate reference model is a concept often associated with off-policy methods or distillation. To improve clarity, I suggest rephrasing this part of the sentence.
For example, you could say "(i.e., for distillation, the reference model can be a different model than the policy model)" or more generally "(e.g., for distillation-based approaches, ...)".
Add option to specify ref model path separately from policy model path. Default stays as policy model path.
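A minimal sketch of what this could look like in a trainer config, assuming a nested YAML layout matching the ``ref.model.path`` key documented above (the surrounding section names and the model paths are illustrative assumptions, not taken from the repository):

```yaml
# Hypothetical config excerpt; key structure follows the documented
# ``ref.model.path`` option, the rest is assumed for illustration.
policy:
  model:
    path: Qwen/Qwen2.5-7B-Instruct      # model being trained
ref:
  model:
    # New in this PR: if left unset, this falls back to the policy
    # model path (preserving the previous behavior). Set it explicitly
    # to use a different reference model, e.g. a separate teacher.
    path: Qwen/Qwen2.5-72B-Instruct
```

Omitting ``ref.model.path`` keeps the old default, so existing configs are unaffected.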