[train] Fix issue with unset pad_token_id (#1232)
Conversation
Signed-off-by: SumanthRH <sumanthrh99@gmail.com>
With this PR, it is still not possible to successfully train with … The issue is that the repo for … If I manually set the env var in the workers, I am able to run SFT with …
Code Review
This pull request introduces a centralized get_tokenizer utility to consistently handle tokenizer instantiation and fix an issue with unset pad_token_id. While the refactoring is well-executed, a critical security concern was identified: the use of trust_remote_code=True when loading tokenizers from user-supplied model paths can lead to Remote Code Execution (RCE) if a malicious model is loaded. It is recommended to disable trust_remote_code by default and only allow it if explicitly requested with appropriate warnings. Furthermore, to improve robustness, the new utility should handle cases where a tokenizer lacks both a pad_token_id and an eos_token_id to prevent potential runtime errors.
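For context, the core responsibility of such a utility — falling back to `eos_token_id` when `pad_token_id` is unset — might be sketched roughly as follows. This is an illustrative helper, not the PR's actual API; the function name `ensure_pad_token` and the stub tokenizer are assumptions made for the example.

```python
def ensure_pad_token(tokenizer):
    """Fall back to the EOS token when no pad token is configured.

    `tokenizer` is any object exposing the Hugging Face tokenizer
    attributes pad_token_id, pad_token, eos_token_id, eos_token.
    """
    if tokenizer.pad_token_id is None:
        if tokenizer.eos_token_id is None:
            # Neither token is set: downstream padding code would break,
            # so fail loudly instead of propagating None.
            raise ValueError(
                "Tokenizer has neither pad_token_id nor eos_token_id; "
                "set pad_token_id explicitly to enable padding."
            )
        tokenizer.pad_token_id = tokenizer.eos_token_id
        tokenizer.pad_token = tokenizer.eos_token
    return tokenizer


# Minimal stand-in for a real tokenizer, used only to demonstrate the logic.
class _StubTokenizer:
    pad_token_id = None
    pad_token = None
    eos_token_id = 2
    eos_token = "</s>"


tok = ensure_pad_token(_StubTokenizer())
```

After the call, `tok.pad_token_id` mirrors the EOS id, which is the same fallback the PR applies when loading real tokenizers.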
```python
tokenizer.pad_token_id = tokenizer.eos_token_id
tokenizer.pad_token = tokenizer.eos_token
```
The current implementation to set pad_token_id from eos_token_id is not fully robust. If a tokenizer has neither a pad_token_id nor an eos_token_id, pad_token_id will be set to None. This can lead to runtime errors in downstream code that expects an integer pad_token_id for padding (e.g., in skyrl.backends.skyrl_train_backend._to_training_batch).
To prevent this, you should verify that eos_token_id is available before the assignment. If it's not, raising an explicit ValueError would provide a clear error message to the user, indicating that the tokenizer configuration is incomplete for the required padding operations.
Suggested change:

```diff
-tokenizer.pad_token_id = tokenizer.eos_token_id
-tokenizer.pad_token = tokenizer.eos_token
+if tokenizer.eos_token_id is not None:
+    tokenizer.pad_token_id = tokenizer.eos_token_id
+    tokenizer.pad_token = tokenizer.eos_token
+else:
+    raise ValueError(
+        f"Tokenizer for '{model_name_or_path}' has no `pad_token_id` and no `eos_token_id`. "
+        "Please set `pad_token_id` for this model to ensure correct padding."
+    )
```
```diff
     Initialize the Megatron-Bridge bridge and provider objects + hf_config and tokenizer
     """
-    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
+    tokenizer = get_tokenizer(model_path, trust_remote_code=True)
```
The use of trust_remote_code=True when loading a tokenizer from a potentially untrusted model_path (which can be supplied via command-line arguments) poses a significant security risk. If an attacker provides a path to a malicious model, arbitrary code contained within the model's configuration or tokenizer files could be executed on the system. It is highly recommended to set trust_remote_code=False by default and only enable it if the user explicitly opts in through a configuration flag, ideally with a warning about the risks involved.
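One way to implement this opt-in pattern is a small gate that defaults to `False` and warns loudly when enabled. This is a minimal sketch under stated assumptions — the helper name `resolve_trust_remote_code` is hypothetical and not part of the repo:

```python
import warnings


def resolve_trust_remote_code(user_opt_in: bool = False) -> bool:
    """Return the trust_remote_code flag to pass to from_pretrained.

    Defaults to False. When the caller explicitly opts in, emit a
    warning, since code from a model repository runs with full local
    privileges.
    """
    if user_opt_in:
        warnings.warn(
            "trust_remote_code=True executes code from the model "
            "repository; only enable this for trusted sources.",
            stacklevel=2,
        )
    return user_opt_in
```

The loader would then pass `trust_remote_code=resolve_trust_remote_code(flag_from_config)` instead of a hard-coded `True`.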
```python
self.tokenizer = get_tokenizer(
    self.cfg.trainer.policy.model.path,
    trust_remote_code=True,
    use_fast=not self.cfg.trainer.disable_fast_tokenizer,
    padding_side="left",
)
```
Similar to the finding in megatron_worker.py, this call to get_tokenizer explicitly enables trust_remote_code for a model path derived from user-controlled configuration (self.cfg.trainer.policy.model.path). This creates a vector for Remote Code Execution (RCE) if the model path points to a malicious repository or local directory. Consider making this an optional user-controlled setting that defaults to False.
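Concretely, the flag could be read defensively from the trainer config so that older configs without the field still get the safe default. The config field name `trust_remote_code` below is an assumption for illustration, not an existing option in the repo:

```python
from types import SimpleNamespace

# Hypothetical config shape mirroring cfg.trainer.* used in the diff above.
cfg = SimpleNamespace(
    trainer=SimpleNamespace(
        trust_remote_code=False,  # safe default; user must opt in explicitly
        policy=SimpleNamespace(model=SimpleNamespace(path="org/model")),
    )
)

# getattr with a default keeps configs lacking the field on the safe path.
trust_remote_code = getattr(cfg.trainer, "trust_remote_code", False)
```

This value would then be forwarded to `get_tokenizer` in place of the hard-coded `trust_remote_code=True`.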
What does this PR do?
Fixes #1231