Conversation
Signed-off-by: SumanthRH <sumanthrh99@gmail.com>
Code Review
This pull request addresses a conflict with a native vllm endpoint by renaming /update_weights to /update_weights_skyrl. However, a critical security vulnerability was identified: these sensitive endpoints lack any form of authentication. This allows anyone with network access to the inference server to trigger model weight updates, which could lead to model hijacking or denial of service. It is highly recommended to implement authentication for these custom endpoints, especially if the servers are reachable over a network. Additionally, to improve maintainability, consider using a constant for the new endpoint path to avoid hardcoding it in multiple locations.
```diff
  resp = await session.post(
-     f"{self._url}/update_weights",
+     f"{self._url}/update_weights_skyrl",
```
The endpoint path /update_weights_skyrl is hardcoded. This string is also used in the corresponding server implementation. To improve maintainability and ensure consistency, consider defining this path as a constant in a shared module (e.g., in a new constants.py file) and importing it where needed. This would make future changes to the endpoint easier and less error-prone.
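The shared-constant suggestion could look like the sketch below. The module name `constants.py` follows the reviewer's example, and the constant and helper names are assumptions for illustration; both client and server would import the same constant instead of repeating the string.

```python
# Hypothetical shared module (e.g. constants.py, as the reviewer suggests).
UPDATE_WEIGHTS_ENDPOINT = "/update_weights_skyrl"


def update_weights_url(base_url: str) -> str:
    """Build the full update-weights URL from a server base URL.

    Illustrative helper: the client would call this instead of formatting
    the path inline, and the server would register its route with the same
    UPDATE_WEIGHTS_ENDPOINT constant.
    """
    return f"{base_url}{UPDATE_WEIGHTS_ENDPOINT}"
```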
```diff
      Dict mapping server_url to response.
      """
-     return await self._call_all_servers("/update_weights", request.to_json_dict())
+     return await self._call_all_servers("/update_weights_skyrl", request.to_json_dict())
```
The endpoint path /update_weights_skyrl is hardcoded. This string is also used in the corresponding server implementation. To improve maintainability and ensure consistency, consider defining this path as a constant in a shared module (e.g., in a new constants.py file) and importing it where needed. This would make future changes to the endpoint easier and less error-prone.
```diff
      return {"status": "ok", "server_id": server_id}

-@app.post("/update_weights")
+@app.post("/update_weights_skyrl")
```
```diff
  # Verify correct endpoint called
  call_args = mock_session.post.call_args
- assert call_args[0][0] == f"{url}/update_weights"
+ assert call_args[0][0] == f"{url}/update_weights_skyrl"
```
What does this PR do?
Temporary fix for the weight sync CI failures reported in #1242, to unblock the release.
Changes
`test_weight_sync.py` was failing because SkyRL's `/update_weights` endpoint was conflicting with vLLM's `/update_weights` endpoint. This PR fixes the issue by renaming our update-weights endpoint to `/update_weights_skyrl`.
The same change has been replicated for the remote server codepath in the old `InferenceEngineClient` codepath (`generators/inference_engines`).
The long-term fix is to migrate SkyRL's new inference servers codepath (`generators/inference_servers`) to use the native vLLM API endpoints.