Skip to content

Disable SDP on BF16 default for generic diffusers HPU support#2251

Merged
regisss merged 1 commit intohuggingface:mainfrom
dsocek:generic-diffusers-sdp-on-bf16
Sep 10, 2025
Merged

Disable SDP on BF16 default for generic diffusers HPU support#2251
regisss merged 1 commit intohuggingface:mainfrom
dsocek:generic-diffusers-sdp-on-bf16

Conversation

@dsocek
Copy link
Contributor

@dsocek dsocek commented Sep 10, 2025

What does this PR do?

HPU OOB functionality for diffusers' models use GPU-HPU Migration Toolkit. After 1.21 we default to eager mode where the default SDP kernel is in FP32 precision. Previously, the default lazy mode in GPU-HPU Migration Toolkit used SDP in BF16 precision (SW-226104). This causes performance degradation. huggingface/diffusers#12310 PR is submitted to improve performance on HPU devices.

In OH we want to keep the default as is, which is what this PR is for.

Signed-off-by: Daniel Socek <daniel.socek@intel.com>
@dsocek dsocek requested a review from regisss as a code owner September 10, 2025 14:54
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Collaborator

@regisss regisss left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@regisss regisss merged commit 92b1579 into huggingface:main Sep 10, 2025
2 of 5 checks passed
astachowiczhabana pushed a commit that referenced this pull request Sep 11, 2025
Signed-off-by: Daniel Socek <daniel.socek@intel.com>
astachowiczhabana pushed a commit that referenced this pull request Sep 17, 2025
Signed-off-by: Daniel Socek <daniel.socek@intel.com>
gplutop7 pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Oct 15, 2025
…gface#2251) (huggingface#671)

Signed-off-by: Daniel Socek <daniel.socek@intel.com>
Co-authored-by: Daniel Socek <daniel.socek@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants