[opt] Fix in attention after transformers upgrade by ugolowic · Pull Request #2276 · huggingface/optimum-habana

ugolowic · 2025-09-22T08:00:21Z

This commit fixes two issues that appeared in OPT attention after the transformers upgrade:

* the incorrect shape of attn_weights caused a mismatch in the subsequent torch.bmm operation
* applying scaling to attn_weights significantly worsened accuracy
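The two issues can be illustrated with a minimal, stdlib-only sketch. It is not the code changed in this PR. The tensor sizes are hypothetical, `bmm_output_shape` is a hypothetical helper that only mimics the shape rule of a batched matrix multiply, and the "scaled twice" case assumes the scores were already scaled once before the scaling factor was applied to attn_weights again, which the PR description does not confirm.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def bmm_output_shape(a_shape, b_shape):
    """Hypothetical helper: shape rule (b, n, m) x (b, m, p) -> (b, n, p)."""
    if len(a_shape) != 3 or len(b_shape) != 3:
        raise ValueError(f"expected two 3-D shapes, got {a_shape} and {b_shape}")
    if a_shape[0] != b_shape[0] or a_shape[2] != b_shape[1]:
        raise ValueError(f"incompatible shapes {a_shape} and {b_shape}")
    return (a_shape[0], a_shape[1], b_shape[2])


# Hypothetical sizes, chosen for illustration only.
bsz, num_heads, tgt_len, src_len, head_dim = 1, 12, 4, 4, 64

value_shape = (bsz * num_heads, src_len, head_dim)
flat_weights = (bsz * num_heads, tgt_len, src_len)        # 3-D, batch and heads merged
unflattened_weights = (bsz, num_heads, tgt_len, src_len)  # 4-D, heads kept separate

# Issue 1: a batched matmul needs matching 3-D operands.
ok_shape = bmm_output_shape(flat_weights, value_shape)
try:
    bmm_output_shape(unflattened_weights, value_shape)
    shape_error = None
except ValueError as exc:
    shape_error = str(exc)

# Issue 2 (assumed mechanism): applying the scaling factor a second time
# flattens the softmax distribution, so attention is spread almost uniformly.
scaling = head_dim ** -0.5
raw_scores = [8.0, 16.0, 24.0, 32.0]
scaled_once = softmax([s * scaling for s in raw_scores])
scaled_twice = softmax([s * scaling * scaling for s in raw_scores])

print("bmm output shape:", ok_shape)
print("shape error:", shape_error)
print("peak weight, scaled once: ", round(max(scaled_once), 4))
print("peak weight, scaled twice:", round(max(scaled_twice), 4))
```

Under these assumptions the second application of the scaling factor lowers the peak attention weight considerably, which is one plausible way an extra scaling step could hurt generation quality.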

This change should also be applied to v1.20-release.

Example:

PT_HPU_LAZY_MODE=1 python3 run_generation.py --batch_size 1 --bf16 --model_name_or_path facebook/opt-125m --use_hpu_graphs --n_iterations 1 --use_kv_cache --max_new_tokens 100

Signed-off-by: Urszula <urszula.golowicz@intel.com>

HuggingFaceDocBuilderDev · 2025-09-22T08:04:35Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

optimum/habana/transformers/models/opt/modeling_opt.py

astachowiczhabana · 2025-09-22T11:06:09Z

"Next Synapse Release Validation" is failing on one test for an unrelated reason. Otherwise it looks clean.

astachowiczhabana · 2025-09-22T11:06:17Z

LGTM, @regisss can we merge this?

regisss

LGTM

Signed-off-by: Urszula <urszula.golowicz@intel.com>

This reverts commit 4ac187f.

…uggingface#714) Signed-off-by: Urszula <urszula.golowicz@intel.com> Co-authored-by: Urszula Golowicz <urszula.golowicz@intel.com>

ugolowic requested a review from regisss September 22, 2025 08:00

astachowiczhabana reviewed Sep 22, 2025


astachowiczhabana approved these changes Sep 22, 2025


regisss approved these changes Sep 22, 2025


regisss merged commit 4ac187f into huggingface:main Sep 22, 2025
3 of 5 checks passed

astachowiczhabana pushed a commit that referenced this pull request Sep 23, 2025

[opt] Fix in attention after transformers upgrade (#2276)

283ebe8

Signed-off-by: Urszula <urszula.golowicz@intel.com>

astachowiczhabana added a commit that referenced this pull request Sep 23, 2025

Revert "[opt] Fix in attention after transformers upgrade (#2276)"

809cbe4

This reverts commit 4ac187f.

regisss merged 1 commit into huggingface:main from ugolowic:opt-attn-fix
