dacorvo
Collaborator

@dacorvo dacorvo commented Jun 27, 2025

What does this PR do?

This bumps the optimum-neuron version to 0.2.2, adding support for Qwen3 models in the Neuron backend.
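To confirm that an environment has picked up the bump, a check along these lines may help. It is only a sketch: the helper names (`parse_release`, `meets_minimum`, `installed_optimum_neuron`) are hypothetical and not part of optimum-neuron, the comparison deliberately ignores pre-release suffixes, and it assumes the package is installed under the distribution name `optimum-neuron`.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional, Tuple


def parse_release(v: str) -> Tuple[int, ...]:
    """Turn '0.2.2' into (0, 2, 2), stopping at the first non-numeric part.

    Simplification: suffixes such as '.dev0' are dropped, so '0.2.2.dev0'
    is treated the same as '0.2.2'.
    """
    parts = []
    for piece in v.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def meets_minimum(installed: str, minimum: str = "0.2.2") -> bool:
    """Return True if `installed` is at least `minimum` (numeric parts only)."""
    return parse_release(installed) >= parse_release(minimum)


def installed_optimum_neuron() -> Optional[str]:
    """Return the installed optimum-neuron version string, or None if absent."""
    try:
        return version("optimum-neuron")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_optimum_neuron()
    if found is None:
        print("optimum-neuron is not installed")
    elif meets_minimum(found):
        print(f"optimum-neuron {found} is at or above 0.2.2")
    else:
        print(f"optimum-neuron {found} is older than 0.2.2")
```

Comparing integer tuples rather than raw strings avoids the usual pitfall where `"0.10.0"` sorts before `"0.2.2"` lexicographically.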

dacorvo added 3 commits June 27, 2025 11:33
Since the latest optimum-neuron uses new modeling code for granite and
qwen, the greedy outputs are slightly different.
Collaborator

@tengomucho tengomucho left a comment

LGTM

@dacorvo dacorvo changed the title from "Optimum neuron 0.2.1" to "Optimum neuron 0.2.2" Jul 1, 2025
Collaborator

@Narsil Narsil left a comment

LGTM

Collaborator

@tengomucho tengomucho left a comment

LGTM

@dacorvo dacorvo merged commit 3d2e7c8 into main Jul 3, 2025
29 of 31 checks passed
@dacorvo dacorvo deleted the optimum_neuron_0.2.1 branch July 3, 2025 05:59
3 participants