Skip to content

Bump neuron SDK version#3260

Merged
dacorvo merged 13 commits intomainfrom
neuron_use_nxd_backend
Jun 10, 2025
Merged

Bump neuron SDK version#3260
dacorvo merged 13 commits intomainfrom
neuron_use_nxd_backend

Conversation

@dacorvo
Copy link
Collaborator

@dacorvo dacorvo commented Jun 10, 2025

What does this PR do?

This pull-request updates the neuron backend to use the latest optimum-neuron package that is based on AWS Neuron SDK 2.22.

Note that the modeling code has been heavily modified in optimum-neuron:

  • mistral and gpt2 are not supported anymore,
  • llama now uses a different modeling.

Copy link
Contributor

@Narsil Narsil left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM !

@dacorvo dacorvo merged commit 79183d1 into main Jun 10, 2025
31 of 33 checks passed
@dacorvo dacorvo deleted the neuron_use_nxd_backend branch June 10, 2025 15:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants