[Feature]: Add symmetric memory all-reduce strategy #8921

@MrGeva

🚀 The feature, motivation and pitch

See the vLLM implementation. This all-reduce strategy is 3x faster.

Alternatives

No response

Additional context

No response

Metadata

Assignees

Labels

  • Scale-out<NV>: Multi-GPU and distributed inference scaling issues, tensor/pipeline/data parallelism
  • feature request: New feature or request. This includes new model, dtype, functionality support

Type

No type

Projects

Status

In review

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests
