Is the hierarchical all-to-all communication primitive released in the latest version? #5023
Unanswered · roychen9462 asked this question in Q&A · Replies: 0 comments
I am trying to understand the details of how DeepSpeed speeds up the MoE inference process. In Section 5.3 of the DeepSpeed-MoE paper, two communication optimizations are described for grouping and routing tokens more efficiently: hierarchical all-to-all and parallelism-coordinated communication optimization. Are these optimizations implemented in the latest version of DeepSpeed (v0.13.1)? Can anyone give some advice on this? Thanks!
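
For context, my rough mental model of the hierarchical all-to-all from the paper is a two-phase exchange like the sketch below, written with plain PyTorch process groups. The function and group names (`hierarchical_all_to_all`, `intra_node_group`, `inter_node_group`) are my own placeholders, not DeepSpeed APIs, and the local token reshuffle between the two phases is omitted:

```python
import torch
import torch.distributed as dist


def hierarchical_all_to_all(tokens: torch.Tensor,
                            intra_node_group: dist.ProcessGroup,
                            inter_node_group: dist.ProcessGroup) -> torch.Tensor:
    """Two-phase token exchange instead of one global all-to-all.

    Phase 1 exchanges tokens among the GPUs of a single node (fast NVLink/PCIe);
    phase 2 exchanges the regrouped tokens across nodes, so each rank sends
    fewer but larger inter-node messages.
    """
    # Phase 1: intra-node all-to-all (assumes tokens split evenly across ranks).
    intra_out = torch.empty_like(tokens)
    dist.all_to_all_single(intra_out, tokens, group=intra_node_group)

    # Phase 2: inter-node all-to-all of the already-grouped tokens.
    inter_out = torch.empty_like(intra_out)
    dist.all_to_all_single(inter_out, intra_out, group=inter_node_group)
    return inter_out
```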