finer granular zero offload strategy #4741

chizhang118 · 2023-11-28T06:28:18Z


Currently, DeepSpeed ZeRO-Offload uses a fixed strategy for splitting computation and memory: optimizer states, gradients, and the optimizer computation are offloaded to the CPU, while everything else stays on the GPU. Do we need a finer-grained strategy that adapts to different hardware and parameter settings?
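For context, here is a minimal sketch of how the fixed strategy is typically switched on today. It assumes the `zero_optimization` / `offload_optimizer` keys from the DeepSpeed JSON config; the helper name `make_zero_offload_config` and the batch-size value are hypothetical and only for illustration. The point is that the choice is essentially on or off per component, with no way to express a partial split.

```python
import json


def make_zero_offload_config(offload_optimizer: bool = True,
                             pin_memory: bool = True) -> dict:
    """Hypothetical helper: build a minimal DeepSpeed-style config dict."""
    # ZeRO stage 2 partitions optimizer states and gradients across ranks.
    zero = {"stage": 2}
    if offload_optimizer:
        # Move optimizer states (and the optimizer step) to host memory.
        zero["offload_optimizer"] = {"device": "cpu", "pin_memory": pin_memory}
    return {
        "train_micro_batch_size_per_gpu": 1,  # placeholder value
        "zero_optimization": zero,
    }


config = make_zero_offload_config()
print(json.dumps(config, indent=2))
```

A dict like this would normally be passed to `deepspeed.initialize(..., config=config)` or saved as a JSON file; that step is left out so the sketch runs without DeepSpeed installed.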

Replies: 0 comments