You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Initialize a model that takes no host or CPU memory (for cases where the model is larger than the host memory)
Replay the operations that were played while initializing a model or a partition of the model
APIs
from pipegoose.utils import lazy_init
# load the model from `transformers`
with lazy_init(parallel_context):
model = TensorParallel(model, parallel_context).parallelize()
model = PipelineParallel(model, parallel_context).parallelize()
model = DataParallel(model, parallel_context).parallelize()
logits = model(inputs)
APIs
Reading
The text was updated successfully, but these errors were encountered: