Instance type g6.xlarge #2697

Open
silvacarl2 opened this issue Jan 2, 2025 · 3 comments

@silvacarl2

AWS has a new instance type, g6.xlarge. It is extremely cost-effective: it costs $0.8048 per hour with these specs:

vCPUs: 4
Memory: 16 GiB
GPU: 1 NVIDIA L4 Tensor Core GPU
GPU Memory: 24 GiB
Processor: 3rd Generation AMD EPYC with a base clock speed of 2.6 GHz
Network Performance: Up to 10 Gigabits per second (Gbps)
Instance Storage: 1 x 250 GB NVMe SSD
EBS-Optimized: Yes, with a maximum bandwidth of 5,000 Mbps

But for some reason the build gets stuck on this step on the g6.xlarge:

[Screenshot: Untitled]

@ggerganov
Owner

Did you try waiting it out? This step can take several minutes to complete.

@silvacarl2
Author

silvacarl2 commented Jan 3, 2025

Will let it run a lot longer and check back with you. It has made it to 83% now.

llama.cpp worked fine.

This is running on Ubuntu 22, CUDA 12.6.

@silvacarl2
Author

So we ran it for a few hours and eventually it killed the server, which makes no sense because it works great on other EC2 instance types like g4dn and g5.

If you would like, I can let you borrow a g6.xlarge.
