I would sell a kidney for one of these. It's basically impossible to train language models on a consumer 24GB card. The next step up is the A6000 Ada, at 48GB for $8,000, and this one will probably be priced somewhere in the $100k+ range.
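For a rough sense of why 24GB is so limiting, here's a back-of-the-envelope sketch. It assumes standard mixed-precision Adam training, which is commonly estimated at ~16 bytes of state per parameter (fp16 weights and gradients, plus fp32 master weights and two fp32 optimizer moments); activations come on top of that:

```python
# Rough memory estimate for training with mixed-precision Adam.
# Assumes ~16 bytes/param: 2 (fp16 weights) + 2 (fp16 grads)
# + 4 (fp32 master weights) + 4 + 4 (fp32 Adam moments).
# Activations and framework overhead are extra on top of this.
BYTES_PER_PARAM = 16

def training_mem_gb(n_params: float) -> float:
    return n_params * BYTES_PER_PARAM / 1e9

for billions in (1, 3, 7):
    print(f"{billions}B params -> ~{training_mem_gb(billions * 1e9):.0f} GB "
          "before activations")
# 7B params -> ~112 GB before activations: far beyond a 24GB card.
```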
Use four consumer-grade 4090s then. That would be much cheaper and better in almost every respect. And even then, forget about training foundation models: Meta spent 82k GPU-hours on the smallest LLaMA and about 1M hours on the largest.
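To put those GPU-hour figures in perspective, here's the simple arithmetic, taking the LLaMA numbers at face value and (very optimistically) assuming a 4090 matches the per-GPU throughput of Meta's cluster with perfect scaling:

```python
# Wall-clock time to replicate LLaMA's training budget on 4 GPUs,
# naively assuming one 4090 ~= one of Meta's training GPUs and
# perfect scaling with zero communication overhead.
gpu_hours_smallest = 82_000     # smallest LLaMA
gpu_hours_largest = 1_000_000   # largest LLaMA
n_gpus = 4

for name, hours in (("smallest", gpu_hours_smallest),
                    ("largest", gpu_hours_largest)):
    days = hours / n_gpus / 24
    print(f"{name}: ~{days:.0f} days ({days / 365:.1f} years)")
# smallest: ~854 days (2.3 years); largest: ~10417 days (28.5 years)
```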
If I remember correctly, NVLink adds about 100GB/s (where PCIe 4.0 is 64GB/s). Is it really worth dropping to 3090 performance (roughly half that of a 4090) for the extra bus speed?
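For intuition on what that bandwidth gap buys you, here's a rough estimate of per-step gradient sync time. It assumes fp16 gradients, a ring all-reduce (which moves roughly 2*(n-1)/n of the gradient volume over each GPU's link), and the bandwidth figures above; real throughput depends heavily on topology, message sizes, and how much communication overlaps with compute:

```python
# Rough per-step gradient all-reduce time: ring all-reduce moves
# about 2*(n-1)/n of the gradient volume over each GPU's link.
# Bandwidths are the rough figures from the parent comment.
def allreduce_seconds(params: float, n_gpus: int, link_gb_s: float) -> float:
    grad_bytes = params * 2                          # fp16 gradients
    volume = 2 * (n_gpus - 1) / n_gpus * grad_bytes  # ring all-reduce
    return volume / (link_gb_s * 1e9)

params = 7e9  # hypothetical 7B-parameter model
for link, bw in (("NVLink (~100 GB/s)", 100), ("PCIe 4.0 (~64 GB/s)", 64)):
    print(f"{link}: ~{allreduce_seconds(params, 2, bw) * 1e3:.0f} ms/step")
# NVLink: ~140 ms/step vs PCIe 4.0: ~219 ms/step for 7B fp16 grads.
```

Whether that per-step difference matters depends on how long each step's compute takes and how well the sync hides behind it.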