Haha, I just finished ordering 32GB of additional memory for my PC so I can run the 65B model, if that tells you anything. I'm upgrading from 32GB -> 64GB.

7B is fine, 13B is better. Both are fun toys that almost make sense most of the time, but even with a lot of parameter tuning they're often incoherent. You can tell that they have encoded fewer relationships between concepts than the higher-parameter models we've gotten used to--they're much closer to GPT-2 than GPT-3.

They're good enough to whet my appetite and give me a lot of ideas of what I want to do, they're just not quite good enough to make those applications reliably useful. Based on the reports I'm hearing here of just how much better the 65B model is than the 7B, I decided it was worth $80 for a few new sticks of RAM to be able to use the full model. Still way cheaper than buying a graphics card capable of handling it.


Heh, you just made me upgrade as well. After originally paying 130 € for 32 GB, it’s nice that I only had to pay 70 € to double it ;) Not sure if I want to run LLMs (or if my Ryzen 5 3600 is even powerful enough), but I’ve wanted some more RAM for a while.


If I were running in a server context, would the 50 GB of RAM be required to respond to one request, or can it be used to respond to multiple requests simultaneously?


I'm very late to this question, but I believe that amount is only required once (the weights can be shared across requests), while the context tensor will need to be created per request. I haven't confirmed that, though.
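
To illustrate what I mean, here's a minimal Python sketch (not llama.cpp's actual API; the names, sizes, and numpy stand-ins are made up): the weights sit in one buffer loaded at startup, and every request reads from it while allocating only its own small context buffer.

    import threading
    import numpy as np

    # Loaded once at startup; stands in for the tens of GB of model weights.
    # Every request reads from this same buffer, so it is not duplicated.
    WEIGHTS = np.zeros(1_000_000, dtype=np.float32)

    def handle_request(prompt):
        # Hypothetical per-request state: the context (KV cache) is allocated
        # fresh for each request and is small relative to the shared weights.
        context = np.zeros((2048, 512), dtype=np.float32)
        # ... inference would read WEIGHTS and update context here ...
        print(prompt, "->", context.nbytes, "bytes of private state")

    threads = [threading.Thread(target=handle_request, args=(f"request {i}",))
               for i in range(4)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

If that's right, concurrent requests mostly cost you one context buffer each, not another full copy of the model.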


I'd assume that all the calculations used for a single request would already eat up that amount of memory, but I could be wrong!
