If I was running in a server context, would the 50gb of ram be required to respo... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

iambateman on March 13, 2023 | parent | context | favorite | on: Dalai: Automatically install, run, and play with L...

If I was running in a server context, would the 50gb of ram be required to respond to one request, or can it be used to respond to multiple requests simultaneously?

lolinder on March 17, 2023 | [–]

I'm very late to this question, but I believe that that amount is only required once, but the context tensor will need to be created per request. I haven't confirmed that, though.

boredemployee on March 13, 2023 | [–]

I'd assume that all the calculations used for 1 request would already eat up that amount of memory, but I could be wrong!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact