Hacker News new | past | comments | ask | show | jobs | submit login

I always wondered if ChatGPT was somehow stateful. I assumed that it was not and the statefulness was simulated. Assumption validated.



I don’t believe this was ever in question. You can think of the model as a giant function that takes a list vectors as input and spits out a new vector. If you want the model to remember something, you have to include it in the list of input vectors for every request going forward.


It’s stateful in the web demo. But they do so by prepending chat history to new requests and automatically summarizing history once the model’s context window is exceeded.


I don't think they are doing summaries actually. Just cutting off the oldest parts of the conversation.


Summarizing would make sense however.


True




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: