Hacker News new | past | comments | ask | show | jobs | submit login

Yes, I think we are going to need a new architecture for LLMs to move beyond, "that is interesting", to something that is reliable and can be used for trusted applications.



It's not an architecture problem of the transformer at all. This is the result of thinking the idea that you can make inviolable rules for a system you don't understand is not anything but ridiculous. You're never going to make inviolable rules for a neural network because we don't understand what is going on on the inside.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: