Hacker News new | past | comments | ask | show | jobs | submit login

Some research to the contrary [1] - tldr is that they didn't find evidence that generative models really do zero shot well at all yet, if you show it something it literally hasn't seen before, it isn't "generally intelligent" enough to do it well. This isn't an issue for a lot of use-cases, but does seem to add some weight to the "giga-scale memorization" hypothesis.

[1] https://arxiv.org/html/2404.04125v2




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: