Hacker News new | past | comments | ask | show | jobs | submit login

Eating popcorn while the scaling doubters scramble to move the goalposts for the nth time.





Its number for one of the benchmark has:

**with browsing + python tools

Maybe we have different definitions of scaling?


I would consider unsupervised tool usage an achievement

But it’s not simply scaling. Who is moving the goalposts exactly?

Trying to parse this. What are you saying?



Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: