
The cost of the LLM isn't the only cost that matters, or even the most important one. Take the example of automating AI research: evaluating a move effectively means inventing a new architecture or modifying an existing one, launching a training run, and evaluating the new model on a suite of benchmarks. The ASI has to do this in a loop, gathering feedback and updating its priors - what people refer to as "grad student descent". The cost of running each train-eval iteration during the search is going to be significantly higher than the cost of generating the code for the next model.
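
A minimal sketch of that loop, just to show where the compute goes - propose_architecture and train_and_evaluate are hypothetical stand-ins, not any real API:

    import random

    def propose_architecture(history):
        """Stand-in for the LLM generating the next candidate architecture.
        Cheap relative to training: one inference call."""
        return {"depth": random.randint(2, 48),
                "width": random.choice([256, 512, 1024])}

    def train_and_evaluate(arch):
        """Stand-in for a full training run plus a benchmark suite.
        In reality this step costs orders of magnitude more compute
        than generating the candidate did."""
        # Toy score with diminishing returns in model size.
        return 1.0 - 1.0 / (arch["depth"] * arch["width"] / 1000.0)

    best = None
    history = []
    for step in range(10):                     # each pass = one expensive train-eval cycle
        arch = propose_architecture(history)   # cheap: LLM inference
        score = train_and_evaluate(arch)       # expensive: the actual bottleneck
        history.append((arch, score))          # feedback to update the searcher's priors
        if best is None or score > best[1]:
            best = (arch, score)

    print("best candidate:", best)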

You're talking about applying tree search as a form of network architecture search (NAS), which is different from applying it to LLM output sampling.

Automated NAS has been tried for (highly constrained) image classifier design, before simpler designs like ResNets won the day. Doing this for billion-parameter models would seem prohibitively expensive.
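
For contrast, here's roughly what tree search over LLM output sampling looks like - a toy beam search where sample_continuations and score are hypothetical stand-ins, not a real model API. The key difference is that expanding a node here is one cheap forward pass, not a training run:

    def sample_continuations(text, k=3):
        """Stand-in for sampling k continuations from an LLM."""
        return [text + c for c in "abc"[:k]]

    def score(text):
        """Stand-in for a cheap value model over partial outputs."""
        return -abs(len(text) - 6)  # toy: prefer outputs of length 6

    def beam_search(prompt, beam_width=2, depth=5):
        beam = [prompt]
        for _ in range(depth):
            candidates = [nxt for t in beam
                          for nxt in sample_continuations(t)]
            # each score() call is one cheap model evaluation, unlike
            # the full train-eval cycle NAS needs per node
            beam = sorted(candidates, key=score, reverse=True)[:beam_width]
        return max(beam, key=score)

    print(beam_search("x"))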

I'm not following. How do you propose search is performed by the ASI designed for "AI research", as proposed by the article?

Fair enough - he discusses GPT-4 search halfway down the article, but by the end he's discussing self-improving AI.

Certainly compute to test ideas (at scale) is the limiting factor for LLM development (says Sholto @ Google), but if we're talking about moving beyond LLMs, not just tweaking them, then it seems we need more than architecture search anyway.

Well, people certainly are good at finding new ways to consume compute power, whether it’s mining bitcoins or training a million AI models at once to generate a “meta model” that we think could achieve escape velocity. What happens when it doesn’t? And Sam Altman and the author want to get the government to pay for this? Am I reading this right?
