
It's an inherent problem with on-device AI processing, not just in the browser. I think this will only get better when operating systems start to preinstall models and provide an API that browser vendors can use as well.

Even then I think cloud hosted models will probably always be far better for most tasks.

This specific problem certainly isn't inherent to all on-device AI processing. As someone else mentioned, there are unique UX and browser constraints that come from serving large, compute-intensive binary blobs through the browser (constraints almost identically shared by games).
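To make that browser constraint concrete: a site shipping model weights has to decide whether a multi-gigabyte download even fits before starting it. A minimal sketch of that check, using plain numbers shaped like what `navigator.storage.estimate()` returns in a browser (the function name and headroom threshold are illustrative assumptions, not any real API):

```typescript
// Sketch: decide whether to fetch an on-device model or fall back to a
// cloud endpoint, based on available storage. In a real browser `usage`
// and `quota` would come from `navigator.storage.estimate()`; here they
// are plain numbers so the logic runs anywhere.

type StorageEstimate = { usage: number; quota: number };

function shouldDownloadModel(
  modelBytes: number,
  estimate: StorageEstimate,
  headroom = 0.2 // keep 20% of free quota for other site data (assumption)
): boolean {
  const free = estimate.quota - estimate.usage;
  return modelBytes <= free * (1 - headroom);
}

// A ~4 GB quantized model against a 10 GB quota with 1 GB already used:
console.log(shouldDownloadModel(4e9, { usage: 1e9, quota: 10e9 })); // true
```

Games that stream large assets through the browser do a version of this same dance, typically caching the blobs via the Cache Storage API or the Origin Private File System so repeat visits skip the download.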

Separately, relying on preinstallation very likely means stagnating on overly sanitized, poorly done official instruction-tunes. With the exception of Mixtral 8x7B, the trend has been that the community, over time, arrives at finetunes which far eclipse the official ones.


> I think cloud hosted models will probably always be far better for most tasks

It might depend on just how good you need it to be. There are lots of use cases where an LLM like GPT-3.5 is "good enough", and the gains from a stronger model wouldn't be noticeable.

Cloud models will likely have the advantage of being more cutting-edge, but running "good enough" models locally will probably be more economical.


I agree. The economic advantages of a hybrid approach could be very significant.
