These models have developed incredible abilities through pattern matching on massive text data, so I wouldn’t be too quick to put limits on what they could do.
Having them use specialized tools would probably be more effective (e.g. have the reasoning LLM use the DNA LLM), but in the long term with scale… who knows? The bitter lesson keeps biting us every time we think we know better.