I think the big breakthrough will come once it uses inner monologues during training. You can jury rig an inner monologue like that, but it isn't the same thing as training a model from scratch that is optimized to solve problems using inner monologues.