Just train another AI model to do it then! I'm not joking -- Stable Diffusion generates some pretty grotesque and low quality faces, but there are add-on models that can identify and greatly improve the faces as part of the processing pipeline.
Doesn't seem like a stretch to have similar mini-models to improve known deficiencies in larger general models in the textual space.
Doesn't seem like a stretch to have similar mini-models to improve known deficiencies in larger general models in the textual space.