One thing they don't so far do is have consistent perspective and vanishing poin...

orbital-decay · 2024-02-22T21:33:19 1708637599

As well as light and shadows, yes. It can be fixed explicitly during training by adding a classifier, as the paper you linked suggests, but it will probably also keep improving in new models on its own, simply as a result of better training sets, lower compression ratios, and models that understand the real world better.