I'm a bit out of the GPU game, so this might be slightly wrong in some places: the issue is with small triangles, because you end up paying a huge cost for them. GPUs ALWAYS shade in 2x2 blocks of pixels (quads), never single pixels.
So if you have a very small triangle (small as in how many pixels of the screen it covers) that covers just 1 pixel, you still pay the price of a full 2x2 block: 4 pixels shaded instead of 1, so you just did 300% extra shading work for that triangle.
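To put numbers on that, here's a rough back-of-the-envelope sketch (my own illustration, not anything from Unreal; the coverage numbers are made up) of the extra shading work the 2x2 quad granularity causes:

```cpp
#include <cstdio>

int main() {
    // coveredPixels: pixels actually inside the triangle.
    // quadsTouched: 2x2 blocks the triangle overlaps; the hardware shades all 4 lanes of each.
    struct Case { const char* name; int coveredPixels; int quadsTouched; };
    const Case cases[] = {
        {"1-pixel triangle",     1,   1},  // 4 pixels shaded for 1 useful one
        {"tiny sliver triangle", 3,   2},  // straddles a quad boundary
        {"large triangle",     900, 256},  // interior quads are fully covered
    };
    for (const Case& c : cases) {
        int shaded = c.quadsTouched * 4;
        double extraWork = 100.0 * (shaded - c.coveredPixels) / c.coveredPixels;
        printf("%-22s shades %4d pixels for %3d useful ones (+%.0f%% work)\n",
               c.name, shaded, c.coveredPixels, extraWork);
    }
    return 0;
}
```

The 1-pixel case is the 300% figure above; for a big triangle the overhead only shows up along its edges, so it's basically noise.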
Nanite auto-picks the best triangle size to minimize this, and probably optimizes for many more perf metrics that I have no idea about.
So even if you do it in software, the point is that if you can get rid of that 2x2 block penalty as much as possible, you can be faster than the GPU's hardware rasterizer doing 2x2 blocks, since pixel shaders can be very expensive.
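To show what "doing it in software" means at the pixel level, here's a minimal CPU-side toy sketch of a per-pixel rasterizer (my own code, not Nanite's; as I understand it the real thing runs in a compute shader and writes depth plus payload into a visibility buffer with 64-bit atomics):

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

struct Vec2 { float x, y; };

// Edge function: > 0 when p lies to the left of the edge a->b (assumes CCW winding).
static float edgeFn(Vec2 a, Vec2 b, Vec2 p) {
    return (b.x - a.x) * (p.y - a.y) - (b.y - a.y) * (p.x - a.x);
}

void rasterizeTriangle(Vec2 v0, Vec2 v1, Vec2 v2,
                       std::vector<uint32_t>& framebuffer, int width, int height,
                       uint32_t color) {
    // Clamp the triangle's bounding box to the screen.
    int minX = std::max(0,          (int)std::floor(std::min({v0.x, v1.x, v2.x})));
    int maxX = std::min(width - 1,  (int)std::ceil (std::max({v0.x, v1.x, v2.x})));
    int minY = std::max(0,          (int)std::floor(std::min({v0.y, v1.y, v2.y})));
    int maxY = std::min(height - 1, (int)std::ceil (std::max({v0.y, v1.y, v2.y})));

    for (int y = minY; y <= maxY; ++y) {
        for (int x = minX; x <= maxX; ++x) {
            Vec2 p = {x + 0.5f, y + 0.5f};            // sample at the pixel center
            float w0 = edgeFn(v1, v2, p);
            float w1 = edgeFn(v2, v0, p);
            float w2 = edgeFn(v0, v1, p);
            if (w0 >= 0 && w1 >= 0 && w2 >= 0) {      // inside the triangle
                framebuffer[y * width + x] = color;   // shade exactly this one pixel
            }
            // (A real rasterizer also handles the opposite winding and proper fill rules.)
        }
    }
}
```

The key point is that the inner loop touches exactly the covered pixels; there's no notion of a 2x2 quad, so a 1-pixel triangle costs one shade instead of four.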
This issue gets worse the higher the rendering resolution is.
Nanite then picks larger triangles instead of tiny sub-pixel ones, since triangles smaller than a pixel don't add any visual fidelity anyway.
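The usual way that kind of pick is done is with a screen-space error threshold: take the coarsest version of the geometry whose error projects to less than about a pixel. Rough sketch of the idea (the names and the pinhole-projection math are just my illustration; Nanite does this per cluster over a DAG rather than per whole mesh):

```cpp
#include <cmath>

struct LodLevel {
    float geometricErrorWorldSpace; // how far this LOD deviates from the full-res mesh
};

// lods[0] is the finest level; error grows with the index. Returns the coarsest LOD
// whose error, projected onto the screen, stays under roughly one pixel.
int pickLod(const LodLevel* lods, int lodCount,
            float distanceToCamera, float screenHeightPixels, float verticalFovRadians) {
    // World-space size that maps to one pixel at this distance (pinhole camera model).
    float worldUnitsPerPixel =
        2.0f * distanceToCamera * std::tan(verticalFovRadians * 0.5f) / screenHeightPixels;
    float maxAllowedError = 1.0f * worldUnitsPerPixel;  // tolerate ~1 pixel of error

    for (int i = lodCount - 1; i >= 0; --i) {           // try the coarsest level first
        if (lods[i].geometricErrorWorldSpace <= maxAllowedError) {
            return i;                                    // coarsest one that still looks right
        }
    }
    return 0;                                            // nothing coarse enough: use the finest
}
```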
Nanite's software rasterizer is also not used for large triangles, since those are more efficient to do in hardware.
> So even if you do it in software, the point is that if you can get rid of that 2x2 block penalty as much as possible, you can be faster than the GPU's hardware rasterizer doing 2x2 blocks, since pixel shaders can be very expensive.
Of course, the obvious problem with that is that if most of the screen isn't covered in such small triangles, you're paying a large cost for Nanite vs. traditional rendering.
Nanite has a heuristic to decide between pixel-level compute-shader rasterization and fixed-function rasterization. You can have screen-sized quads in Nanite and it's fine.
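Something like this, in spirit (the cutoff and names are my guesses, not Unreal's actual numbers; IIRC the real decision is made per 128-triangle cluster):

```cpp
// Toy version of a "small enough for software raster?" heuristic.
struct ScreenRect { float minX, minY, maxX, maxY; }; // cluster bounds in pixels

enum class RasterPath { SoftwareCompute, HardwareFixedFunction };

RasterPath chooseRasterPath(ScreenRect bounds, int triangleCount,
                            float pixelsPerTriangleCutoff = 32.0f /* made-up cutoff */) {
    float area = (bounds.maxX - bounds.minX) * (bounds.maxY - bounds.minY);
    float approxPixelsPerTriangle = area / (float)triangleCount;
    // Tiny triangles: the quad overhead dominates, so rasterize them in a compute shader.
    // Big triangles: the fixed-function hardware path handles them more efficiently.
    return approxPixelsPerTriangle < pixelsPerTriangleCutoff
               ? RasterPath::SoftwareCompute
               : RasterPath::HardwareFixedFunction;
}
```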