Yep, did exactly that. IMO he threw a fit, even though AMD was working with him ...

nomel · on Oct 6, 2023

To be fair, kernel crashes from running an AMD provided demo loop isn’t something he should have to work with them on. That’s borderline incompetence. His perspective was around integration into his product, where every AMD bug is a bug in his product. They deserve criticism, and responded accordingly (actual resources to get their shit together). It’s not like GPU accelerated ML is some new thing.

aeyes · on Oct 6, 2023

He's back on it after getting AMD's CEO to commit resources to this:

https://twitter.com/realGeorgeHotz/status/166980346408248934...

https://twitter.com/LisaSu/status/1669848494637735936

JonChesterfield · on Oct 7, 2023

That's a tough issue to read through, thanks for the link. 'Your demo code on a system setup exactly as you describe dereferences null in the kernel and falls over'. Fuzz testing + a vaguely reasonable kernel debugging workflow should make things like that much harder to find.