Imagine any diffusion-style text-to-image model on specialized ASIC hardware.



That’s what an ANE/TPU is.

If you mean putting the model weights into gates directly, it’d be useless, because users would get bored of the model as soon as they figured out what its style looked like. Also, large models can memorize their training data, so eventually you’ll get it to output something copyrighted.


These models are definitely getting to the point where no one could ever get bored of them, and they can generate many different styles.



