Hacker News new | past | comments | ask | show | jobs | submit login

Vast majority of comparisons aren't really putting these new models through their paces.

The best prompt adherence on the market right now BY FAR is DALL-E 3 but it still falls down on more complicated concepts and obviously is hugely censored - though weirdly significantly less censored if you hit their API directly.

I quickly mocked up a few weird/complex prompts and did some side-by-side comparisons with Flux and DALL-E 3. Flux is impressive and significantly performant particularly since both the dev/shnell models have been confirmed by Black Forest to be runnable via ComfyUI.

https://mordenstar.com/blog/flux-comparisons




Your comparisons are all with the flux shnell model

> The fastest image generation model tailored for local development and personal use

Versus flux pro or dev models


I did put them through pro/dev as well just to be safe. The quality changes and you can play with guidance (cranking it all the way to 10) but it doesn't make a significant difference for these prompts from what I could tell.

Several iterations and these were the best I got out of schnell, dev and pro respectively for the following prompt:

"a fantasy creature with the body of a dragon and a beachball for a head, hybrid, best quality, shadows and lighting, fantasy illustration muted"

https://gondolaprime.pw/pictures/schnell-dev-pro.jpg




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: