Hacker News new | past | comments | ask | show | jobs | submit login

I don't know a lot about image generation models, but 1B sounds super low for this kind of model, so I'm pretty impressed, personally.





If I remember correctly, SD had less than 1B parameters at launch (~2 years ago?), and you could generate pretty impressive images with the right settings and prompts.

Janus Pro 1B is a multimodal LLM, not a diffusion model, so it's got a bit more things to pack in the parameters. It is super low parameter count, in an LLM context.

Yep! Less than 1B in total [0]:

> 860M UNet and 123M text encoder

[0] https://github.com/CompVis/stable-diffusion/blob/main/README...


Oh wow okay thank you for the context



Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: