Hacker News new | past | comments | ask | show | jobs | submit login

I didn't downvote, only replied.

CogAgent is also CogVLM modified to handle documents and larger images. CogVLM is better for VQA.




Consider applying for YC's W25 batch! Applications are open till Nov 12.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: