Hacker News new | past | comments | ask | show | jobs | submit login

We've had a decent amount of luck with InternVL 2.0 w/ Llama, and are pretty excited about Llama 3.2

It's still super early in the open source x vision model space. The limiter actually seems to be the vision encoder -- advancements here will pay off huge dividends

https://huggingface.co/spaces/opencompass/open_vlm_leaderboa...




Thank you! Great insight.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: