Hacker News new | past | comments | ask | show | jobs | submit login

From the summary:

"We perform the most comprehensive evaluation of Code LLMs to date and show that StarCoderBase outperforms every open Code LLM that supports multiple programming languages and matches or outperforms the OpenAI code-cushman-001 model."

So I'd assume not up to par with gpt4 or copilot. Can't wait to see it evolve from here!




GPT4 is ways ahead. On HumanEval, it gets 67%, almost double this one.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: