- I use LLMs for code generation for a startup and they are not competitive for that yet.
- Most of the popular open models are non-commercial.
- The only practical way I know of to get large custom datasets for training is to have OpenAI's models generate them, and they forbid this in their terms of service.
Having something that's truly open and closer to GPT-4 for code generation will probably happen within less than a year (I hope) and will be a game changer for self-hosting.
- I use LLMs for code generation for a startup and they are not competitive for that yet.
- Most of the popular open models are non-commercial.
- The only practical way I know of to get large custom datasets for training is to have OpenAI's models generate them, and they forbid this in their terms of service.
Having something that's truly open and closer to GPT-4 for code generation will probably happen within less than a year (I hope) and will be a game changer for self-hosting.