In what way is this a ChatGPT implementation or equivalent? Seems like a chatbot based on a different backend, therefore it has absolutely zero link to ChatGPT.
It is a different backend but it supposedly should be roughly comparable to ChatGPT. Also, looks like it's both open source and requires a lot less hardware to run and train.
Its not open source until the weights are available. I have the hardware I need to run it but the required files are not available unless you receive special access.
You can't use what has been released unless you want to spend $500,000 on training.
With only a modicum of trolling-level here, I wonder what percentage of that training expense was used to identify and avoid "true things that must be muted because they offend someone"
Ignoring the subtext of "true things that must be muted because they offend someone", there's a whole section in the paper on how they didn't filter and the problems that causes. TL;DR:
> We observe that toxicity increases with the size of the model, especially for Respectful prompts.
It does outperform GPT3 slightly in terms of observed bias against protected groups (as in it is slightly less biased) but not substantially so.
It uses a different engine, so this is as related to ChatGPT as a Toyota Corolla is related to a BMW car. This is an efficient and open-source chatbot, which is very good news, but the authors just wrote a clickbait title and they know it.
In formal analogies, : is pronounced "is to" and :: is pronounced "as".
The purpose here is to use the relationship from a known, to describe the relationship between a partial known.
ChatGPT is to GPT3 as ChatLLaMa is to LLaMa. It uses the relationship between ChatGPT and GPT3 to extrapolate a relationship between an unknown and LLaMa.
Corolla:Toyota::3-Series:BMW. If you had heard of a Corolla, Toyota, and BMW, but not a 3-Series, you now roughly know that a 3-Series is BMWs equivalent to a Corolla.
I think I prefer the other commenter's point, referring to ChatGPT as a known learning paradigm for chatbots. But thanks for the little crash course on analogies ;)
Yes and no. You wouldn't say GPT to mean large language models or autoregressive language models. I would've thought the same to be true for ChatGPT instead of Chatbots with RL from human feedback (RLHF), perhaps the field is moving towards adopting ChatGPT as a paradigm name. Note that the title doesn't say a ChatGPT-like model based on LLaMa, it says outright opensource implementation of ChatGPT.
> You wouldn't say GPT to mean large language models or autoregressive language models.
In the analogy, that’s exactly what you are saying. Identical to Toyota and BMW meaning “the make of the car.”
Maybe reimplementation is a more precise word, a black box re-engineering/cloning. In this case I inferred it by knowing it was a different LLM underneath, and that this group didn’t have access to the chatgpt source code.
The analogy is somewhat accurate, but also moot, since within the ML community "ChatGPT" can be used either as the product or the method (more specifically called RLHF) somewhat interchangeably. It's more like Google/Googling, where the largest/most popular provider becomes the defacto way to refer to a method.
As someone who develops DL models, the title seems quite apt.