In what way is this a ChatGPT implementation or equivalent? Seems like a chatbot...

rnosov · on Feb 27, 2023

It is a different backend but it supposedly should be roughly comparable to ChatGPT. Also, looks like it's both open source and requires a lot less hardware to run and train.

Taek · on Feb 27, 2023

Its not open source until the weights are available. I have the hardware I need to run it but the required files are not available unless you receive special access.

You can't use what has been released unless you want to spend $500,000 on training.

pmarreck · on Feb 27, 2023

With only a modicum of trolling-level here, I wonder what percentage of that training expense was used to identify and avoid "true things that must be muted because they offend someone"

nl · on Feb 28, 2023

Ignoring the subtext of "true things that must be muted because they offend someone", there's a whole section in the paper on how they didn't filter and the problems that causes. TL;DR:

> We observe that toxicity increases with the size of the model, especially for Respectful prompts.

It does outperform GPT3 slightly in terms of observed bias against protected groups (as in it is slightly less biased) but not substantially so.

basch · on Feb 27, 2023

It is an analogy.

ChatGPT:GPT3::ChatLLaMa:LLaMa

levesque · on Feb 27, 2023

It uses a different engine, so this is as related to ChatGPT as a Toyota Corolla is related to a BMW car. This is an efficient and open-source chatbot, which is very good news, but the authors just wrote a clickbait title and they know it.

basch · on Feb 27, 2023

In formal analogies, : is pronounced "is to" and :: is pronounced "as".

The purpose here is to use the relationship from a known, to describe the relationship between a partial known.

ChatGPT is to GPT3 as ChatLLaMa is to LLaMa. It uses the relationship between ChatGPT and GPT3 to extrapolate a relationship between an unknown and LLaMa.

see Analogies.pdf https://resources.finalsite.net/images/v1584287027/brockton/...

Corolla:Toyota::3-Series:BMW. If you had heard of a Corolla, Toyota, and BMW, but not a 3-Series, you now roughly know that a 3-Series is BMWs equivalent to a Corolla.

levesque · on Feb 27, 2023

I think I prefer the other commenter's point, referring to ChatGPT as a known learning paradigm for chatbots. But thanks for the little crash course on analogies ;)

basch · on Feb 27, 2023

Isn't that consistent with, and the same as, my original comment?

ChatGPT:GPT3::ChatLLaMa:LLaMa::Chatbot-through-RLHF:LLM

aka ChatGPT is a known chatbot implementation, and GPT3 and LLaMa are known LLMs.

levesque · on Feb 27, 2023

Yes and no. You wouldn't say GPT to mean large language models or autoregressive language models. I would've thought the same to be true for ChatGPT instead of Chatbots with RL from human feedback (RLHF), perhaps the field is moving towards adopting ChatGPT as a paradigm name. Note that the title doesn't say a ChatGPT-like model based on LLaMa, it says outright opensource implementation of ChatGPT.

basch · on Feb 27, 2023

> You wouldn't say GPT to mean large language models or autoregressive language models.

In the analogy, that’s exactly what you are saying. Identical to Toyota and BMW meaning “the make of the car.”

Maybe reimplementation is a more precise word, a black box re-engineering/cloning. In this case I inferred it by knowing it was a different LLM underneath, and that this group didn’t have access to the chatgpt source code.

f_devd · on Feb 27, 2023

> Toyota Corolla is related to a BMW car.

The analogy is somewhat accurate, but also moot, since within the ML community "ChatGPT" can be used either as the product or the method (more specifically called RLHF) somewhat interchangeably. It's more like Google/Googling, where the largest/most popular provider becomes the defacto way to refer to a method. As someone who develops DL models, the title seems quite apt.