I just tried asking ChatGPT to rate various BBC and NYT articles out of 10, and ...

throwthrowuknow · 2024-07-24T10:29:32 1721816972

ChatGPT has a giant system prompt that you have no control over. Try using Llama and create a system prompt with clear instructions and examples. If you were going to use a model in a production system you would also want to either fine tune it or train a BERT-like model as a classifier that just outputs a score. Maybe even more than one for ranking along different dimensions.

czl · 2024-07-20T13:23:16 1721481796

Yes, do not rely on it for assessments. It generates ratings of 7 or 8 because those ratings are statistically common in its training data.