Hacker News
Baidu's Improving Retrieval Augmented Language Model with Self-Reasoning (arxiv.org)
66 points by a-s-k-af 5 months ago | 4 comments



Can anyone explain what is gained by training a model? Why not use the foundational LLM for the relevance, evidence, and trajectory processes?
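To make the question concrete, here is a minimal sketch of what the commenter is suggesting: driving the three self-reasoning stages named in the paper (relevance, evidence selection, trajectory analysis) purely by prompting a frozen foundation model, with no training at all. The `call_llm` function and all prompt wording are hypothetical stand-ins, not the paper's actual prompts.

```python
def call_llm(prompt: str) -> str:
    """Placeholder for any chat-completion API call; hypothetical stub."""
    return f"[model output for: {prompt[:40]}...]"

def self_reason(question: str, retrieved_docs: list[str]) -> str:
    # Stage 1: relevance -- judge whether each retrieved document is relevant.
    relevance = call_llm(
        f"Question: {question}\nDocuments: {retrieved_docs}\n"
        "For each document, state whether it is relevant and why."
    )
    # Stage 2: evidence -- select the sentences that support an answer.
    evidence = call_llm(
        f"Question: {question}\nRelevance analysis: {relevance}\n"
        "Cite the key sentences that serve as evidence."
    )
    # Stage 3: trajectory -- review the reasoning chain and produce the answer.
    return call_llm(
        f"Question: {question}\nEvidence: {evidence}\n"
        "Review the reasoning above and give a final answer."
    )
```

The replies below address the trade-off this sketch raises: prompting alone works, but the question is whether a model trained on these stages is more accurate per unit of cost.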


I assume you are referring to fine-tuning a model here?


You could also just continue pre-training an existing foundation model. That would still be cheaper than starting from zero.


The accuracy gained from fine-tuning or distillation is usually better than that from continued pre-training of an existing model, especially when plotted against cost.





