For narrow stuff you can do a better job than a base GPT-4/Mistral/etc. model. Fine-tune it on your own very custom data, stuff it doesn't seem to have been trained on, and it will generalize well.
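(Rough sketch of what I mean, using Hugging Face transformers/peft/datasets; the checkpoint name, JSONL path, and hyperparameters are all placeholders, not a recipe:)

```python
# Minimal LoRA fine-tune sketch; model name and data path are placeholders.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

base = "mistralai/Mistral-7B-v0.1"  # any causal LM checkpoint works here
tok = AutoTokenizer.from_pretrained(base)
tok.pad_token = tok.eos_token

model = AutoModelForCausalLM.from_pretrained(base)
# Train small low-rank adapters on the attention projections instead of
# touching all of the base weights.
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM"))

# "my_custom_data.jsonl": one {"text": ...} record per example of niche data.
ds = load_dataset("json", data_files="my_custom_data.jsonl")["train"]
ds = ds.map(lambda ex: tok(ex["text"], truncation=True, max_length=1024),
            remove_columns=ds.column_names)

Trainer(
    model=model,
    args=TrainingArguments("out", num_train_epochs=3,
                           per_device_train_batch_size=4, learning_rate=2e-4),
    train_dataset=ds,
    data_collator=DataCollatorForLanguageModeling(tok, mlm=False),
).train()
```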
You're not wrong. There's been a lot of drama over licensing and releasing datasets, and a lot of the LLM scene is just pitchmen and promoters with no better grasp of what they're doing than "trust me, it's better".
Like with "prompt engineering", a lot of people are just hiding how much of the heavy lifting comes from the base model or is a fluke of the merge. The past few "secret" set leaks turned out to be low- or no-delta diffs against common releases.
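(That kind of check is easy to run yourself; here's a rough sketch where both checkpoint names are placeholders, assuming the two models share an architecture:)

```python
# Rough weight-delta check between two checkpoints; both names are
# placeholders, and the models are assumed to share an architecture.
from transformers import AutoModelForCausalLM

a = AutoModelForCausalLM.from_pretrained("public/base-model").state_dict()
b = AutoModelForCausalLM.from_pretrained("leaked/secret-model").state_dict()

total, diff = 0.0, 0.0
for name, ta in a.items():
    tb = b[name]
    total += ta.float().norm().item() ** 2
    diff += (ta.float() - tb.float()).norm().item() ** 2

# A relative delta near zero means the "secret" model is basically the base.
print(f"relative weight delta: {(diff ** 0.5) / (total ** 0.5):.6f}")
```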
I said it a year ago, but if we want to be wowed, make this a job for MLIS holders and reference librarians. Without thorough, thoughtful curation, these things are just toys in the wrong hands.