I wonder how big that model is in RAM/disk. I use LLMs for FFmpeg all the time, and I was thinking about training a model on just the FFmpeg CLI arguments. If it were small enough, it could be a package for FFmpeg, e.g. `ffmpeg llm "Convert this MP4 into the latest royalty-free codecs in an MKV."`
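Something like that could even just be a thin wrapper script around a tiny local model. Here's a minimal sketch; the wrapper itself, the `query_local_model` stub, and the hard-coded command it returns are all hypothetical stand-ins for whatever model you'd actually train:

```python
#!/usr/bin/env python3
"""Hypothetical `ffmpeg-llm` wrapper: natural-language request -> ffmpeg command."""
import shlex
import subprocess
import sys


def query_local_model(request: str) -> str:
    # Stand-in for the tiny local model; a real version would load the
    # fine-tuned weights and generate a command from the request.
    # The return value here is just a hard-coded example.
    return "ffmpeg -i input.mp4 -c:v libsvtav1 -c:a libopus output.mkv"


def main() -> None:
    request = " ".join(sys.argv[1:])
    command = query_local_model(request)
    print(f"Proposed command: {command}")
    if input("Run it? [y/N] ").strip().lower() == "y":
        subprocess.run(shlex.split(command), check=True)


if __name__ == "__main__":
    main()
```

You'd call it as `ffmpeg-llm "Convert this MP4 into the latest royalty-free codecs in an MKV."` and confirm before anything actually runs.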
Please submit a blog post to HN when you're done. I'd be curious to know the most minimal LLM setup needed to get consistently sane output for FFmpeg parameters.
That is easily small enough to host as a static SPA. My first thought was that it would be cool to make a static web app that runs the model locally: you'd type a query and it'd give you the FFmpeg commands.
You can train a model that size on ~1 billion tokens in ~3 minutes on a rented 8xH100 80GB node (~$9/hr on Lambda Labs, RunPod, etc.) using the NanoGPT speedrun repo: https://github.com/KellerJordan/modded-nanogpt
For a run that short, you'll spend more time waiting for the node to come up, downloading the dataset, and compiling the model than actually training, though.
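Back-of-envelope on those numbers (just restating the estimates above, all approximate):

```python
# Back-of-envelope for the quoted speedrun numbers (all approximate).
tokens = 1e9             # ~1B training tokens
train_minutes = 3        # quoted wall-clock training time
node_cost_per_hr = 9.0   # ~$9/hr for a rented 8xH100 node

tokens_per_sec = tokens / (train_minutes * 60)     # ~5.6M tokens/s across the node
run_cost = node_cost_per_hr * train_minutes / 60   # ~$0.45 of billed GPU time

print(f"~{tokens_per_sec / 1e6:.1f}M tokens/s, ~${run_cost:.2f} for the training itself")
```

So the training itself is well under a dollar; spin-up, dataset download, and compile overhead dominate the bill.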
If you have modern hardware, you can absolutely train that at home, or do it very affordably on a cloud service.
I’ve seen a number of “DIY GPT-2” tutorials that target this sweet spot. You won’t get amazing results unless you’re willing to leave a personal computer running for hours or days and you have solid data to train on locally, but fine-tuning should be well within a normal hobbyist’s patience.
Not even on the edge. That's something you could train on a 2 GB GPU.
The general guidance I've used is that training a model takes roughly 8 bytes of RAM (or VRAM) per parameter, so a 0.125B-parameter model would need about 1 GB to train.
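For what that heuristic is worth (the 8 bytes/parameter figure is just the rule of thumb above; real usage depends on optimizer state, precision, batch size, and activations):

```python
# Rule-of-thumb training memory: ~8 bytes per parameter.
# (Heuristic only; optimizer state, precision, and activation memory
# can push the real number well above or below this.)
BYTES_PER_PARAM = 8

for name, params in [("0.125B (GPT-2-small-ish)", 0.125e9),
                     ("1.5B (GPT-2-XL)", 1.5e9)]:
    gb = params * BYTES_PER_PARAM / 1e9
    print(f"{name}: ~{gb:.0f} GB to train")
```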
Hm... I wonder what your use case is. I do modern enterprise Java, and the tab completion is a major time saver.
While interactive AI is all about posing a question, meditating on the prompt, then trying to fix the outcome, IntelliJ tab completion just shows what it will complete as you type, and you hit Tab when you're 100% OK with the completion, which surprisingly happens 90-99% of the time for me, depending on the project.
For context, GPT-2-small is 0.124B params (w/ 1024-token context).
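For the curious, that 0.124B falls straight out of the published GPT-2-small config (12 layers, 768-dim embeddings, 50257-token vocab, 1024 positions, tied input/output embeddings); a quick count:

```python
# Parameter count for GPT-2-small from its published config.
vocab, n_ctx, d, n_layer = 50257, 1024, 768, 12

tok_emb = vocab * d                  # token embeddings (tied with the output head)
pos_emb = n_ctx * d                  # learned position embeddings
attn = 4 * d * d + 4 * d             # q/k/v/out projections + biases
mlp = 2 * 4 * d * d + 4 * d + d      # two d<->4d linears + their biases
ln = 2 * (2 * d)                     # two layer norms per block (scale + bias)
per_block = attn + mlp + ln

total = tok_emb + pos_emb + n_layer * per_block + 2 * d  # + final layer norm
print(f"{total / 1e6:.1f}M parameters")                  # ~124.4M
```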