| | Why GRPO Is Important and How It Works (oxen.ai) |
|
1 point by Philpax 13 days ago | past | discuss
|
| | GRPO VRAM Requirements for the GPU Poor (oxen.ai) |
|
3 points by curiousinspo 19 days ago | past | 1 comment
|
| | No Hype DeepSeek-R1 Reading List (oxen.ai) |
|
4 points by Philpax 26 days ago | past
|
| | Live Dive into How to Finetune DeepSeek R1 on Synthetic Data (oxen.ai) |
|
5 points by mathi0750 26 days ago | past | 2 comments
|
| | Merkle Tree 101 (oxen.ai) |
|
1 point by gschoeni 28 days ago | past
|
| | Show HN: Beta Model Eval Tool – Run SOTA Models on Your CSV (oxen.ai) |
|
6 points by mathi0750 40 days ago | past
|
| | Show HN: GitLFS taking forever? No prob, heres the best AI data versioning tools (oxen.ai) |
|
8 points by mathi0750 57 days ago | past | 2 comments
|
| | We put 1M files into DVC, Git-LFS, and Oxen.ai (oxen.ai) |
|
7 points by sthoward 3 months ago | past | 4 comments
|
| | Paper Club: How Flux.1 models work under the hood (oxen.ai) |
|
2 points by gregschoeninger 5 months ago | past | 1 comment
|
| | Using Llama3.1 405B to generate political synthetic data (oxen.ai) |
|
5 points by gregschoeninger 6 months ago | past | 3 comments
|
| | Fine Tuning a Diffusion Transformer (DiT) from a Single YouTube Video (oxen.ai) |
|
4 points by gregschoeninger 9 months ago | past | 2 comments
|
| | How to train diffusion for text from scratch (oxen.ai) |
|
1 point by gregschoeninger 10 months ago | past | 1 comment
|
| | Oxen's Friday Paper Club: I-JEPA and a 3 Minute Challenge (oxen.ai) |
|
5 points by more_epochs 11 months ago | past | 1 comment
|
| | How Sora does its magic. Friday, 10am Pacific. Oxen.ai zoom Paper Club (oxen.ai) |
|
2 points by more_epochs 11 months ago | past | 1 comment
|
| | "Road to Sora" Paper Reading List (oxen.ai) |
|
32 points by gregschoeninger 11 months ago | past | 1 comment
|
| | Show HN: Oxen.ai – Data Diff tool to quickly find changes in CSV, parquet, etc. (oxen.ai) |
|
4 points by gregschoeninger 12 months ago | past
|
| | Guide to the Mamba architecture that claims to be a replacement for Transformers (oxen.ai) |
|
5 points by gregschoeninger on Dec 15, 2023 | past | 2 comments
|
| | SUDS – A Guide to Structuring Unstructured Data (oxen.ai) |
|
3 points by gschoeni on Dec 8, 2023 | past | 1 comment
|
| | Deep Dive into the Vision Transformers Paper (oxen.ai) |
|
40 points by gschoeni on Dec 1, 2023 | past | 8 comments
|
| | Reading List for Andrej Karpathy's "Intro to Large Language Models" Video (oxen.ai) |
|
75 points by gschoeni on Nov 27, 2023 | past | 6 comments
|
| | Fine tune and run Llama2 on CPUs (oxen.ai) |
|
2 points by byrneml on Nov 1, 2023 | past
|
| | How to run Llama-2 on CPU after fine-tuning with LoRA (oxen.ai) |
|
4 points by byrneml on Oct 24, 2023 | past | 1 comment
|