Hacker News new | past | comments | ask | show | jobs | submit | from login
Karpathy/Nano-Llama31 (github.com/karpathy)
74 points by tim_sw 21 days ago | past | 1 comment
Nano-Llama31 (github.com/karpathy)
3 points by yeldarb 26 days ago | past
Karpathy: Let's reproduce GPT-2 (1.6B): one 8XH100 node 24h $672 in llm.c (github.com/karpathy)
182 points by alecco 46 days ago | past | 58 comments
GitHub – Karpathy/LLM101n: LLM101n: Let's Build a Storyteller (github.com/karpathy)
61 points by bilsbie 66 days ago | past | 7 comments
NanoGPT: The simplest, fastest repository for training medium-sized GPTs (github.com/karpathy)
114 points by ulrischa 77 days ago | past | 21 comments
karpathy/build-nanogpt: Video + code lecture on building nanoGPT from scratch (github.com/karpathy)
9 points by codewiz 78 days ago | past | 3 comments
Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20 (github.com/karpathy)
3 points by georgehill 3 months ago | past
Reproducing GPT-2 in llm.c (github.com/karpathy)
618 points by tosh 3 months ago | past | 117 comments
Llm.c State of the Union (github.com/karpathy)
1 point by neeleshs 3 months ago | past
Layernorm (github.com/karpathy)
3 points by sva_ 4 months ago | past
Full forward pass of GPT-2 in one file of pure CUDA (github.com/karpathy)
63 points by tosh 4 months ago | past | 4 comments
Llm.c – LLM training in simple, pure C/CUDA (github.com/karpathy)
1050 points by tosh 4 months ago | past | 169 comments
Karpathy: SVM vs. K-NN on Embeddings (github.com/karpathy)
1 point by skanderbm 6 months ago | past
Code for the Byte Pair Encoding algorithm, commonly used in LLM tokenization (github.com/karpathy)
81 points by magoghm 6 months ago | past | 31 comments
Karpathy removes llama licence from llama2.c (github.com/karpathy)
3 points by orwellg1984 on July 26, 2023 | past
Llama2.c: Inference llama 2 in one file of pure C (github.com/karpathy)
707 points by anjneymidha on July 23, 2023 | past | 165 comments
KNN vs. SVM (github.com/karpathy)
3 points by tosh on April 15, 2023 | past
Neural Networks: Zero to Hero (github.com/karpathy)
1 point by greenSunglass on Jan 24, 2023 | past
NanoGPT (github.com/karpathy)
1532 points by trekhleb on Jan 11, 2023 | past | 320 comments
The simplest, fastest repository for training and fine-tuning medium-sized GPTs (github.com/karpathy)
2 points by Terretta on Jan 10, 2023 | past
nanoGPT: The simplest repository for training medium-sized GPTs (github.com/karpathy)
3 points by isoprophlex on Jan 3, 2023 | past | 1 comment
Micrograd: A Tiny Autograd Engine (github.com/karpathy)
3 points by memorable on Nov 4, 2022 | past
Neural Networks: Zero to Hero (github.com/karpathy)
3 points by gzer0 on Sept 12, 2022 | past
An autoregressive character-level language model for making more things (github.com/karpathy)
2 points by phsilva on Sept 8, 2022 | past | 1 comment
MinGPT: Minimal PyTorch re-implementation of GPT (github.com/karpathy)
223 points by memorable on Sept 6, 2022 | past | 24 comments
ArXiv-sanity lite: get recommendations of similar papers (github.com/karpathy)
3 points by ofou on March 28, 2022 | past
Pure Python from-scratch zero-dependency implementation of Bitcoin (github.com/karpathy)
2 points by ofou on June 22, 2021 | past
Karpathy's MinGPT (github.com/karpathy)
374 points by aliabd on Aug 17, 2020 | past | 102 comments
A tiny scalar-valued autograd engine (github.com/karpathy)
2 points by mooreds on April 26, 2020 | past
Micrograd: A tiny autograd engine (~50 LOC) and a neural net library (~60 LOC) (github.com/karpathy)
3 points by sebg on April 17, 2020 | past

Consider applying for YC's first-ever Fall batch! Applications are open till Aug 27.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: