| | Some Math Behind Neural Tangent Kernel (lilianweng.github.io) |
|
2 points by reqo 22 days ago | past
|
| | Extrinsic Hallucinations in LLMs (lilianweng.github.io) |
|
1 point by gregzeng95 44 days ago | past
|
| | Extrinsic Hallucinations in LLMs (lilianweng.github.io) |
|
1 point by luu 75 days ago | past
|
| | Extrinsic Hallucinations in LLMs (lilianweng.github.io) |
|
1 point by sebg 86 days ago | past
|
| | Extrinsic Hallucinations in LLMs (lilianweng.github.io) |
|
3 points by RevoGen 89 days ago | past
|
| | Diffusion Models for Video Generation (lilianweng.github.io) |
|
1 point by moks 5 months ago | past
|
| | Diffusion Models for Video Generation (lilianweng.github.io) |
|
1 point by alexmolas 5 months ago | past
|
| | Diffusion Models for Video Generation (lilianweng.github.io) |
|
2 points by TheAlchemist 5 months ago | past
|
| | Thinking about high-quality human data (lilianweng.github.io) |
|
103 points by tim_sw 7 months ago | past | 4 comments
|
| | Meta-Learning: Learning to Learn Fast (lilianweng.github.io) |
|
3 points by jxmorris12 8 months ago | past
|
| | Exploration Strategies in Deep Reinforcement Learning (2020) (lilianweng.github.io) |
|
1 point by yamrzou 8 months ago | past
|
| | Attention Mechanism Explained (lilianweng.github.io) |
|
2 points by ashvanth 10 months ago | past | 1 comment
|
| | Adversarial Attacks on LLMs (lilianweng.github.io) |
|
1 point by georgehill 11 months ago | past
|
| | Controllable Neural Text Generation (2021) (lilianweng.github.io) |
|
1 point by typicalHNuser 11 months ago | past
|
| | Attention? Attention (lilianweng.github.io) |
|
1 point by todsacerdoti on Sept 20, 2023 | past
|
| | LLM Powered Autonomous Agents (lilianweng.github.io) |
|
285 points by DanielKehoe on June 27, 2023 | past | 176 comments
|
| | Prompt Engineering: Steer a large pretrained language model to do what you want (lilianweng.github.io) |
|
190 points by sebg on March 20, 2023 | past | 49 comments
|
| | How to train large models on many GPUs? (2021) (lilianweng.github.io) |
|
216 points by eternalban on Feb 11, 2023 | past | 33 comments
|
| | The Transformer Family Version 2.0 (lilianweng.github.io) |
|
3 points by lostConnection on Jan 29, 2023 | past
|
| | The Transformer Family (lilianweng.github.io) |
|
254 points by alexmolas on Jan 29, 2023 | past | 46 comments
|
| | The Transformer Family Version 2.0 (lilianweng.github.io) |
|
2 points by sadiq on Jan 28, 2023 | past
|
| | Large Transformer Model Inference Optimization (lilianweng.github.io) |
|
136 points by headalgorithm on Jan 20, 2023 | past | 20 comments
|
| | Large Transformer Model Inference Optimization (lilianweng.github.io) |
|
3 points by axit on Jan 12, 2023 | past
|
| | How to Build an Open-Domain Question Answering System? (2020) (lilianweng.github.io) |
|
1 point by eternalban on Dec 13, 2022 | past
|
| | What Are Diffusion Models? (lilianweng.github.io) |
|
3 points by mariuz on Aug 31, 2022 | past
|
| | Contrastive Representation Learning (lilianweng.github.io) |
|
87 points by gk1 on Aug 19, 2022 | past | 10 comments
|
| | Learning with Not Enough Data Part 3: Data Generation (lilianweng.github.io) |
|
1 point by takiwatanga on April 16, 2022 | past
|
| | Learning with Not Enough Data: Semi-Supervised Learning (lilianweng.github.io) |
|
1 point by gk1 on March 25, 2022 | past
|
| | A (Long) Peek into Reinforcement Learning (2018) (lilianweng.github.io) |
|
1 point by graderjs on March 24, 2022 | past
|
| | How to Train Really Large Models on Many GPUs? (2021) (lilianweng.github.io) |
|
3 points by lnyan on March 2, 2022 | past
|
|
|
More |