Hacker News new | past | comments | ask | show | jobs | submit | from login
Some Math Behind Neural Tangent Kernel (lilianweng.github.io)
2 points by reqo 22 days ago | past
Extrinsic Hallucinations in LLMs (lilianweng.github.io)
1 point by gregzeng95 44 days ago | past
Extrinsic Hallucinations in LLMs (lilianweng.github.io)
1 point by luu 75 days ago | past
Extrinsic Hallucinations in LLMs (lilianweng.github.io)
1 point by sebg 86 days ago | past
Extrinsic Hallucinations in LLMs (lilianweng.github.io)
3 points by RevoGen 89 days ago | past
Diffusion Models for Video Generation (lilianweng.github.io)
1 point by moks 5 months ago | past
Diffusion Models for Video Generation (lilianweng.github.io)
1 point by alexmolas 5 months ago | past
Diffusion Models for Video Generation (lilianweng.github.io)
2 points by TheAlchemist 5 months ago | past
Thinking about high-quality human data (lilianweng.github.io)
103 points by tim_sw 7 months ago | past | 4 comments
Meta-Learning: Learning to Learn Fast (lilianweng.github.io)
3 points by jxmorris12 8 months ago | past
Exploration Strategies in Deep Reinforcement Learning (2020) (lilianweng.github.io)
1 point by yamrzou 8 months ago | past
Attention Mechanism Explained (lilianweng.github.io)
2 points by ashvanth 10 months ago | past | 1 comment
Adversarial Attacks on LLMs (lilianweng.github.io)
1 point by georgehill 11 months ago | past
Controllable Neural Text Generation (2021) (lilianweng.github.io)
1 point by typicalHNuser 11 months ago | past
Attention? Attention (lilianweng.github.io)
1 point by todsacerdoti on Sept 20, 2023 | past
LLM Powered Autonomous Agents (lilianweng.github.io)
285 points by DanielKehoe on June 27, 2023 | past | 176 comments
Prompt Engineering: Steer a large pretrained language model to do what you want (lilianweng.github.io)
190 points by sebg on March 20, 2023 | past | 49 comments
How to train large models on many GPUs? (2021) (lilianweng.github.io)
216 points by eternalban on Feb 11, 2023 | past | 33 comments
The Transformer Family Version 2.0 (lilianweng.github.io)
3 points by lostConnection on Jan 29, 2023 | past
The Transformer Family (lilianweng.github.io)
254 points by alexmolas on Jan 29, 2023 | past | 46 comments
The Transformer Family Version 2.0 (lilianweng.github.io)
2 points by sadiq on Jan 28, 2023 | past
Large Transformer Model Inference Optimization (lilianweng.github.io)
136 points by headalgorithm on Jan 20, 2023 | past | 20 comments
Large Transformer Model Inference Optimization (lilianweng.github.io)
3 points by axit on Jan 12, 2023 | past
How to Build an Open-Domain Question Answering System? (2020) (lilianweng.github.io)
1 point by eternalban on Dec 13, 2022 | past
What Are Diffusion Models? (lilianweng.github.io)
3 points by mariuz on Aug 31, 2022 | past
Contrastive Representation Learning (lilianweng.github.io)
87 points by gk1 on Aug 19, 2022 | past | 10 comments
Learning with Not Enough Data Part 3: Data Generation (lilianweng.github.io)
1 point by takiwatanga on April 16, 2022 | past
Learning with Not Enough Data: Semi-Supervised Learning (lilianweng.github.io)
1 point by gk1 on March 25, 2022 | past
A (Long) Peek into Reinforcement Learning (2018) (lilianweng.github.io)
1 point by graderjs on March 24, 2022 | past
How to Train Really Large Models on Many GPUs? (2021) (lilianweng.github.io)
3 points by lnyan on March 2, 2022 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: