Submissions from lilianweng.github.io

		Some Math Behind Neural Tangent Kernel (lilianweng.github.io)
		2 points by reqo 22 days ago \| past
		Extrinsic Hallucinations in LLMs (lilianweng.github.io)
		1 point by gregzeng95 44 days ago \| past
		Extrinsic Hallucinations in LLMs (lilianweng.github.io)
		1 point by luu 75 days ago \| past
		Extrinsic Hallucinations in LLMs (lilianweng.github.io)
		1 point by sebg 86 days ago \| past
		Extrinsic Hallucinations in LLMs (lilianweng.github.io)
		3 points by RevoGen 89 days ago \| past
		Diffusion Models for Video Generation (lilianweng.github.io)
		1 point by moks 5 months ago \| past
		Diffusion Models for Video Generation (lilianweng.github.io)
		1 point by alexmolas 5 months ago \| past
		Diffusion Models for Video Generation (lilianweng.github.io)
		2 points by TheAlchemist 5 months ago \| past
		Thinking about high-quality human data (lilianweng.github.io)
		103 points by tim_sw 7 months ago \| past \| 4 comments
		Meta-Learning: Learning to Learn Fast (lilianweng.github.io)
		3 points by jxmorris12 8 months ago \| past
		Exploration Strategies in Deep Reinforcement Learning (2020) (lilianweng.github.io)
		1 point by yamrzou 8 months ago \| past
		Attention Mechanism Explained (lilianweng.github.io)
		2 points by ashvanth 10 months ago \| past \| 1 comment
		Adversarial Attacks on LLMs (lilianweng.github.io)
		1 point by georgehill 11 months ago \| past
		Controllable Neural Text Generation (2021) (lilianweng.github.io)
		1 point by typicalHNuser 11 months ago \| past
		Attention? Attention (lilianweng.github.io)
		1 point by todsacerdoti on Sept 20, 2023 \| past
		LLM Powered Autonomous Agents (lilianweng.github.io)
		285 points by DanielKehoe on June 27, 2023 \| past \| 176 comments
		Prompt Engineering: Steer a large pretrained language model to do what you want (lilianweng.github.io)
		190 points by sebg on March 20, 2023 \| past \| 49 comments
		How to train large models on many GPUs? (2021) (lilianweng.github.io)
		216 points by eternalban on Feb 11, 2023 \| past \| 33 comments
		The Transformer Family Version 2.0 (lilianweng.github.io)
		3 points by lostConnection on Jan 29, 2023 \| past
		The Transformer Family (lilianweng.github.io)
		254 points by alexmolas on Jan 29, 2023 \| past \| 46 comments
		The Transformer Family Version 2.0 (lilianweng.github.io)
		2 points by sadiq on Jan 28, 2023 \| past
		Large Transformer Model Inference Optimization (lilianweng.github.io)
		136 points by headalgorithm on Jan 20, 2023 \| past \| 20 comments
		Large Transformer Model Inference Optimization (lilianweng.github.io)
		3 points by axit on Jan 12, 2023 \| past
		How to Build an Open-Domain Question Answering System? (2020) (lilianweng.github.io)
		1 point by eternalban on Dec 13, 2022 \| past
		What Are Diffusion Models? (lilianweng.github.io)
		3 points by mariuz on Aug 31, 2022 \| past
		Contrastive Representation Learning (lilianweng.github.io)
		87 points by gk1 on Aug 19, 2022 \| past \| 10 comments
		Learning with Not Enough Data Part 3: Data Generation (lilianweng.github.io)
		1 point by takiwatanga on April 16, 2022 \| past
		Learning with Not Enough Data: Semi-Supervised Learning (lilianweng.github.io)
		1 point by gk1 on March 25, 2022 \| past
		A (Long) Peek into Reinforcement Learning (2018) (lilianweng.github.io)
		1 point by graderjs on March 24, 2022 \| past
		How to Train Really Large Models on Many GPUs? (2021) (lilianweng.github.io)
		3 points by lnyan on March 2, 2022 \| past