Submissions from yaofu.notion.site

		How Do Language Models Put Attention Weights over Long Context? (yaofu.notion.site)
		2 points by jxmorris12 6 months ago
		Towards 100x Speedup: Full Stack Transformer Inference Optimization (yaofu.notion.site)
		1 point by magoghm on Dec 10, 2023
		A Stage Review of Instruction Tuning (yaofu.notion.site)
		2 points by panabee on June 29, 2023
		Towards Complex Reasoning: The Polaris of Large Language Models (yaofu.notion.site)
		2 points by bluehat974 on May 31, 2023
		Towards Complex Reasoning: The Polaris of Large Language Models (yaofu.notion.site)
		2 points by tim_sw on May 3, 2023
		How Does GPT Obtain Its Ability? Tracing Emergent Abilities of Language Models (yaofu.notion.site)
		4 points by johnthewise on April 18, 2023
		How does GPT obtain its ability? Tracing emergent abilities of language models (yaofu.notion.site)
		414 points by headalgorithm on Dec 14, 2022 | 192 comments
		How Does GPT Obtain Its Ability? Tracing Abilities of LMs to Their Sources (yaofu.notion.site)
		2 points by magoghm on Dec 13, 2022