Hacker News new | past | comments | ask | show | jobs | submit login
Reinforcement Learning for Language Models – Why RL (gist.github.com)
4 points by tim_sw on April 22, 2023 | hide | past | favorite



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: