Hacker News new | past | comments | ask | show | jobs | submit login

Looks like it's maybe about genetic evolution of neural networks controlling bodies in a 3D simulation.



It‘s Reinforcement Learning. In each state, the agent can choose from a set of actions. This leads to a state transition, and the agent gets a reward which it can use to adjust it‘s policy. A policy is just a conditional probability distribution over actions, given a state p(a | s).




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: