Looks like it's maybe about genetic evolution of neural networks controlling bod... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

hypertele-Xii on July 20, 2021 | parent | context | favorite | on: Megaverse: Simulating Embodied Agents at One Milli...

Looks like it's maybe about genetic evolution of neural networks controlling bodies in a 3D simulation.

Hendrikto on July 21, 2021 [–]

It‘s Reinforcement Learning. In each state, the agent can choose from a set of actions. This leads to a state transition, and the agent gets a reward which it can use to adjust it‘s policy. A policy is just a conditional probability distribution over actions, given a state p(a | s).

Consider applying for YC's Spring batch! Applications are open till Feb 11.
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact