Sorry, I meant compare the RL version as it trains to the analytical version. It... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

rcfox on Feb 15, 2019 | parent | context | favorite | on: Controlling a 2D Robotic Arm with Deep Reinforceme...

Sorry, I meant compare the RL version as it trains to the analytical version.

It's certainly neat that inverse kinematics can be learned from zero knowledge, but I would have a hard time trusting it to operate a real arm in an industrial setting.

formalsystem on Feb 15, 2019 [–]

You'd be entirely right not to trust it as it is in an industrial setting. There's been some research around safe exploration that would add additional terms to the reward function to do things like punish flailing around and such but I haven't experimented with those techniques myself.

Consider applying for YC's first-ever Fall batch! Applications are open till Aug 27.
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact