What's nice about the move is that the reinforcement learning algorithm doesn't fundamentally change. The action and state spaces will be larger, since a joint has more degrees of freedom in 3D than it does in 2D. Learning may therefore take longer, so we'll also need to increase the size of the replay buffer and the episode length.
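To make the scaling concrete, here's a rough sketch of how the settings might change. Everything in it is an assumption for illustration: the `TrainingConfig` class, the joint count, and the buffer and episode numbers are hypothetical, not values from the actual project. It also assumes hinge joints in 2D (1 degree of freedom each) and ball joints in 3D (up to 3 degrees of freedom each).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingConfig:
    """Hypothetical container for the settings that change between 2D and 3D."""
    num_joints: int
    dof_per_joint: int          # degrees of freedom each joint can actuate
    replay_buffer_size: int     # number of transitions kept for replay
    max_episode_steps: int      # episode length cap

    @property
    def action_dim(self) -> int:
        # One action component per actuated degree of freedom.
        return self.num_joints * self.dof_per_joint


# Illustrative numbers only; the right values depend on the agent and task.
config_2d = TrainingConfig(
    num_joints=6, dof_per_joint=1,
    replay_buffer_size=100_000, max_episode_steps=500,
)
config_3d = TrainingConfig(
    num_joints=6, dof_per_joint=3,
    replay_buffer_size=1_000_000, max_episode_steps=1_000,
)

print("2D action dim:", config_2d.action_dim)
print("3D action dim:", config_3d.action_dim)
```

The learning algorithm itself would consume either config unchanged; only the dimensions and budgets differ.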
I'm planning on writing a followup post on 3D so stay tuned!