Sorry, I meant "Mountain Cart" not cartpoll - https://www.gymlibrary.dev/environ...

Sorry, I meant "Mountain Cart" not cartpoll - https://www.gymlibrary.dev/environments/classic_control/moun...

The reason for this is that the algorithm doesn't like to have to "spend" energy, reducing its score. Without huge amounts of trickery to get the gradient descent algorithm to stop getting stuck in the center, this is never solved - due to using a local optimizer for a global optimization problem (finding good weights in a NN)