Pretty much. The XML is mainly used to specify the 'mission' that you want your agent to perform. You write your agent in Python (or a few other languages), and it can be as simple as this: https://github.com/Microsoft/malmo/blob/master/Malmo/samples...
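To give a rough idea of the split between the two, here is a minimal sketch, not a copy of the linked sample. The mission XML is an illustration modelled on the Malmo samples and has not been validated against the schema; the agent name `SketchBot` and the helper `summarize_mission` are made up for this example. `run_agent` uses the `MalmoPython` calls as I recall them from the Python samples, so check the linked sample for exact usage; it only works with Malmo installed and a Minecraft client running.

```python
import xml.etree.ElementTree as ET

MALMO_NS = "http://ProjectMalmo.microsoft.com"

# Illustrative mission: flat world, one agent, 10 second time limit.
# Element names follow the Malmo samples; not checked against the schema.
MISSION_XML = f"""<Mission xmlns="{MALMO_NS}">
  <About>
    <Summary>Walk forward on a flat world</Summary>
  </About>
  <ServerSection>
    <ServerHandlers>
      <FlatWorldGenerator/>
      <ServerQuitFromTimeUp timeLimitMs="10000"/>
    </ServerHandlers>
  </ServerSection>
  <AgentSection mode="Survival">
    <Name>SketchBot</Name>
    <AgentStart/>
    <AgentHandlers>
      <ObservationFromFullStats/>
      <ContinuousMovementCommands/>
    </AgentHandlers>
  </AgentSection>
</Mission>"""


def summarize_mission(xml_text):
    """Hypothetical helper: parse the mission XML and list what it declares."""
    ns = {"m": MALMO_NS}
    root = ET.fromstring(xml_text)
    agent = root.find("m:AgentSection", ns)
    handlers = agent.find("m:AgentHandlers", ns)
    return {
        "summary": root.findtext("m:About/m:Summary", namespaces=ns),
        "agent_name": agent.findtext("m:Name", namespaces=ns),
        "mode": agent.get("mode"),
        # Strip the "{namespace}" prefix ElementTree puts on each tag.
        "handlers": [child.tag.split("}", 1)[1] for child in handlers],
    }


def run_agent(xml_text):
    """Agent side. Needs Malmo installed and a Minecraft client running."""
    import time
    import MalmoPython  # imported here so the rest runs without Malmo

    agent_host = MalmoPython.AgentHost()
    mission = MalmoPython.MissionSpec(xml_text, True)  # True = validate XML
    record = MalmoPython.MissionRecordSpec()           # no recording
    agent_host.startMission(mission, record)

    # Wait for the mission to start, then walk forward until it ends.
    world_state = agent_host.getWorldState()
    while not world_state.has_mission_begun:
        time.sleep(0.1)
        world_state = agent_host.getWorldState()
    agent_host.sendCommand("move 1")
    while world_state.is_mission_running:
        time.sleep(0.1)
        world_state = agent_host.getWorldState()


if __name__ == "__main__":
    # Safe without Malmo: only inspects the XML.
    print(summarize_mission(MISSION_XML))
```

The XML only declares the world, the time limit, and which observations and commands the agent gets; all of the behaviour lives in the Python loop.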
We talk to the Minecraft Mod over TCP.