Sometimes I wish I had the reach of Google Deepmind. I created a sandbox environ...

doctorpangloss · 2024-03-13T17:00:56 1729762357

Isn't an essential part of what they are doing, and why they have results, that they are tackling all games at the same time, rather than focusing on one? Is Disco Elysium a good choice?

nsagent · 2024-03-13T17:30:43 1729762357

Good point, they are quite different objectives.

Their approach works for simple directives such as "Go to ship" or "Pick up iron ore", which lends itself well to sandbox-like games (these seem to be a major focus, judging by Deepmind's tech report). Similar research has been done in Minecraft [1].

These instruction-following agents are more an RL achievement than a language-understanding achievement. On the other hand, Disco Elysium has over a million words of dialogue, and solving the quests requires an agent to understand and reason about language much more extensively. People have looked at text-based game agents, like Microsoft's TextWorld [2], but these are much smaller in scope and not easily adapted for human-in-the-loop use.

My work bridges that gap, focusing on the language aspect rather than on navigating a 3D world. Again, they are definitely different objectives, but as a solo researcher there's no way I can compete with Deepmind's budget and manpower anyway. Just look at the extensive author list in the tech report. So it doesn't make sense for me to focus on outcompeting them at producing a better generalized RL agent (in fact, I merely use GPT-4). Instead, I made a publicly available experimentation platform that others can build upon, which is valuable for the community at large.

At least, that's my take.

[1]: https://sites.google.com/view/steve-1

[2]: https://www.microsoft.com/en-us/research/project/textworld/