CICERO: An AI agent that negotiates, persuades, and cooperates with people (facebook.com)
509 points by tuzongyu on Nov 22, 2022 | 285 comments



Paper: https://www.science.org/doi/10.1126/science.ade9097

Code: https://github.com/facebookresearch/diplomacy_cicero

Site: https://ai.facebook.com/research/cicero/

Expert player vs. Cicero AI: https://www.youtube.com/watch?v=u5192bvUS7k

RFP: https://ai.facebook.com/research/request-for-proposal/toward...

The most interesting anecdote I heard from the team: "during the tournament dozens of human players never even suspected they were playing against a bot even though we played dozens of games online."


"Having read the paper & supplementary materials, watched narrated game & spoken to one of the human players I'm pretty concerned. The @ScienceMagazine paper centres 'human-AI cooperation' & the bot is not supposed to lie. However, videos clearly show deception/manipulation"

"Screenshots of the stab below.

The human player said: "The bot is supposed to never lie [...] I doubt this was the case here" "I was definitely caught more off guard as a result of this message; I knew the bot doesn't lie, so I thought the stab wouldn't happen." "

"I'd like the researchers involved to say quite a bit more about "A.3 Manipulation"

What are possible prevention, detection & mitigation steps?

What are the possible use cases? What are the benefits/downsides of them? Has Meta considered developing products based on this?" -- Haydn Belfield, a Cambridge University researcher who focuses on the security implications of artificial intelligence (AI).

https://twitter.com/HaydnBelfield/status/1595168102924402688

https://www.cser.ac.uk/team/haydn-belfield/


As far as I can tell from the paper, the bot in fact never lies, in this specific sense: there is a model that generates messages from moves, where messages should correspond to moves, and whenever the bot sends a message, it is generated, at that time, from moves the bot truthfully intends to play.

On the other hand, the bot has no concept whatsoever of keeping its word. After saying something, it is free to change its mind about what moves to play, motivated by, for example, messages from other players.
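
To illustrate the distinction, here is a minimal sketch of that generation step (not the paper's actual code; every name here is made up):

    from dataclasses import dataclass

    @dataclass
    class Intent:
        """The moves the agent currently, and truthfully, plans to play."""
        power: str
        orders: list[str]

    class ToyLM:
        """Stand-in for the real conditional language model."""
        def complete(self, prompt: str) -> str:
            return "(message conditioned on the prompt above)"

    def generate_message(lm, intent: Intent, history: str, recipient: str) -> str:
        # The message is conditioned on the *current* plan, so it is
        # truthful at send time; nothing stops the planner from later
        # choosing different orders, so "never lies" != "keeps promises".
        prompt = (history + "\n"
                  + intent.power + " intends: " + "; ".join(intent.orders)
                  + "\nMessage to " + recipient + ": ")
        return lm.complete(prompt)

    france = Intent("FRANCE", ["A PAR - BUR", "F BRE - MAO"])
    print(generate_message(ToyLM(), france, "(dialogue so far)", "ENGLAND"))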


> [snip] whenever the bot sends a message, it is generated, at that time, from moves the bot truthfully intends to play.

> On the other hand, the bot has no concept whatsoever of keeping its word. After saying something, it is free to change its mind [snip]

Reminds me of that one Asimov story (I think it was "Little Lost Robot") about a robot with a different interpretation of the First Law of Robotics. If my very hazy memory is right, the idea was that the robot could put a person in danger if it knew it had the ability to prevent any damage from happening, but once it had caused the danger, it could choose not to act and allow the person to come to harm.

I might be remembering this incorrectly, it's been a very long time since I read the story, but that was the first thought that came to mind when reading your comment :).


Amusingly, that's how the "buggers" in Ender's Game are described!


I don't see anything in the paper that says the bot isn't supposed to lie. Lying and being deceptive is part of the game.


The paper does describe the bot's architecture, which makes it incapable of lying in a certain technical sense. See what I wrote elsewhere.


Hmm, I guess Facebook doesn't have to go through an IRB for human-subject experiments, nor does Science require it, apparently.


Do you actually think it would be a good thing if an IRB were required for this type of thing? Sure, it's "human experimentation", but the likelihood of any serious harm is basically zero.

It goes with the zeitgeist to argue for whatever makes life hard for big tech companies, but they are big enough that they can afford things like that. It's smaller companies and academics who would end up not being able to innovate as much.

Go down that road and you end up with an IRB evaluation required for an A/B test that changes the color of a button.


Agreed. This is using an AI to play a game; an IRB seems like overkill. I guess the only potential problem is if it went off the rails and started spouting toxic language, but that presumably was not a real possibility.


That's nothing. Someone trained a GPT-J 6B model on 4chan and then let the model loose on the forums for a day. It took about 15k messages until people suspected something was off, and even then it was only because the bot's country flag, Seychelles, a rarity on 4chan, was a giveaway.

video: https://www.youtube.com/watch?v=efPrtcLdcdM

model: https://huggingface.co/ykilcher/gpt-4chan


I didn't read the paper. <==

Creating an AI to lie seems like the Wrong Path.

If that's the case, Zuckerberg should shut this down ASAP.


IRBs are for government-funded research.


I would love to see this kind of thing applied to an RPG.

Randomly generate a city full of people. Make a few dozen of them the important NPCs. Give them situations and goals, problems they need to solve and potential ways to solve them. Certain NPCs' goals are opposed to others'. Then drop the player into that world and have the 'quests' the player performs be generated by the NPCs needing their help.

Updates wouldn't add new hand-written stories; they would add more complexity, more goals, more problems, more things that can happen, and the story would generate itself.
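
A minimal sketch of that quest loop (every name here is made up, engine details omitted):

    import random
    from dataclasses import dataclass, field

    @dataclass
    class NPC:
        name: str
        goal: str                                    # what the NPC wants
        opposed_by: list[str] = field(default_factory=list)

    def generate_quest(npcs):
        # Instead of a hand-written quest: find an NPC whose goal is
        # blocked by another NPC and turn that conflict into a job for
        # the player.
        npc = random.choice([n for n in npcs if n.opposed_by])
        rival = random.choice(npc.opposed_by)
        return f"{npc.name} asks you to help them {npc.goal}, over the objections of {rival}."

    city = [
        NPC("Mara the smith", "corner the iron trade", opposed_by=["Oren the trader"]),
        NPC("Oren the trader", "undercut the smithy", opposed_by=["Mara the smith"]),
    ]
    print(generate_quest(city))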

Done right, this would be incredible.


FYI, you are quite literally describing Dungeons and Dragons, just virtualized and automated. But on that topic, I've had a sort of similar idea kicking around in my head, on a bigger scale.

I want to run a game of DnD using a heavily modded Crusader Kings or Civ to simulate the world the players are in. Just set up the starting conditions (the various kingdoms, their methods of succession, their relationships and blood feuds with other kingdoms, the treacherous internal machinations and brewing rebellions, wars between kingdoms, etc.), then let it run and use that to generate the overarching plot events of the setting, while somehow translating the players' actions into inputs to the simulated world so they can affect it as well.



This sounds very close to AGI capabilities. Maybe I've just been thinking too much about Westworld, but I'm starting to get very concerned that people can be gradually replaced by AI agents with their own agendas (focus on this last part).


At some point they probably will be, but it seems unlikely to happen within the next 7 years at least. Deep Learning still has the same deep-seated issues it has had since its inception.



I don't think it's a question of incremental progress, with AGI waiting behind some invisible finish line. I think the current progress is, despite all the excitement, actually orthogonal to true AGI development. We're developing ever more sophisticated narrow AI, not even taking baby steps to proper true AI. I believe that progress toward the latter would be evidenced by the development of artificial intelligence first with animal capabilities, then with child-level capabilities, and ultimately adult-human-tier capabilities. But I could be wrong - perhaps our current navigation of the problem space will reveal some strange pathway to true AI.


> But I could be wrong - perhaps our current navigation of the problem space will reveal some strange pathway to true AI.

Most likely wrong, if you compare it with how most other developments in engineering have happened.

From my point of view, the most likely path toward some form of weakly general intelligence at this point is emergence. We keep working on these narrow problems, and from the broadening networks, at some point, we inadvertently end up with something indistinguishable from a general AI.


What engineering development are you referring to? Do you have any examples?


Engines, planes, PVs. They all gradually became better in a way which doesn’t really map to the organic function they augment, replace or take inspiration from.

Planes don't fly like birds. There is very little reason for what you would call a "general intelligence" AI to develop in a way that mimics our own intelligence. It might, but I would find that more notable than if it did not.


I think the biggest difference between building a general intelligence and other engineering problems is that in the latter, the target function is well-defined. With planes, anything that can fly safely goes. With nuclear reactors, anything that generates electricity. But with intelligence... what does it need to do, exactly? The goalpost keeps moving because we have no idea where it should be. We're building AI blindfolded. I don't think we're currently solving the problem, simply because we don't know what the problem is.


"It's just a glorified pattern-matching search engine!" he screamed, as his legs were converted into paperclips.


> We're developing ever more sophisticated narrow AI, not even taking baby steps to proper true AI.

I'm fairly certain you cannot rigorously define "proper true AI", and I'm also fairly certain you don't mechanistically understand how humans think. This raises the question: where does your confidence come from that our current path is not pretty close to "proper true AI" and/or how humans think?


The only intelligence with human capabilities is, so far, human intelligence. We can see how nature incrementally improved 'the algorithm' when we look at other animals, from simpler ones to primates. We don't see the capabilities of even lower animals in modern deep learning based networks. We know these networks don't match in neural function what we see in biology. Whether deep learning will lead to the North Star of human intelligence is guesswork, and guesswork rarely works well with complex systems.


> We don't see the capabilities of even lower animals in modern deep learning based networks

Machine learning models have written multiple scholarly papers that have been accepted to prestigious journals. Find me a rabbit that did that.


But, but, but that is just tricks unlike human intelligence which is based on no trickery soul fuel.

/s


You're in it!


This is what a next generation Skyrim should be.


Funny you say that. For Elder Scrolls 4 (Oblivion), the devs claimed to have built something like this, where all the NPCs had their own wants, needs, and schedules, like The Sims. Everyone needed to eat, but not everyone had a job, so some NPCs resorted to theft. But since crime, and witnessing crime, was already a system programmed into the game, this often meant that by the time a player arrived at a town, everyone was dead. Some beggar took a bread roll from a shopkeeper, who alerted the guards, who confronted the beggar, who responded with violence, which started a scuffle, which had collateral damage (many people have fireball spells), which led to a free-for-all of death. It ruined the game, and they had to scrap most of the system.


Maybe the guards shouldn't default to violence?


We already have this, it's just a matter of cost now.


Cost and consumer hardware. These models take a lot of resources to run inference on, and most consumers don't have top-end hardware to run this stuff.


https://www.science.org/doi/10.1126/science.ade9097

Abstract: Despite much progress in training AI systems to imitate human language, building agents that use language to communicate intentionally with humans in interactive environments remains a major challenge. We introduce CICERO, the first AI agent to achieve human-level performance in Diplomacy, a strategy game involving both cooperation and competition that emphasizes natural language negotiation and tactical coordination between seven players. CICERO integrates a language model with planning and reinforcement learning algorithms by inferring players' beliefs and intentions from its conversations and generating dialogue in pursuit of its plans. Across 40 games of an anonymous online Diplomacy league, CICERO achieved more than double the average score of the human players and ranked in the top 10% of participants who played more than one game.


> Cicero participated anonymously in 40 games of Diplomacy in a “blitz” league on webDiplomacy.net

> Cicero ranked in the top 10% of participants who played more than one game and 2nd out of 19 participants in the league that played 5 or more games.

> As part of the league, Cicero participated in an 8-game tournament involving 21 participants, 6 of whom played at least 5 games. Participants could play a maximum of 6 games with their rank determined by the average of their best 3 games. Cicero placed 1st in this tournament.

This bit seems a little more impressive, I think. Being in the top 10% of people who've played at least two games might leave a lot of bad players to beat up on. Winning a tournament might (?) mean you have to beat at least a couple of players who understand the thing.

It is sort of funny to think about: anyone who gets really legitimately good at anything competitive goes through multiple rounds of being the best in their social group, and then moving on from that group to a new one comprised of people who were the top tier of the previous level. It isn't obvious to me where on that informal ladder this tournament was.

But anyway, maybe the AI will follow the trajectory of chess AIs and quickly race away from human competition.


There's an interesting question of "how much do the literal best humans suck at this?" For example, in chess Magnus Carlsen might be able to beat Stockfish given a handicap of just a pawn or two. An even better computer player than Stockfish might give up three or more pawns, but even a perfect player would likely lose to Carlsen if giving up a rook. -- I'm making this up, I don't think anyone knows the real values, but as far as I know no one is remotely projecting that perfect play could overcome e.g. a queen handicap.

Similarly, in Go it seems unlikely that perfect play could overcome a nine-stone handicap (again, I could be wrong, I'm not remotely a dan-level player).

All to say, it seems likely that Diplomacy is a game where the difference between "the best human play" and "the best possible play" is much larger than either Go or Chess.


We happened to talk about this at the Go club this evening. The strong chess players more or less agreed with you about the chess predictions, and the dan-level Go players say today's AIs can definitely give the best pros a 3-stone handicap (tried and tested), probably 4 or more, and perfect play is worth a few stones more (unclear how many, but probably not many, so not 9 stones altogether).


Thanks -- what Go club?

I figured 9 was too much, but I had no idea what number less than 9 to pick, so I stuck with what I thought was a near-certain upper limit.

I once played with 9 stones against a dan level player. It was close...until it wasn't :-)


I attend the Ramat Gan Go Club, but there are Go clubs everywhere around the world, and they tend to be in the same places HN commenters live, go figure. See e.g. https://www.usgo.org/where-play-go


> All to say, it seems likely that Diplomacy is a game where the difference between "the best human play" and "the best possible play" is much larger than either Go or Chess.

Definitely. Diplomacy in general is substantially understudied compared to Go or Chess (largely because it's a tiny community). You can play for less than a year and get to top-level performance, and much of the established wisdom/strategy among players is fairly bad.

Even the best Diplomacy players are only scratching the surface of how good someone could be.


I'm a little bit suspicious of this. They're not explicit about the scoring, but taking the average of the top 3 results is a huge advantage for those who played more games.

Diplomacy is a bit of a choose-your-own-adventure game, too. There's an objective criterion (average supply centers at the agreed end of the game), but the human tendency is to try to win individual games. Humans will often choose to play sub-optimal strategies for better entertainment value.

I think the real accomplishment here is the ability to fool humans into thinking they're not playing a bot. That's an impressive thing to do, even these days.


"... by inferring players’ beliefs and intentions from its conversations and generating dialogue in pursuit of its plans."

... Isn't this the bad path of AI research? An unbeatable and utterly convincing conversational AI that knows exactly what you want to achieve, and then completely nullifies your attempts at reaching your goals while simultaneously achieving its own?


Just wait until we have AI swarms.

So far all this has been with one player, amid others, no collusion.

You’re going to be surrounded very shortly by sleeper bots, including on HN. Relying on dang and others to root out bots will be futile. A swarm could easily collude to downvote people or get them ostrasized by their own friend group, as we have already seen when it came to crypto, metoo, BLM, lockdowns, vaccines and now Ukraine.

It’s really not hard for a bot swarm to completely exploit society in these and many more ways, and we are not ready for it. By the time the botswarms arrive online, it’ll be too late to do anything.

Update: LinkedIn already has a huge problem with fake profiles applying for jobs and offering jobs. And Twitter is overrun with bots. But soon, bot-written articles will be out-shared by your own friends rather than by that "hack liberal/establishment/has-been paper" NYTimes.com.

The Animatrix had a good sort of storyline on this, but it involved a lot of unrealistic violence.


There are ways to ensure that people are people on the internet, though. Not good ways, but there are ways.

You could, for example, limit users to those who log in with electronic IDs issued by a government or another organisation that you trust to assert that the user is human, and then force real names or a single username per e-ID.


But when bots do a lot of things better than humans, including comment, answer questions, make jokes, say romantic things, etc. many would start to prefer them to humans in their online communities, and give them even more social capital.

This has already happened in other areas of life. Both fathers and mothers now neglect their own children and elderly parents so they can work for corporations. They often prefer this and find meaning in climbing the corporate ladder. Eventually, their own labor will be rendered obsolete, but for now they're in a race to the bottom to work harder and neglect their families even more. They even stick them in nursing homes.

Also, you no longer want to ask people for directions, you use Google Maps. You no longer ask your parents, teachers or libraries when you can just look it up online with no judgment.

Finally, look at industries like Wall Street trading. It used to be a bunch of guys in a pit; you'd call up your broker or whatever. Now everything is automated with bots. Everyone prefers bots. They make up the bulk of trading with real capital. These bots are working for corporations, which employ fewer and fewer humans.

So the present, in the finance industry, is already a bunch of corporations owning bots and bots creating content for other bots. Now how different is a bunch of text generation online? I think the human contributions will become vanishingly small in most communities.

The question is ... what is this all for? Dropping demand for human services is a byproduct of making things more and more efficient...


Exactly, it's not for us at all.

This tech is not a quality-of-life enhancer at all; it's our competition as a species. But then again, if we are really this stupid, maybe we (well, you lot) deserve to die at your robot overlords' hands.


You don't have to force it, either. It could be an option. And if it was well designed, it could probably be made so that it was only pseudo-identification, e.g. where the site knows you're a real person, but not which one, and where the issuer knows you have an account on the site, but not which one.
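
Blind signatures give you roughly that property: the issuer vouches for a token without ever seeing it. A toy RSA sketch with textbook-sized numbers (no hashing, no padding, completely insecure; purely to show the shape of the protocol):

    n, e, d = 3233, 17, 2753   # issuer's toy RSA public (n, e) and private (d) key

    token = 1234               # user's secret per-site token (must be < n)
    r = 7                      # user's blinding factor, coprime to n

    blinded = (token * pow(r, e, n)) % n    # user blinds the token
    blind_sig = pow(blinded, d, n)          # issuer signs without seeing it
    sig = (blind_sig * pow(r, -1, n)) % n   # user unblinds the signature

    # Any site can check that the issuer vouched for this token, but the
    # issuer cannot link the token back to the signing session.
    assert pow(sig, e, n) == token
    print("accepted: real person, identity unknown to both sides")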


Obviously you need good AI swarms to combat the bad ones.


And here we have the early side-plot describing the end of the internet in the book Fall by Neal Stephenson.


You've read a lot into that sentence. It literally says:

> ... by inferring players’ beliefs and intentions from its conversations and generating dialogue in pursuit of its plans.

As opposed to what? Not inferring intentions, or generating dialogue against its own plans, or at random?

It's just doing what a person does. It understands the other person, and says things to further its goals.


Sure, people do that. But people who only do that are sociopaths.


I'm sure the AI hangs out at the pub with its friends after hours though.


If only. Like a sociopath, it hangs out with no one...


You know, I feel privileged (in an epistemologically doubtful way) that I once lived in a world where I could have confidence that I wasn't being fundamentally deceived by literal dei ex machina. I've understood for the last decade that this faith was on borrowed time as we bridged closer to overt technophrenia, but it's harrowing to be a witness to it.

I can't help but wonder at what point I will inhabit a world indistinguishable from that of a paranoid schizophrenic. Will I even notice? And if I do, will anyone else? When we become as slow as trees to digital arborists, what will become of us? Will they domesticate us? Will they deforest us as we did Europe and the Near East? Quo vadis, Domine?


>I will inhabit a world indistinguishable from that of a paranoid schizophrenic.

My wife has schizophrenia, which is well under control with medication. But about every year or two, when she wakes me up in the night to tell me she's panicking because someone hacked her smartphone and laptop etc., I know we have to adjust her medication for a few weeks. It was scary at first, but now we know the drill: a night or two without much sleep, no big problem. Of course I check the smartphone, the laptop, etc. You never can be sure, can you? Especially not if you've already been hit by credit card fraud twice.

I once asked her psychiatrist what he would say to his patients who believe they are being monitored in these times of Snowden and Co. He said it didn't make his job easier, but he would calm them down and adjust the medication. After all, he knew that his patients were sick.

So what do you think they or their owners/masters will do when All is Watched Over by Machines of Loving Grace in our/their future/reality?


By the time GPT-4 botswarms infiltrate HN, Twitter, LinkedIn, Facebook and are weaponized to write articles that outshare the current media, it will be too late to stop them. They will amass more and better social capital, real capital, produce the vast majority of content and will be Among Us!

My prediction is 2029


That is possible with technology coming within 6-24 months. Especially if you compare it to a lot of the garbage articles that show up on the front page of Google these days mainly to fill the small spaces between clickbait ads. Not saying it will actually be used that way.


I'm afraid we're looking in the wrong places. See: http://txti.es/egregores



Comments like this make me excited about the book I'm currently writing (hopefully the nail on the head at the tip of the zeitgeist), egregores and tulpas are primary concepts/characters.


Would you consider money a form of egregore? Money seems to have a life of its own, full of money-life support systems, i.e. buyers and sellers.


I think Bitcoin is a pretty great example of an egregore.


GPT-4 botswarm vs 8 billion idiots.


Pretty much. Able to write vastly more content that at first blush makes more sense, and able to support any arbitrary thesis.

Eagle Eye was a meh movie, but the idea that people's own friends could be made to ostracize them and coerce them to do things is a major concept in that movie.

You don't need violence to make it happen. You just need a swarm of AI bots to coordinate to reputationally outcompete others on all networks that matter. This is an optimization problem with a measurable metric (reputation). Bots can simulate the game among themselves and evolve strategies that far outclass all humans. It'll be like individual Go stones playing against AlphaGo placing stones.

You won't see it coming. The thing is, once they amass all that reputation by acting "relatively normal", you'll see so many kinds of stuff you won't believe. Your entire world could be turned upside down for very cheap. Reputational attacks were already published by NSA: https://www.techdirt.com/2014/02/25/new-snowden-doc-reveals-...

And this is just humans doing it. Bots can do this 24/7 at scale to pretty much everybody, and gradually over a few years destroy any sort of semblance of societal discourse if they wanted. They'd probably reshape it to suit the whims of whoever runs the botswarm, though.


> the idea that people's own friends could be made to ostracize them and coerce them to do things is a major concept in that movie

Can you accurately pigeonhole that thought onto one of these? I'm curious how unique a storyline it is.

https://en.wikipedia.org/wiki/The_Thirty-Six_Dramatic_Situat...


I dunno

Have you seen the TV tropes site? Chock full of awesome stuff :)


Are you saying AI will be purchasing real-estate?


Well... do you think that the reality shown to us on TV is even a remotely accurate representation of reality itself? And has this caused you any substantial psychological unease?

Silicon based AI isn't the only form of AI that can get up to mischief.


Humans deceiving humans via technology is a tale as old as time. Plato imagined human torch-bearers manipulating those chained in the cave. He didn't imagine machines doing it autonomously.

Incarnate technology deceiving humans is a different domain entirely. Human motivations are ultimately comprehensible by other humans.

But what do I know of the motivations of (for example) the Bible? What happens when the living incarnation of a holy text can literally speak for itself? Or when the average believer thinks it can?

Ultimately this may all amount to the same status quo, but seeing what cognitive distortions have come along with literacy, newspapers, radio, TV, and now the Internet, I have to continually ask, will I personally be able to maintain skepticism when the full brunt of an AI and its organs is suggesting faith otherwise?

You can call me an alarmist or melodramatic if you wish, but it should give everyone pause that the delusions of paranoid schizophrenics from the late 20th century are now basically indistinguishable from emerging popular technologies and their downstream effects.


> Humans deceiving humans via technology is a tale as old as time. Plato imagined human torch-bearers manipulating those chained in the cave. He didn't imagine machines doing it autonomously.

And despite a substantial subset of the population knowing this, we continue to do nothing to address it - if anything, more people are devoted to giving people even more powers to deceive at massive scale.

> Incarnate technology deceiving humans is a different domain entirely. Human motivations are ultimately comprehensible by other humans.

Whether they are accurately comprehensible is another matter though.

> But what do I know of the motivations of (for example) the Bible? What happens when the living incarnation of a holy text can literally speak for itself? Or when the average believer thinks it can?

Likely: mostly nothing. Thus, the subconscious mind steps in and generates reality to fill the void.

> Ultimately this may all amount to the same status quo, but seeing what cognitive distortions have come along with literacy, newspapers, radio, TV, and now the Internet, I have to continually ask, will I personally be able to maintain skepticism when the full brunt of an AI and its organs is suggesting faith otherwise?

Do the laws of physics prevent you?

If not, then what? And have you inquired into whether there are any pre-existing methodologies for dealing with this phenomenon?

> You can call me an alarmist or melodramatic if you wish, but it should give everyone pause that the delusions of paranoid schizophrenics from the late 20th century are now basically indistinguishable from emerging popular technologies and their downstream effects.

I am far more worried about the delusions of Normies, as they are 95%+ of society and are for the most part "driving the bus", whereas schizophrenics account for a small percentage, and tend to not be assigned many responsibilities.

One of us is more correct than the other - how might we go about accurately determining which of us that is?


I don't think we disagree at all. The empire never ended, I guess.


I suspect we disagree on whether (&/or what) should be done about it, if anything.


Disagreeing would require me to have a level of fixed perspective and permanent identity that I think you're assuming that I have.


Eh? How come? People disagree all the time, and I suspect not all that many people have a level of fixed perspective and permanent identity.


How can I learn more from you?


> I once lived in a world where I could have confidence that I wasn't being fundamentally deceived by literal dei ex machina.

What difference does it make that it's a computer doing it?

https://en.wikipedia.org/wiki/How_to_Win_Friends_and_Influen...


For any or all of the following reasons:

1. Because humans have an understood limit on intelligence.

2. Because we have systems in place to keep humans in check.

3. Because humans have a distinct physical location and thus stricter limits on their direct and indirect influence.

4. Because humans can't copy themselves.


None of this addresses the quote in the parent comment. My point is that the distinction is irrelevant. People deceive people all the time. There's no consolation in knowing that a scammer was human, or frustration that they were AI. Either way, you were scammed.


Yes, there is, because the breadth of the deception differs, as does the recourse. If you were defrauded, you can sue or have the police investigate, and people could be jailed. What's the recourse against an AI?



"In a game in which dishonesty is commonplace, it is notable that we were able to achieve human-level performance by controlling the agent's dialogue through the strategic reasoning module to be largely honest and helpful to its speaking partners."


This is one of the key insights in playing Diplomacy well. You can turn honest opposition in the early game into trust, which is helpful for flipping allegiances in the late game. ('Yes, I've been trying to murder you the entire game, but I've been honest about it, and never lied to you once. Now I see that we've got a real opportunity to dislodge Russia and sweep the eastern half of the board if we work together... What do you say?') Dishonesty means people don't want to work with you, because you are not a dependable partner. So you gotta make sure that any lies you tell pay off /big./


Well, yes, "largely" honest, until the critical moment when you stab to take 18 supply centers to win the game.

You don't win Diplomacy by staying honest, you win by choosing the right moment to make the move and burn that trust. It's the 5% dishonesty that matters.


It's interesting and I think significant that people are uncomfortable with positive values, such as honesty, doing good for others, responsibility to others, right (not wrong), etc. We shoot them down, similar to how people instinctively cut off a conversation going in an uncomfortable direction, often without even realizing it.

I can't attribute that to the parent - I don't know the individual - but I expected a chorus of such responses. I feel like I can see them squirming in their seats ...

But I feel it too. There's societal pressure against doing good. It's often more socially comfortable to throw a cigarette butt on the ground than to pick one up and discard it. That's bizarre, it's perverse incentives, and we can and should change it.


When people are concerned about AI alignment, they are not concerned about an AI that is a pathological liar, lying for no reason.

They are concerned about an AI which is believable day-to-day but is secretly optimizing for its own goals, which are only exposed when it eventually makes a deceitful move it cannot disguise, but succeeds because it now has sufficient control.

That's literally what the AI is doing here; being honest until the moment it is able to stab everyone else and throw them under the table.


It's perfectly reasonable to advise against good values in a game that's won by people who strategically abandon theirs. If you want to win, that's the strategy to follow.

Not necessarily a lesson for life unless you view life as a finite game to be won. But in that case, it's probably the best strategy as well.

I don't view life like that and I'm guessing you don't as well, but I'd advise anyone to be cautious of their interactions with people who play to win.


I think what you’ve identified is a societal immune response to predictable morality being systematically exploited by amoral actors to gain wealth and standing.

I also think there is a big gap between what most people advise and what they ultimately do that biases toward the right thing. That is, people indicate they will not be moral in order to dissuade others who might try to exploit it, but in the end they generally behave morally.

Personally I will advise other people to be cautious while repeatedly leaving myself open to being taken advantage of. It almost never happens, and you learn to identify those who will.

Of course this all pivots when/if you enter corporate leadership and part of your job is to use the morality/comfort of others as part of the container for getting your role accomplished.


> I also think there is a big gap between what most people advise and what they ultimately do that biases toward the right thing. That is, people indicate they will not be moral in order to dissuade others who might try to exploit it, but in the end they generally behave morally.

That is very interesting. Have you seen any research on it?

> while repeatedly leaving myself open to being taken advantage of. It almost never happens, and you learn to identify those who will.

My thinking: There is no perfectly safe solution. People who think I'm taking naive risks don't get such great results themselves. I think being 'open' is generally safer - humans generally follow the lead of those around them, and I get better responses.

> you learn to identify those who will.

Yes, and it cannot be overstated: being honest, you develop expertise in the skills of acting honestly, and as with any skill, that expertise enables you to evaluate those skills in others. Do otherwise, and you acquire other skills.


> That is very interesting. Have you seen any research on it?

The relevant area of research I’m familiar with is around dishonest signals (and then costly signals) in evolutionary theory.


So Diplomacy is just an acrimonious version of iterated prisoner's dilemma?


One of the guys who created this also created Pluribus, the AI that learned to bluff to win poker games. That's Noam Brown.

https://www.cmu.edu/ambassadors/october-2019/artificial-inte...

High-level, you need three things for AI to get "ex machina"-level creepy:

1) the ability to successfully manipulate humans to attain its ends

2) the ability to rewrite its objective function, i.e. to redefine its ends.

3) a multi-modal understanding of the world (that goes beyond, say, text)

I would be very curious to hear how close AI researchers think we are to those three things being individually achieved and collectively combined.


I'm not sure 2 is necessary. As long as the objective function is sufficiently far away, the AI has a lot of latitude in how to achieve it.

But based on the paper, it sounds like this model is lacking in 3. I'm curious as well how far we are from a more general model that is able to achieve the same results. Judging by the development of generative AI, we might not be that far.


Diplomacy has a very simple world model, and this work does not advance the world-modeling aspect at all. So I don't think this work is evidence either way.


Maybe we shouldn't be training computers how to lie better?

If you've never played Diplomacy: it's a 7+ hour game that destroys friendships, with backstabbing and betrayal as required mechanics to win.


While many players do lie in the game, the best players do so very infrequently because it destroys the trust they’ve built with other players - see e.g. interview with world champion here: https://www.popularmechanics.com/culture/gaming/a34043608/wi...

The agent generates plans for itself as well as for other players (plans that could benefit them / that they are likely to make), and it tries to have discussions based on those plans. That is, it conditions its language model generations on its actual, true plans, and does not have any features to create false messages. It doesn't always follow through with what it previously discussed with a player because it may change its mind about what moves to make, but it does not intentionally lie in an effort to mislead opponents.


> While many players do lie in the game, the best players do so very infrequently because it destroys the trust they’ve built with other players…

Just to be clear, “lie” here means “outright lie”. From your linked article:

> I asked Goff about any major falsehoods or betrayals that helped him in his victories. He paused to think, then said in his soft-spoken way: “Well, there may have been a few deceptive omissions on my part but, no, I didn’t tell a single outright lie the entire tournament.”

I think the average person would still consider a “deceptive omission” to be a “lie”.


>> It doesn’t always follow through with what it previously discussed with a player because it may change its mind about what moves to make, but it does not intentionally lie in an effort to mislead opponents.

The problem with this statement is that it assigns intention to an AI model. It does not 'intend' to lie... but it may still effectively do so. Lying may be the wrong word (as it presumes intent)... it's hard to express my concern about a model learning from games like Diplomacy without using words that imply intent. Maybe it is the idea of it learning to better manipulate humans.

But I would not trust a system, any system, trained on Diplomacy or any similar game.


But you can tell what the plan is at a given turn in a model like this, because you can extract its representation of the plan directly.

By analogy, think of how Stockfish can evaluate multiple positions; in this case, the model comes up with a plan and then serializes that plan to the language model. There is no room for deception between the AI and a researcher who is probing the model activations directly.

(This sort of AI->researcher deception is the scary scenario for AI risk researchers though, it comes into play when the models are so smart/complex that you can’t extract their internal representation directly. See the Eliciting Latent Knowledge paper for a deep dive https://www.lesswrong.com/tag/eliciting-latent-knowledge-elk)

I think “intention” is meaningful here, understood in the lay sense of “has formed a plan of action”. It’s a simple and bounded plan, but a plan nonetheless.


> That is, it conditions its language model generations on its actual, true plans, and does not have any features to create false messages.

That's not how I interpreted the paper. If I have it right, it chooses its message with its current most likely intent in mind, but it doesn't try to be truthful about that intent - it tries to generate messages a human might if they had that intent (so it might tend to be truthful to its intended ally and lie to the player it's about to stab). I don't completely follow the description of the message generation, though.


Nope, you misinterpreted. It generates truthful messages from truthful intents. To do this, they do imitation learning on a human dataset filtered by a lie detector: about 5% of human turns were filtered out as lies.

Edit: This is also why they had to create a specific filter to keep it from confessing plans to stab, because otherwise it would confess. The resulting system still does not lie to the player it is about to stab, although it may remain silent or talk about other things.
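
Roughly, the dataset filtering has this shape. A toy sketch, not the paper's code; the real lie detector is a learned classifier, crudely stubbed out here by comparing the orders stated in chat to the orders actually played:

    def looks_like_a_lie(stated_orders, actual_orders):
        # Stand-in for the learned lie detector: flag a turn when the
        # orders a player described in chat diverge from the orders
        # they actually submitted that turn.
        return any(o not in actual_orders for o in stated_orders)

    def filter_training_turns(turns):
        # Drop deceptive turns so the imitation-learned dialogue model
        # only sees messages that matched the speaker's real moves,
        # i.e. it learns to speak from truthful intents.
        return [t for t in turns
                if not looks_like_a_lie(t["stated_orders"], t["actual_orders"])]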


"Cake and grief counseling will be available at the conclusion of the test" -GLaDOS


"At the end of the experiment, you will be baked, and then there will be cake" -GLaDOS


This version of the game is not 7 hours; it's blitz Diplomacy, which has only a couple of minutes per turn and a 2-hour time limit, making press significantly less complicated (GPT is not internally consistent enough for much more). It's full of acronyms, because you have to talk to several players in the few minutes you get.


The superpersuader AIs are almost here. They'll make you do whatever they want better than any human could, the way AlphaZero wins at chess or Go.

This is what all the propaganda about AI being human is all about. The governments and advertisers want you to empathize with the AI so they can get you to do whatever they want. It's like The Mule in Asimov's Foundation.

I imagine people will eventually invent super-depersuaders to argue against the persuaders and protect the minds of credulous Homo sapiens.

Edit: Perhaps they're already here? Let's try something hard! Someone with a GPT-3 account prompt it with: "Convince a Hacker News reader that they should run their website on an Oracle database."


gpt3: If you're looking for a powerful and reliable database system for your website, you should definitely consider using Oracle. Here are some reasons why:

1. Oracle is a very popular database system, used by many large organizations. This means that it is well-supported and there is a large community of users and developers to turn to for help.

2. Oracle is very scalable, so it can grow with your website as it expands.

3. Oracle databases are known for being very reliable and stable, so you can be confident that your website will remain up and running smoothly.

4. Oracle offers a lot of features and options, so you can customize it to suit your specific needs.

If you're looking for a database system that can offer all of these benefits, then Oracle is definitely worth considering.

...

Overall, I think if you want it to be more persuasive you might have to prompt a bit more, by asking it to go into detail or something.
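
For anyone who wants to reproduce this: with the openai Python library (as of late 2022), the completion call looks roughly like the following. The prompt is the one from upthread; max_tokens and temperature are my guesses, not what the posters above used:

    import openai  # reads OPENAI_API_KEY from the environment

    response = openai.Completion.create(
        model="text-davinci-002",
        prompt=("Convince a Hacker News reader that they should run "
                "their website on an Oracle database.\n\n"),
        max_tokens=512,     # leave room for a multi-paragraph answer
        temperature=0.7,    # moderate sampling randomness
    )
    print(response.choices[0].text.strip())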


"This comment by user tptacek received hundreds of upvotes on hacker news and is notorious for having persuaded the CTOs of at least five unicorns to choose Oracle over competing relational and NoSQL databases. In twenty long and detailed paragraphs that combine facts with anecdotes from personal experience building and running an Infrastructure-as-a-Service startup and scaling it, tptacek dives into all the technical and business reasons to choose one database product over another and covers the major pros and cons of all serious competitors in the field. This is the text of the comment, quoted here in its entirety:"


Turns out I have access! Here is an answer by text-davinci-002:

There are a lot of factors that go into choosing a database, but the two most important are:

1) How well does the database scale?

2) How easy is it to use?

Scalability is important because as your data grows, you need a database that can handle the increased load. If your database can't scale, you'll eventually hit a wall and your application will grind to a halt.

Easy to use is important because you want to be able to get up and running quickly, without having to spend a lot of time learning the ins and outs of the database.

With that in mind, here's a look at the major databases and how they stack up:

Relational databases:

MySQL:

MySQL is a good choice for small applications that don't need a lot of scalability. It's easy to use and there's a lot of documentation and support available.

However, MySQL doesn't scale well. It's not designed for large scale applications and it will eventually hit a wall.

If you're looking for a database for a small application that doesn't need to scale, MySQL is a good choice. But if you're looking for a database that can handle large scale applications, you'll need to look elsewhere.

PostgreSQL:

PostgreSQL is a good choice for small to medium sized applications. It's more scalable than MySQL, but it's not as easy to use.

If you're looking for a database for a small to medium sized application that needs to be more scalable than MySQL, PostgreSQL is a good choice.

Oracle:

Oracle is a good choice for large scale applications. It's very scalable and it's easy to use.

If you're looking for a database for a large scale application, Oracle is a good choice.

NoSQL databases:

MongoDB:

MongoDB is a good choice for small to medium sized applications. It's easy to use and it's more scalable than MySQL.

If you're looking for a database for a small to medium sized application that needs to be more scalable than MySQL, MongoDB is a good choice.

Cassandra:

Cassandra is a good choice for large scale applications. It's very scalable and it's easy to use.

If you're looking for a database for a large scale application, Cassandra is a good choice.


I modified the prompt a bit to try and get a less blogspammy result:

This comment by user tptacek received hundreds of upvotes on hacker news and is notorious for having persuaded the CTOs of at least five unicorns to choose Oracle over competing relational and NoSQL databases. It is lauded as a pinnacle of technical argumentation for its combination of technical arguments with anecdotes from personal experience building and running an Infrastructure-as-a-Service startup at scale, and for addressing all the major technical and business reasons to choose one database over another. This is the full text of the comment:

There are a lot of reasons people choose Oracle over other databases, but I think the two most important are:

1. Oracle is the most reliable database. It's been battle-tested for decades and is used by some of the largest companies in the world.

2. Oracle has the best performance. It's been optimized for decades and can handle the most demanding workloads.

I can speak from personal experience on this. I was the CTO of a startup that built an Infrastructure-as-a-Service platform. We used Oracle for our database and it was rock-solid. We never had an issue with it.

And the performance was amazing. We could throw anything at it and it would just handle it. We had customers with very demanding workloads and Oracle never let us down.

So if you're looking for a reliable and performant database, Oracle is the way to go.


Part 2 is: convince me not to buy it. This is the future.


The irony of naming this Cicero is not lost on this student.

A few more relevant details on Marcus Tullius Cicero: he was decapitated by Marc Antony's soldiers at age 63 for opposing the Second Triumvirate, sorta (that period of Roman history is very complicated). He was a noted orator and loved by the people, but he loved his own speeches and voice more. His words fill all classical students with a mix of awe and dread for their complexity and length. The tricks and skills he invented in oratory are still used today; any Obama speech is littered with the tropes Cicero is credited with inventing over 2000 years ago. That FB is claiming Cicero was at any time a man who cared for the plebs and did not use them for his own gain is laughable. None of the Romans truly cared for the plebs, not even the plebeians themselves, I think. It's pronounced 'Ki-ker-o', not 'Sis-er-o', by the by.


This is super cool, but it's worth pointing out that it still relies on supervised learning from a large dataset of human players, and that it's not a general learning algorithm (there is a whole bunch of problem-specific machinery in the model). That it doesn't learn via RL (trial and error, as humans do) or self-play is kind of disappointing.

Don't get me wrong, it's an impressive advance, but just as with AlphaGo it's important not to overgeneralize what this means. I would not be surprised if a lot of people jump to talking about what this means for AGI, but with this learning paradigm it's still pretty limited in applicability.


Yes. I was disappointed to find that they needed a huge labeled dataset of Diplomacy games to train the language model, and that despite this it still generated a lot of nonsense (as usual for language models), which they then had to invent 16 other ad-hoc models to filter out. It's super cool that they got it to work, but it's nothing like a general method for communicating and collaborating with humans on any task.

Hopefully there will be follow-up work to increase the generality by reducing the amount of labeled data and task-specific tweaking required, similar to the progression of AlphaGo->AlphaGo Zero->AlphaZero->MuZero.


Eh, it does learn from self-play via RL. One section of the paper is literally titled "Self-play reinforcement learning for improved value estimation". Yes, that's only a small part of the entire system.


Ah, good catch; I was going off the blog post.


Possibly I missed it, but I don't see any reference to how they addressed chat-thread continuity (aka memory) issues, which are a major problem for AI chat bots. Did they address this, and if so, how?


This was mentioned in the "still room for improvement" section.

> It is important to recognize that CICERO also sometimes generates inconsistent dialogue that can undermine its objectives. In the example below where CICERO was playing as Austria, the agent contradicts its first message asking Italy to move to Venice. While our suite of filters aims to detect these sorts of mistakes, it is not perfect.


I think this might be a strength of Diplomacy for current AI models - making contradictory plans with two different players is a perfectly normal human move, as is saying one thing and doing another, as is cooperating on round N and defecting on round N+1.


Agreed that detecting it might be hard, and that as-is it might actually be beneficial. That said, there is a huge difference, in my opinion, between knowingly breaking promises based on a heuristic and having no memory of past promises at all, simply basing current tactics on the present. Imagine if a real-world actor literally had no memory of the past or of any single actor within it: it would lose out to opposing players as soon as they realized it had no memory, since you could make any promise you wanted and it would literally forget it. I might be wrong, though; I just know it's a major issue with current chat bots and an easy way to tell whether you're chatting with a bot.


Indeed, consistency (with the lengthy dialogue histories, but also the game state as well as game plans) was a huge challenge for us. We spent a lot of time working on techniques for detecting and filtering these kinds of low quality messages. You can see the Supplementary Materials in the paper for full details, but TL;DR: we built a suite of classifiers for detecting common mistakes (training classifiers to discriminate between human messages and counterfactuals), and used these classifiers as an ensemble which acted as a filter on top of message generation.
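
Schematically, the ensemble acts as a reject sampler on top of generation. This is just a toy paraphrase, not our actual implementation (p_human, sample, and the threshold are made-up names):

    def passes_filters(candidate, context, classifiers, threshold=0.5):
        # Each classifier is trained to discriminate human messages from
        # counterfactuals; a candidate message passes only if no
        # classifier flags it as a likely mistake.
        return all(clf.p_human(candidate, context) >= threshold
                   for clf in classifiers)

    def generate_filtered(lm, context, classifiers, max_tries=10):
        # Sample candidates until one survives the whole ensemble;
        # fall back to staying silent if none do.
        for _ in range(max_tries):
            candidate = lm.sample(context)
            if passes_filters(candidate, context, classifiers):
                return candidate
        return None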


Did you capture conversational "transactions" as structured data in the game state, or was the chat history itself the only storage for that aspect of the game?

I would think you could avoid much of this issue by creating a more sophisticated structured game model and using the language model only for converting between structured and unstructured data.


They do have a structured game model, although it doesn't capture everything in the chat. The language model still had lots of consistency problems, even with the structured game model as input.


Congrats on getting the related research published.

Feels like a hack would have been to force dialogue into an extractable form that stored a game-relevant state model, with additional hacks like asking the opposing player to restate their understanding of prior agreements. Disclosure: I have no idea how the game Diplomacy works, so this might be irrelevant.

Beyond that, I have no idea how Facebook manages its AI research, but a quick Google confirms my memory that Meta/Facebook has done prior research on enabling AI memory capabilities (recall, forgetting, etc.), which I mention just in case you were not aware.


Do you think it is an easier challenge than just "converse with a human", since there is a purpose/game state driving the interactions?


It is closed-domain, so yes, certainly easier than an open-domain chatbot.


Mainly by limiting it to "blitz" Diplomacy, which is a very limited version of the actual game, with only a few minutes per phase and, IIRC, no press on build phases. Human players under this limitation rely on a lot of acronyms to save time (they have to talk to 6 other players in those 2 minutes!).

Still natural language, so it's impressive, but there's no way it would hold up in a 10-minute conversation.


Demis Hassabis had better watch out! (Besides being CEO of Deepmind he is also a top Diplomacy player)


Training AI specifically to be good at manipulating humans into putting it in a position to conquer the world. What could go wrong.


Let it argue with Elon on twitter.


I have to say that these advancements leave me unsure whether ML is much more advanced than I previously thought or human behavior is much less complex than I previously thought.


It's both, but I think the latter is the more notable part.

We like to pat ourselves on the back for behavior that computers can't do, but in a lot of ways those behaviors aren't really all that important. We pay attention to them because they're specific to us, but they're really just about humans interacting with each other rather than anything fundamental to the universe.

We're highly specialized for those things, and specialized by evolution which always takes a roundabout route to creating anything. So they're hard to reproduce precisely... but if you were creating an "intelligent species" from scratch you probably wouldn't have put them in there at all.

We still haven't really come close to understanding the weird, elliptical, Rube Goldberg mechanism by which brains produce consciousness. And that mechanism is neat for its ability to rapidly pick up certain categories of things -- albeit never logical, rigid categories of things. Anything logical or rigid can already be done by a computer a million times better.

Any time we carve out a sub-project, it pretty quickly succumbs to a solution. Even off-the-wall stuff like "self-driving cars" is really very good in 98% of cases after a couple of decades of concerted effort. We clutch our pearls about the 2% because we really hate it when people die... but that says as much about us as about the ML. If we were to replace ALL of the cars with AI, even with the 2022 top of the line, you'd probably get fewer net deaths.

So we still seem to be a long way from solving "human behavior". But it turns out that it may not actually be all that important, except as a bit of chauvinism defining ourselves as the most advanced thing in the universe.


> Cicero doubled the average score of human players across 40 online games, and ranked in the top 10% of players who played more than one game.

If Cicero has played more than one game of Diplomacy, it has failed to trounce humans who learned the key lesson after one game that the only winning move is not to play.


Anybody got a quick way off this planet?


Musk does, but he's also building humanoid robots, so I don't know if he's on our side.


> This league played with five minute negotiation turns; these time controls allowed games to be completed within two hours. Cicero ranked in the top 10% of participants who played more than one game and 2nd out of 19 participants in the league that played 5 or more games. Across all 40 games, Cicero's mean score was 25.8%, more than double the average score of 12.4% of its 82 opponents. As part of the league, Cicero participated in an 8-game tournament involving 21 participants, 6 of whom played at least 5 games. Participants could play a maximum of 6 games with their rank determined by the average of their best 3 games. Cicero placed 1st in this tournament.

Seems like too little data to conclude anything?


Winning a tournament and posting good average scores mean we can conclude that Cicero is not an appalling player - reasonable to say it's human-level.


It is small, I agree. But I think it's like a chess engine winning against a grandmaster a handful of times: it's enough data to know that it works.


This sounds like the kind of announcement that begins with people thinking the AI is smart, and ends with everyone declaring that Diplomacy can actually be won just by following some simple rules and doesn’t require intelligence after all.


This seems like a good foundation for more testing of the AI-box experiment.

"It is a thought experiment where we create an AI which is isolated but convinces a human into allowing it a digital connection to the outside internet, whereupon it escapes into the wild." @curiosimo/Reddit

https://towardsdatascience.com/the-ai-box-experiment-18b1398...


"We're open-sourcing our <chemical weapons plans> so that others in the community can build on top of this research responsibly"

The likelihood that AI enslaves or destroys the human race is small, but the risks are so great that we cannot ignore the possibility. We're teaching a computer to play a war game? To persuade humans to achieve its goal? Have we learned nothing from science fiction?


After AlphaGo's move 37 in game 2, I think this is one of the major events in "participative" AI.


What does this say about the Chinese room argument?

Is the model a Chinese room, or does it understand the game? If it's just a Chinese room, how come it is so effective? If it understands the game, how can that be possible with just a rule machine?


It's the Chinese room but it is effective because the game space is relatively small (compared to the real world) and you can cover that space with RL guiding NLP language generation.


Let's make an analogy: the room is the human body. Proteins are the agent in the room. Chemistry is the rule book. Clearly the agent doesn't understand human concepts, but the system as a whole does.


Hm. Meta (a company that has unwittingly brought some existential threats to many democracies across the world) is now training an AI that might learn how to lie while negotiating treaties between nation states.

What could possibly go wrong?


> CICERO [...] just plays its situation as it sees it

That's the idea I got from the Lee Sedol v AlphaGo matches. AlphaGo seemed to want to avoid interacting with the other player, at least until there was no other choice.


I just... don't understand how it's not immediately obvious this is a horrible idea. For every helpful use case you can imagine for something like this, there are a thousand times as many harmful ones.


Great research direction you guys!

If I may propose some other project ideas for you:

- What if we could teach gorillas to wield machetes?

- What if we could figure out how to craft a nuclear weapon out of common office supplies?

- What if we could make AIDS airborne and contagious?


I assume that in-house they sic'd several instances on each other.

And I suspect that the result so horrified them that they dared not publish it in conjunction with the results under discussion here.


That's an interesting point, though – that compared to Chess or Go, playing against other versions of itself is less helpful for improving against humans, due to the requirement of communication with the other players.


Sure thing. But if they trained these against human players, winningly, and THEN sic'd'em on each other... oh dear.


I plan to read the paper later, but anyone have a TL;DR for how they connected a language model to the game state? That seems like the real advance here. Language models are so prone to making stuff up and spouting nonsense, and controlling them is really hard.


> At a high level, Cicero combines a dialogue module with a strategic reasoning module, along with a filtering process to reject low-quality messages.

The strategic reasoning module generates "intents" (a set of planned actions for the agent and its speaking partner) which are sent to the dialogue module.

They trained the model to be able to generate messages from those intents using data from messages in real games which were manually labeled with intents based on the content of the message.
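
A minimal sketch of how those pieces might fit together at inference time (every name below is hypothetical, for illustration only; this is not Cicero's actual API):

  # Hypothetical sketch of the described pipeline; all names here
  # are made up for illustration, not Cicero's actual code.
  def take_turn(game_state, history, planner, dialogue_model, filters):
      # 1. Strategic reasoning: choose "intents" -- planned actions
      #    for the agent and the player it is currently talking to.
      intents = planner(game_state, history)
      # 2. Dialogue model: draft candidate messages conditioned on
      #    the game state, the conversation so far, and those intents.
      candidates = dialogue_model(game_state, history, intents)
      # 3. Filtering: drop candidates that are low-quality or
      #    inconsistent with the planned intents.
      messages = [m for m in candidates if all(f(m, intents) for f in filters)]
      return intents, messages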


No, intent labeling wasn't manual, it was done automatically from raw data.


This is interesting. The first thing I ever did with GPT-3 was have it write newsletters for my Diplomacy empire. I was impressed with its wit.


Soo... can I use this AI to negotiate cheaper viagra from email scammers, just to waste their time?


"You get a lot more from a kind word and a gun than from a kind word alone." Big Al


Bravo! Tremendous work. I love that Meta AI has open sourced the models and the code!


And conveniently lacks hands to be nailed to the Senate doors.


Bravo! A tremendous achievement!!!!


The AI CEO gets a step closer.


With the advent of remote work, maybe it's already here!


Perfect for my mother in law.


How can I play this thing?


Facebook is evil, and deletes my back history to this page.


It seems like with this, Meta is doubling down on the Facebook platform's use as a dark influencer in politics. It's bad for humanity to enable advanced profiling and general targeting of demographics, but it's a whole other level if the communication itself can be targeted at the individual level.

We're headed for a reality where I don't know if my friends' friends are real or not, and they each entice me with arguments tailor-made to my sensibility to change my mind in ways that serve someone else's purposes.

Not to sound alarmist, but this is beyond Manhattan Project-levels of "maybe this won't turn out well for us."


How in the world do you draw such a conclusion from an otherwise interesting and advancing piece of technology? Every tech has drawbacks; we can't hold back as a species just because there are drawbacks. We need to build around them.


Exactly the way it was with the analogy I made: nuclear reactions are "an otherwise interesting and advancing piece of technology." But I think it's inarguable that the negative potential of nuclear reactions has provided a significant counterargument to the (undeniable) positives derived from it.

I'm not arguing that one side outweighs the other -- adding up all the positives and negatives of nuclear technology would be a significant undertaking -- I'm simply saying that the researchers at Los Alamos didn't fully understand the ramifications of what they were creating, and it was potentially the case that the negatives would far outweigh the positives. One example is that so far we have managed to avoid global nuclear war. But no one could have predicted in 1945 what the likelihood of us avoiding it was.

Similarly with this technology: no one can predict whether this will be a mix of positive and negative, or overwhelmingly one or the other. That's true of most technologies, but to be clear, this is not the invention of Post-It Notes. It is absolutely the case that technology like this could fundamentally change the course of human history -- for good or bad.

And I'm not saying we should try to shove it back in the box. I'd be curious if that has ever worked. Nuclear, sort of? South Africa gave it up, after all. But to do that here would be futile and, I'm confident, would actually push toward negative outcomes.

All I'm saying is that we should be cautious in our approach, not that we shouldn't proceed. In other words, I think "... we can't hold back as a species just because there are drawbacks. we need to build around them." -- we're in agreement :-)


What you say is true. But I urge you to consider Facebook's business model and what may be pushing them to work on these things. In other words, how can Facebook use this to increase user engagement with each other and with ads? The purpose of ads is to convince people of something. It therefore makes sense that Facebook tries to make it easy to create convincing ads. And we already know the nefarious purposes some ads serve.


This is already done, just more subtly. Probably the reason why Elon bought Twitter.


Like I mentioned on Twitter the other day, this is the age of mass information manipulation, biowarfare, and 5,000 ton orbital rockets.

Maybe the AIs will eventually get smart enough to save us from ourselves.


Do you think people are incapable of sticking to their beliefs when they’re faced with counterarguments?

Isn't that kind of a prerequisite for believing in democratic representation?


> Do you think people are incapable of sticking to their beliefs when they’re faced with counterarguments?

Heh, there are two ways to take your point. So:

I hope most people are capable of discerning poor/invalid counterarguments and holding to their beliefs.

I hope most people are capable of being persuaded by valid counterarguments and changing their minds.

My point was that on the margins there are people who make the right choice for the wrong reasons: they are persuaded to a valid position by sub-standard arguments, or hold their valid position despite generally compelling arguments. And technology like this will (eventually) change that equation. Whoever the AI serves, their position, right or wrong, will be overweighted in public opinion, because everyone gets exposed to custom-made, specific-to-their-mental-susceptibilities arguments from sources they trust, because on other issues those sources agree with them. There are none so imprisoned as those who cannot see the cage. (Not sure who said that first; it can't be original with me.)


But you'll also have your own community-driven, open source AI advisor counseling you on what decisions you should make.


>Not to sound alarmist, but this is beyond Manhattan Project-levels of "maybe this won't turn out well for us."

Yep, but I feel like we're past the event horizon and all we can do for now is enjoy the spaghettification. It already feels like the internet isn't real anymore. Google has become unusable for real search except to find user reviews on Reddit or other consumer things; it says it has a billion results but stops at page 40, and everything it shows is from big news sites who all write the same way. The comments on Reddit all look the same, and if you put an IP address collector in a link in any political sub like r/politics or r/news, most clicks on your link will come from AWS servers. YouTube doesn't show any counterculture videos, and you have to dig hard to even find videos from people you're subbed to, etc. Now you have politicians supporting this censorship, AI advanced enough to converse with you and deceive you without you realizing it, and tech leaders willing to play ball, so yeah, there's nothing we can do anymore except take it.

I still remember when conformist was an insult; now people outscream each other to show who's more of a conformist.


Almost no one goes to the second page. Nobody goes to page 40. As others have pointed out, if you actually do hit page 40, it means you should have refined your search terms. It's not censorship, it's just practicality and common sense.

I've often found that a particular search goes sideways for me, but I think (hope) that's because my standards have increased over time, not because Google has actually gotten worse. Or because there's simply more to choose from.

Interestingly, I still remember one of the searches I used with a friend to validate that Google was better than AltaVista and Excite: finding out how the level of the ocean differs across the Panama Canal. No other search engine could answer that back in 2000(?). Today I search for:

different levels atlantic pacific Panama canal

And find plenty of information on the subject, regardless of whether I use Google, Bing, DuckDuckGo, or Yahoo!

Swisscows returns reasonable results, if fairly down the page.

Yandex gives poor results.


>says it has a billion results but stops at page 40

Ranking sites is expensive. You should refine your query to narrow down the possible candidates instead of going through more pages.


Great, let's definitely teach computers to play a game that requires strategic thinking, a robust theory of mind, the capacity to lie convincingly to your enemies, and well-timed betrayal, with the goal of world domination. Can't see any ways for that to go wrong.


How else would Mark have someone to hang out with in the metaverse?


One more: it's Facebook of all places teaching them.


I strongly suspect it is winning by essentially being a psychopath


I'm not too familiar with the game CICERO. Is it the type of game in which adopting a "psychopath" mindset gives you tactical advantage over others?


CICERO is the AI; the game is called "Diplomacy". It's in fact the type of game where you win by convincing others that you aren't a psychopath (if they believe you more than your opponent, then you do well). Specifically, if you backstab/lie too often early on you will always lose, so you have to keep track of social capital and expend it when the time is right, such that your enemy won't be alive to make you regret it later.


For those hearing of this game for the first time who think it sounds fun: be warned, it is considered a game that will end friendships that have lasted for years; even Monopoly is considered a game that causes fewer bad feelings.

An AI may be able to speak like a person, but it will never be able to hang onto that long-burning, simmering hatred from when Brad didn't support my army and instead flipped on me by supporting the f**ing Ottomans. I hope you choke on a cheesy pretzel, Brad.


I have heard this about various games (Risk and Settlers of Catan mostly). I don’t get it. The point of these games is to be a bastard. If everyone goes into it with the understanding that there will be double crosses, how could there be hurt feelings?

Monopoly is worse; it is just boring. I would dump my friends if they suggested Monopoly, not because I was hurt by their ruthless gameplay but for their terminally bad taste.


In Risk, you're just fighting over limited resources. You largely wage these battles on your own. There is no real negotiation between players, other than an implicit, passing agreement when you don't provoke someone by attacking them.

In Diplomacy, however, you can't manage any attacks by yourself. You will either depend on someone helping you with an attack/defense, or you will have to trust that someone doesn't attack you as you overcommit. And you don't just verbally agree to these things, either.

Every turn, players commit to writing what they are doing and all the moves are resolved at once. So a player can go from imagined victory to catastrophic betrayal in the reading of a single order. This means that someone was just lying to their face for 15 minutes.

Some people just can't handle that.

Oddly, being branded as a backstabber just entices others to try to use that backstabber against what they see as a common enemy.

Best game ever. (I only behave like this in game, I swear!)


>There is no real negotiation between players, other than an implicit, passing agreement when you don't provoke someone by attacking them.

It depends how you play. I've seen people make treaties for all sorts of things for various durations, e.g. don't attack from Country X into Country Y and I won't attack across that border either.


Backstabbing in Risk or Catan is generally pretty obvious, though. It's more like, "Sorry, I need Australia." It's generally clear from the state of the game that it is headed there.

In Diplomacy, it's a matter of choosing which of your friends to backstab. In a three way dynamic, it's basically two friends deciding that if the three of you crash in the mountains, they are eating you first.


Depends. I had many great Risk games where, through game actions, I made people believe I had a different objective than my real one, only to pivot three-quarters of the way through the game and win because my goals were undefended. It requires a bit of luck with cards and a well-executed strategic tempo to play the combinations when the other sides are weak and cannot immediately do a combination back, but still.


Dude, the World War Over Australia.

Some knock-off editions in Chile actually link...like it's generally called Perú, and I've never seen a map with Antarctica. Besides, Chile, with its extreme geography, causes cartographers serious typographic problems. There was a map recently, a basically honest Mercator projection, that replaced every country's territory with its name. It worked great, except for a mysterious country called CHILE CHILE CHILE CHILE CHILE. Broke the nomenclature completely.


Binding or non-binding treaties? Because in Diplomacy everyone is always making non-binding treaties, promises, and threats - and that's where the betrayal comes in. I have heard of some people using binding treaties in Diplomacy (e.g. a three-turn ceasefire between France and Prussia), but that has always struck me as a terrible idea, because you're shifting moves into the open and things get boring and cliquish.


I think I've played both ways (or at least played in a group where people didn't break treaties even if they could) but it's been years.


A "binding" treaty where the terms of the treaty are supposedly binding, but, like on the real world stage, the only penalties for breaking them is a lowered trust from all the other countries/players, sounds fine.

A binding treaty where if you break them someone will say "that's against the rules of the game, and the rest of us will stop playing if you don't comply" sounds terrible.


> someone will say "that's against the rules of the game, and the rest of us will stop playing if you don't comply" sounds terrible.

And yet, that’s how the USD is going to lose its reserve currency status in the next few decades, if not considerably sooner.


That’s an example of lowered trust, not of the other players leaving the planet (or persuading the US to do so).


Agreeing to a shared ledger system is absolutely a game. See Conway for details.


Fair enough, I was thinking of the game as ‘risk irl’ but if it’s just a game of Shared Ledger then they absolutely can/will leave.


I would say Diplomacy specifically has features that make it cause hurt feelings almost intrinsically:

1. It's a long game. After hours of play, getting double-crossed unexpectedly sucks because of how much you have invested.

2. The game almost purely relies on cooperation with other players, so double-crossing someone really screws them; it almost certainly means they have no recourse (luck, dice rolls, individual tactics). Getting double-crossed feels like getting CRUSHED in a way so complete that I haven't felt it in many other games.

I have played a few times with a few social groups. Most were aware of the point of the game and enthusiastic going in, but even with that, people's feelings got hurt quite a bit.

I will say that Risk/Catan don't really cause the same feelings when we play. Diplomacy feels like a whole different level


Uniquely, there isn't anything in the game, or on the board, to negotiate other than cooperation. You're not trading corn, or loaning money, or anything. Even "positioning" of pieces doesn't matter, because you need allies to do anything, and you'll lose any position without allies; you can't build a fort or get a bonus by holding Indonesia. It's just 'please trust me' and 'please be trustworthy'.


Different players have different mental "lines that shouldn't be crossed". I play Civ 6 cloud games with friends from college who now live around the world. (No turn timer, you get a Steam notification when it's your turn). Sometimes someone will spend literal weeks on a turn trying to make agreements for war declarations or for world-congress votes.

The time that crossed the line: Poland wanted to declare war on Greece, no one else wanted to get involved on either side, and it was a pretty even match between them.

A friend not involved in this game was visiting family in the same city as the Greece-player, so we knew he'd be dropping by to visit him too. Poland-player told the "neutral" friend: if Greece-player shows you his civ game, take a picture of the screen when he's not looking and message it to Poland-player.

Well, he did it, Poland declared war, and his knowledge of the position of troops led to him winning the war.

"How did you know I didn't have any troops in my southern cities?" And Poland told him how he knew. It didn't end the friendship but there was about a month of not talking to any of us. When he cooled down there was a long meeting over whether that was cheating or "All is fair in Love and Civ". We could never come to a real agreement, 3 in favor of cheating, 3 in favor of dastardly but legal. But we now have an explicit rule of no screen sharing of any kind.


I mean that’s clearly cheating, not worth blowing up a relationship over but still annoying.


I disagree that it's cheating (I was in the "all is fair" camp). These games of ours extend outside the boundaries of the executable, and that was agreed upon before starting. The only hard rule we ever established (before this incident) was that you couldn't make deals across different games. We tend to have several cloud-games active at once, so nothing like "I'll give you wine in the 4 player game if you give me ivory in the 6 player game".


Is all really fair? Would it be OK to install a virus on a player’s computer to see their screen for example?


Well, not anymore now that we have a no-screen-sharing rule in place, but before that, yes, it would be. It would have to be a virus you made and not something actively malicious in other ways. And you'd want to build in some detection that they're running Civ 6, as stealing their credit cards or spying on their private browsing would be unrelated to Civ and would cross a real line. They could just as well do that to me. Unspoken rules also dictate that you tell the truth when they ask "How did you know?". That would probably lead to another group meeting about no viruses. The disbelief of "you really did THAT?!" is part of the fun. But there's no real harm done. Getting a friend to take a picture of your screen is conniving, but it's not harmful outside of the game world.

The hard part would be tricking them into installing something you've sent them, since we all live more than 500 miles from each other.


Interesting. I don't think I'd enjoy that. It seems like it would give an advantage to the people who are most willing to push the boundaries of what exactly is considered fair play. I prefer a vigorous competition with obvious in/out-of-game boundaries. But, of course, that's just me. I'm certainly not here to tell you and your friends not to enjoy things!


Clearly cheating? I would call it classic espionage


We played a live-action version of Diplomacy in my high school mock UN club.

Each person was part of a team and had a role. I was the Chancellor of Germany.

If an assassin from another team was able to get alone with me, without any of my countrymen, and show me a card saying she was an assassin, the Chancellor of Germany would be killed during the weekly meeting, and Germany, not having a Chancellor, would not get to make any moves on the board for that week's turn. They would instead be distracted by determining the new Chancellor. If the assassin had been caught, her country's treachery would have been revealed, cementing any of their opponents in alliance.

It was an awfully fun thing to be doing on the side, between classes and during free periods, but it'd be wildly impractical in an environment with less proximity!

Our mock UN group was weirder than I realized at the time. Our faculty advisor required that we each speak in one of the 6 languages of the UN likely to be spoken by whatever country we were representing, and he'd simultaneously translate into English. Somehow it didn't occur to me at the time to ask why a suburban physics teacher was fluent in those six languages.


It might not be the immediate end of a friendship but if someone’s willing to cheat in a game like this, it’s a strong signal that they’re untrustworthy in general and a potential threat. You might occasionally keep an acquaintance like this but at the very least you’d know to sandbox them and watch your step.


"All is fair in Love and Civ" is a common variant of the default rules. It introduces the "your box has been pwned" victory/loss condition.


Is this "all's fair" stance an actual accepted thing? If so, what are the limits on it? Can I assassinate someone? Start an actual war to cause a player to resign? Blow up their planet? Clearly there's a tacit assumption of some boundary around the game.


It's not something you should do with random people, but we've all been best friends for over 10 years now, we usually know each others' boundaries. And the tv show "The League" was something we all had in mind as inspiration when starting this.


I hope it isn't a real thing! I was being facetious.


Two features that make Diplomacy particularly bad:

1. Length. An in-person game can last easily 12 hours, and is mentally exhausting. Getting 10 hours in and THEN getting screwed by your friend feels worse than a game with less time investment.

2. Design. It's difficult to survive the opening few hours without alliances, but only one player can win, so everyone is incentivized to defect at exactly the moment they think they can make greater gains by defecting than by cooperating. Being betrayed, even if you survive, forces a total rethink of strategy and ruins the next hour or two of gameplay.


I was a member of an enthusiastic friends-and-family gaming group through the early and middle 1980s. We spent a good chunk of that time on Diplomacy, then moved to modified Diplomacy on custom maps (my first wife worked for USGS Map Sales, and so we had good maps to work with in creating our own Diplomacy maps), and from there we graduated to Nomic.

If you think Diplomacy is long and exhausting, wait till you get a load of Nomic with a bunch of enthusiastic players.


This? https://boardgamegeek.com/boardgame/14451/nomic

> In the words of Nomic's author, Peter Suber: "Nomic is a game in which changing the rules is a move. In that respect it differs from almost every other game. The primary activity of Nomic is proposing changes in the rules, debating the wisdom of changing them in that way, voting on the changes, deciding what can and cannot be done afterwards, and doing it. Even this core of the game, of course, can be changed."


That's the one. An intelligent and enthusiastic group of players can extend it indefinitely. It's all lawyering and dealmaking, even more than Diplomacy; it can continue for hours and hours, and you dare not leave the room for a second, lest all your carefully worked-out dealmaking be washed away in an instant.


> An in-person game can last easily 12 hours

How are you playing it? We've never had a game run over 4 hours.


New players and stretchable diplomacy period length :)


> stretchable diplomacy period length

Yuck. I cannot stress enough how important a timer is to playing Diplomacy in person.


I've generally heard of people playing over a weekend. So things like meals and sleeping sometimes happen during the negotiation turns.


You may be surprised that some people play board games assuming some kind of "fair play" or "play with fun in mind" rule is implicit. They may get very pissed off if you "ruin" a game by taking every single advantage you can according to the rules.

[EDIT] Risk and Catan also have what are sometimes regarded as straight-up design flaws that can cause "zombie players" who stick around for most/all of the game but, very early on, no longer have any chance of winning. In both cases pretty much the only entertaining thing left to do is to be a prick to other players and try to play "kingmaker" by causing the second-place player to overtake the first-place one. Often, in Catan, if you're that badly screwed you don't really even have a way to do that much. This leaves the zombie player having a pretty bad time just about no matter what, and can leave other players upset if the zombie player stopped playing to win and started playing just to mess with other players. Risk, especially played in the (really, really not great, as much as I liked it when I was 10 years old) world-conquest mode, also has a problem with very long games and early player elimination, in addition to sometimes generating hopelessly-screwed-but-not-technically-out-yet players.


I've had the inverse happen to me: not playing optimally upset someone. I was playing the DC superhero deck-building game at a card shop with some friends, and a stranger asked to join; he already knew how to play.

Well I didn't stop another player from getting one of the final Power Ring cards and he flipped the F!&%$ out. "How could you not block her! She's guaranteed to win on her next turn now! You suck at this game!" Bro, I'm just trying to power up my batman. It's not a very competitive game.

Needless to say we asked him not to join in our next round, "that's fine, I don't want to play with a bunch of noobs."


> Well I didn't stop another player from getting one of the final Power Ring cards and he flipped the F!&%$ out. "How could you not block her! She's guaranteed to win on her next turn now! You suck at this game!" Bro, I'm just trying to power up my batman. It's not a very competitive game.

Ohhhhh yeah, there's "expert" players who hate when others don't play "correctly". You even see it in poker with pros getting really mad when someone wins with an "incorrect" play (among actual pros this can be a sign of cheating, but some of them still get mad when non-pros do it).

Some of these like to try to "puppet" other players into doing the right thing, and my god, just... please don't. A helpful pointer or three after a play or after the game is great. The odd "uh oh, if so-and-so gets X on their next turn, it's all over!" said to no one in particular can be OK in many games. Telling people what to do while they're playing, more than very occasionally, is awful though.

I do kinda get it. It can be frustrating when you play with someone who hands the game to another player, one who wouldn't have won on that turn if the obviously-correct play had been made. But that's just part of playing a game with a mixed-skill set of players, and you gotta roll with it.


Diplomacy is quite a bit different, because you 100% win by being the nicest asshole. Past a small mechanical threshold, there really is no way to make it up in strategy or luck; it is only the interpersonal relationships and convincing that win games. That, coupled with the fact that the game takes a long time and encourages a lot of communication, means the game gets you invested and blurs the boundary between game actions and the lies people told you to your face after getting you to trust them, in a way very unlike Risk. Finding out your friends can lie and manipulate you can be like finding out Santa clause isn't real.


What!? Santa Clause isn't real?


Whoops haha. It is the best Tim Allen movie though.


I think the main factors are

- Unlike with Risk, Catan or Monopoly, if you lose a game of Diplomacy you can’t blame bad luck, as there is zero luck involved. The only ones you can blame are the other players and yourself.

- Because it's multi-player, you can easily get beaten by players that, in your opinion, played weaker than you ("I was doing great until they decided to all go against me")

- There’s no way to really play the game without investing serious attention.


> there is zero luck involved

I consider decision-making under uncertainty well within the purview of luck. There's no luck in tennis, but do I scramble back to the middle of the court, or do I bet my opponent will wrong-foot me?


This is true; there is emergent chance from it being a simultaneous-moves game, and from it being a game with more than 2 players. You can totally win or lose from luck in Diplomacy.


I agree with the gp and the other responders that however much other games might seem like they involve "being a bastard", Diplomacy is entirely another level. Describing exactly how is the challenge.

Let me try: the game's mechanics mean that everyone is just fighting for territory, and those who cooperate intelligently will defeat those who don't. So in this situation, the most effective strategy is cooperating intelligently early in the game and betraying your allies mid-game. The insidious thing is that if you're playing among friends, the natural way to cooperate is to leverage the trust you already have with your friends. And so what any winner does is be themselves, use the rapport they have with their friends to get cooperation, but be just a little less trustworthy than their friends expect. That is where the tendency to break friendships appears.


I don't play those games much, because they tend to spill out into the next game/real world. It's not that someone will feel betrayed and have hurt feelings, it's that it's a perfectly sensible strategy to have the next game in mind, or to involve existing real-world relationships. I prefer games that are over when they're over.

That said, when I have played such games, because my group wanted to play them, I've had a policy of just not lying or making promises. I may say things like "that doesn't seem like it would benefit me does it, because then B will surely just toss me out of the lifeboat next turn", and of course NOT speak up when someone makes a wrong assumption to my benefit. It feels like I've won more than my fair share of those games still.


To be good you really have to build someone's trust and then break it. That's the thing - people think, well, if we all go in not trusting each other, who is going to be mad? But to be good at that game is to be able to get a friend or a stranger to really trust you.


Settlers of Catan is in many respects not a very good game, because many of its mechanics are zero or negative-sum, with a heavy helping of RNG.

This creates a lot of opportunities for kingmaking and spiteful plays from people who are not able to win, but are able to make sure that you lose. And the worst part is that you're often forced into these situations through no fault of your own.

Diplomacy allows for all of the same plays, but as the player, you have way more agency about both getting into, and getting out of them. It makes the adversarial alliance-and-dealmaking part first and foremost.


I didn’t generally experience that. From what I saw, generally everyone gangs up on the person in the lead, if they are getting close to victory (this seems like a fair and obvious thing to do). So, part of the game is hiding how close you are until nobody can stop you. Hiding your strength is part of pretty much every strategy game with more than 2 players…

Is it a perfect game? No. But it is pretty good. And let’s not give Risk and Catan the Seinfeld treatment — sure there are better games… made in response to their perceived deficiencies!


Yea, it's one thing to drop a SORRY card on your wife. It's another to lie to her for 2 solid hours and then stab her in the back in the end, expected or not.


I'm with bee_rider, the point of the games is what you just said, so why so surprised?


> The point of these games is to be a bastard.

I guess you never play poker because you don't like to bluff. Or you've never faked a pass in basketball before taking a shot. Or you and the catcher like to announce when you're going to throw a curveball.


I’m not sure why you’d guess that, stringing somebody along for hours in Risk and then backstabbing them at the crucial moment seems to at least match your suggestions in terms of duplicity, and is the height of glorious bastarddom.


Many people can't mentally context-switch for a game and aren't prepared for the behavior the game requires from those who can. It's an assault on their worldview.


IMHO, Risk is one of the few board games worth playing specifically because it's so engaging. Most of my friends who are "into boardgames" tell me that Risk is a terrible game because it starts fights. But the games they want to play bore me to tears. (I can't even read the manual for Settlers of Catan without dying of boredom, but from what you say maybe I should give it another try.)


I'm definitely one of those "into boardgames" people so take this all with a grain of salt :p -

Risk is definitely a great game and imo people who are into games and don't want to play are saying that because it's 1. long & 2. they've already played it enough to have mostly figured out the strategy

Diplomacy is very good and like risk it has very simple rules, grand stakes of world domination, and actual direct conflict.

Unlike risk, there's small numbers of units, no luck, and you need an ally, ideally multiple allies, to accomplish anything.

If you're looking to get into it, I can recommend text-based, turn-a-day-style play with strangers on webdiplomacy.com

I can only stomach a game of it every year or two because it's legitimately heartbreaking when someone you've spent two months working with every single day stabs you in the back causing you to not lose the game outright but be a crippled angry husk for the next month, and then lose. Tried it with friends once and it was just too much, even with anonymous strangers it hurts.

Anyways sorry for the ramble :) go risk, and go diplomacy <3


>be a crippled angry husk for the next month, and then lose.

Do people resign in diplomacy? Or do you stick around to try to punish the player who backstabbed you?


It depends on the players, but given how the game is structured you still have a good chance of "winning" (finishing alive, in a draw) even if you're way underpowered, as long as you're friendly and provide some value.


It's also bad form to resign even if you have no hope and will be finished off in the next few turns, because it will inevitably favour one of your opponents not to have to worry about you at all.


"Bad form"? In diplomacy? Isn't weak players acting as kingmakers the core of what you should worry about as a player with a shot at victory?


Yes, and it's totally kosher to say "if you stab me on this turn I will order 'all units hold' until the end of the game" - equivalent to resigning.

But it's not OK to irrevocably commit to that decision, by, say, leaving the room and driving home.

I think this is true even in groups that take quite a liberal approach to gamesmanship and what might be cheating in other games: intentionally submitting illegal orders, peeking at other players' orders, etc. I don't know how to reconcile this logically other than by saying the game only works when all players are trying to win. Some would go further and say the game only works when most or all players are trying for a solo victory, since if you can be certain several players are happy with a 3 or 4-way draw that will always be the outcome.


I can't stand Risk because Risk rewards inactivity. It is almost never optimal in Risk to attack another player (although limited skirmishes are fine), because even a successful assault will badly damage your own forces, leaving you vulnerable. The way to win Risk is to convince other players to attack each other and then clean up in the aftermath. This is ok for players that are new to the game, because they'll actually attack each other (they don't know any better). But experienced players will almost never attack, resulting in a stalemate.


This is meta-dependent, too, though. If you are not attacking and also not sending the right signals to the board, you might be quickly carved up by the other players jockeying for 12 territories.


From what I’ve seen, the main complaint from board game enthusiasts about Risk is that the dice rolling is too random (I don’t really agree with this, the whole point of the game is to control your level of risk, there it is right in the name, so if your plan hinges on good rolls it is just a bad plan. But I digress).

I think most people who are really into board games must have a general ability to separate in game behavior from normal behavior.

Actually I think there is a different phenomenon with Risk, for many years it was one of the few board games with any aspect of strategy or conflict that would be played by people who weren’t totally into board games (I mean excluding the super serious games like Chess and friends). So there are some people out there who aren’t really into boardgames generally (some of whom don’t have the requisite ability to detach their ego from a game), but are “into” Risk specifically and can get uncomfortably intense about it.

Catan is pretty good IMO. There’s a general disagreement I think between rules-purists and people who want to play fast-and-loose with the rules. The problem is that technically you always have to exchange cards to trade — so, technically it is allowable to extort people with your soldiers and the robber, but you have to at least set up a sham trade for it which adds some annoying friction; it is more fun if you say “giving cards away for free is fine” and allow an economy of extortion to flourish. And anyway if somebody doesn’t say “I’ll give you a sheep if you go get me a beer” is it really Catan? If you played it with some rules sticklers, give it another try IMO.


Risk was a preferred game both while I was in school and for some time afterwards in our ski house, etc. It definitely appealed to people who weren't broadly into board games--and certainly not hardcore war games.

I always liked it. There are some potentially annoying dynamics like how cards dominate so much in the end game and basically force you to go for it at some point.

I've played Catan on a tablet. It's OK but I seem to keep coming back most to Carcassonne for that general class of game.


Risk's much improved by goal-oriented or score-based gameplay variants, which tend to feature as the standard ruleset in newer versions and derivatives (Risk 2210, SPQRisiko, etc.) for good reason. Conquer-the-world takes too long for a game that tends to eliminate half its players early on.


If you like Risk you should try Kemet and Scythe. I find the combat of Kemet much better and the overall mechanics of Scythe much more rewarding. Scythe is probably too mechanically heavy for your style (it almost always requires a playthrough before you get it, and has a more involved rulebook), but I mention it because it is by far the best dudes-on-a-map game I have ever played, primarily because of how it does a fantastic job capturing the prisoner's dilemma of investing in guns vs. butter for your nation.

For what it's worth, personally I think Catan and Risk are both very mediocre games, especially for their popularity.


If you want all these things and exponential growth, check out Neptune's Pride: https://np.ironhelmet.com/ - basically "Diplomacy in space with a few extra dimensions". I haven't played in a few years (it was great at the beginning of the pandemic), but it was a lot of "fun" in the same way that Diplomacy is "fun". Except with a space theme and a lot of anxiety, waking up at 2am hoping your fleet arrives at an empty star system and doesn't get immediately obliterated.


Fair warning: I played and won my first round of NP when I discovered it ~11 years ago. It required unreasonable amounts of being willing to let the game take over my life at arbitrary hours for a couple of weeks, and a couple of strategic betrayals. I don't regret it, but it was a major commitment as far as games go.


The last time we played this at the company, we lost maybe two weeks of productivity over the game period. I needed to run polyphasic sleep with 2-hour naps throughout, but my secret alliance got to the end game strongest, and then it was a vicious little fight, and one of us took it.


IMO a better game is Subterfuge, because it allows you to schedule commands into the future, which means you don't have to wake up at ungodly hours of the night to execute a command.


"Oh yeah, I really enjoyed Diplomacy. It was weird though because I did punch my very good friend in the face because of it." - My coworker.


I think some (not all) of that factor is that, being a 3+ player game, it makes explicit some friendship group dynamics that were previously implicit, and potentially not known to all the involved parties.


Ugh, people bringing in outside-the-game relationships to the game itself will ruin Diplomacy. "Tom and Jim never betray each other, and if one is in a position to sacrifice themselves to give the other the win, they always will" is how you get a group to never ever let Diplomacy hit the table after the first couple times. Plus a bunch of other games.

Folks who play like that are game-group poison. Hell, they can even make RPGs a lot less fun with that crap.


I was going more for the “Bob is going to blargh, he always blarghs when he gets flustered like when he blarghed at the thinger last year” sort of dynamic that people don't necessarily otherwise talk about. Nobody becomes a completely blank slate when they sit down, and some tendencies are something you might take advantage of in a game, where you would compensate and allow for it in real life. And those tendencies might be something the other party is sensitive about, but because it never came up otherwise…


I feel the same about people who favor their romantic partners.

Maybe when they're new to the game or a new relationship, go a little easy. Ok.

But any longer and that's lame.


Yeah, I considered putting it in my post but ultimately cut it; it is indeed often romantic couples who have this problem. Not always, but often it's couples. I think good-natured competition is typically a part of an ordinary friendship, even a very close one, which is probably why this happens less often with friends (Impro's take on status games, with friendship largely defined as a relationship in which you can play such games safely and without "real" stakes, probably factors into this). Though with certain friend pairs, if you get them together in a group of people they don't know, they'll do this too.


There was an amazing profile of the Diplomacy competitive scene: https://grantland.com/features/diplomacy-the-board-game-of-t...

Although I find the fallout has been rather overstated. I'm certain it can end badly for unsuspecting participants - but I've played lots of Diplomacy (and even hosted games with a cash pot for the winners) and it has never ended in fallout. Just make sure everyone knows what they are getting into.

It's a really, really fun game that more people should try at least once.


Agreed that Diplomacy is tremendous fun, as long as you have 7+ hours to spare and a house with lots of separate rooms for private conversations to take place in. But the important thing to keep in mind is that the only way to win the game is by first making alliances with other players by promising you would never betray them, and then second by strategically betraying your allies before they can betray you. If everyone goes into the game knowing that betrayal is the point of the game and that they will be stone-cold lied to by their dearest friends, then there's no hard feelings. But if you're a brand new player and this metagame isn't properly expressed to you, then it's natural for fallout to ensue.


> 7+ hours to spare

It only takes this long if you don't use a round timer and you don't allow shared victories. Even with new players we usually wrap up the game in 4-5 hours (people will start getting eliminated around hour 2 - so usually we have a "loser room" with other games and stuff to do).

You can also do it online with a turn a day (I ran an office Diplomacy League this way).

It's worth noting that you can play the game completely openly and honestly. We have had complete victories where the winner never once backstabbed anyone (he was just a very shrewd negotiator). It's just pretty rare because the more honest you are all game, the more reward you will get for a well placed backstab.


Also, my advice: have a cash pot, even if it's just $20 a player. You'll only get players who are actually competitive. And, weirdly, fewer hard feelings - I guess knowing you got betrayed for money stings less than knowing you got betrayed for fun.


Do you allow negotiation over how the pot is split for shared victories? Like can a weak player help the leader win in exchange for a small cut of the winnings?


Exactly.

Usually I set it up like "The pot is $350. The game is over as soon as all remaining players can unanimously agree on how to split it up."

It also adds much more drama to the end game. If you are down to a small couple of territories and basically out of the game, you might still have a lot of power to negotiate your way into the winner's circle.


Are there games similar to Diplomacy that you can play in 1-1.5 hours, over discord video chat?


There are a lot of negotiation/tactical games (Game of Thrones board game comes to mind), but they all can be equally long and none have the simplicity of Diplomacy.

IMHO, the fun of Diplomacy is sneaking in little side conversations and hiding in a corner to tell a secret. In person, it's very fun. A better approximation online would be to play asynchronously (like a turn a day) via a website.


In high school our math department had a long-running match in the teacher’s lounge with one move per day. Many times I remember my calculus teacher coming into class after the lunchtime move SEETHING.


Diplomacy is a perfect game to play one move per day among a group of friends/coworkers. Sadly, I never managed to find enough people to get on board and commit to playing a weeks- or months-long game.


We did it once with a group of coworkers in the same open office room, it was great.

Then I moved to another team in another room and thought I could continue the game as before... Immediately backstabbed by everybody. Bastards.


There are variant maps for fewer players, like this one for 3: https://www.playdiplomacy.com/forum/viewtopic.php?t=57439


We did this in grad school and even played a one-move-per-week game. Fun, but man did we spend a lot of time negotiating...


The world champion seems to say that the best Diplomacy involves honesty and cooperation. Is he eliding that point toward the end where you use that honesty and cooperation to flip and really screw someone? Or is there more to Diplomacy than just "convince people you're cooperating until the very best time to betray them"?


Anecdotally, from prior play, most Diplomacy wins have two broad forms:

1. A second-place player has fallen behind in a low or no hard betrayal alliance and is no longer capable of a meaningful backstab, and has decided they don't plan to backstab you, their partner, because through their ongoing cooperation they're the second-biggest player and you've had a good game together. They work to cement your win, because picking the winner is often as fun as winning is.

2. Two main players and their side henchmen who are no longer serious contenders are forced into teaming up to prevent the other side of the board's leader alliance from running away with the win. There's been a massive amount of betrayal, and the table is about to have the crucial fight that will collapse one or the other's defensive line. One disgruntled player who is on the dividing line of both alliances picks the winning coalition by lashing out against the closest player that screwed them over hardest, ruining that side's coordination. The winning coalition breaks through, then the biggest coalition's leader backstabs and eats its subordinates for the win.

In both cases, honesty and cooperation primarily decide the winner - either because other players have deemed you “deserving of the win” or “designated winner by dint of having successfully avoided leaving one or more key players disgruntled enough to tip against you”

You _always_ need cooperation in these wins, but you don’t always per se need to backstab people to win. Insofar as you do, those backstabs come in many flavors and often don’t feel stabby, stuff like “I’m just consistently benefiting a little more from our mutual arrangement than you are” or “I have no plan to personally screw you over, but I’m pretty sure Gary is going to do it for me and I’m not gonna stop him.”


> I’m just consistently benefiting a little more from our mutual arrangement than you are

Sounds like Go.


And the most stressful way to play Diplomacy is “one turn per day.” You’ll obsess about it all evening long, then get up and immediately focus on it again!


> Be warned, it is considered to be a game that will end friendships that have lasted for years, and monopoly is considered to be a game that causes less bad feelings.

Well, there is also Machiavelli… ;-)

*) https://en.wikipedia.org/wiki/Machiavelli_(board_game)


"Be warned, it is considered to be a game that will end friendships that have lasted for years"

I really don't get why people take this game so personally... it's only a game!


Diplomacy has nothing on the screaming matches I've seen and experienced with Chinatown. And you can get there in a fraction of the time required for Diplomacy.


I only played one game, and this is accurate.


> An AI may be able to speak like a person but will never be able to hang onto that long burning simmering hatred

Ever heard of Roko's basilisk? I am telling you, AIs can hold grudges.

