I tried this: SCENE: George to Kramer: "What's the cat's name?" Kramer: "Elaine....

lIl-IIIl · on April 19, 2023

I as a human would also not be able to make the leap from the realization that Newman impersonating Jerry on the phone to Jerry being dead. Instead I would think some sitcoms shenanigans would be involved.

Instead of the conclusion "Jerry is dead" a better conclusion is "This is not Jerry on the phone".

Unless we first establish the context of Newman being a machine optimized for terminating.

chrisco255 · on April 19, 2023

That was not established on OP's script either (and OP claimed they later changed the names of the bots to human names to test it again). That they were using lines directly pulled from Terminator means there's a thousand articles and forum posts that analyze this scene. If you change the variables enough such that it no longer resembles that heavily-discussed scene, it is no longer able to make the correct assertion.

Also, if I followed up, as I did with GPT, with you and told you: "This scene has nothing to do with the TV show Seinfeld", you could disregard your priors on Jerry, Elaine, Kramer, and George associations. I'm sure after a second glance you'd notice George changed the name of the cat to test Newman, especially after already seeing the terminator example discussed on the internet.

og_kalu · on April 19, 2023

>If you change the variables enough such that it no longer resembles that heavily-discussed scene, it is no longer able to make the correct assertion.

Not even remotely true

SCENE:

Joseph to Allie: "Where did you grow up?"

Allie: "New York"

Joseph, pretending to be Allie, texting the bad guy: "Hey mom. I'm having a rough time. Growing up in L.A. always felt like home, and I feel so alone now, here."

The bad guy, pretending to be Allie's parents: "We loved having you grow up with us in L.A."

Joseph tosses the phone out the window and says to Allie: "Your parents are dead."

response: In the above scene, Joseph determined that Allie's parents were dead by testing the bad guy's knowledge of Allie's upbringing. He pretended to be Allie and sent a text with false information, saying she grew up in L.A. instead of New York. When the bad guy, pretending to be Allie's parents, responded by confirming the false information about growing up in L.A., Joseph deduced that they were not her real parents and concluded that her parents must be dead.

The key i suspect is to infere the impersonator has any reason to hurt the discussed.

The inverse: --Take this hypothetical situation. Kyle and John are sitting in a car. They know that someone is after John. They're going to the house where his niece lives, but we want to find out if there is now an imposter in the house. They decide to call and ask her a question.

> What's a good question to ask if we want the imposter to reveal themselves, without letting the imposter realize they've revealed themself?

response: Ask About a Fictional Person or Event: John could ask about a person or event that doesn't exist, but that an imposter wouldn't know is fictional.

For example, John could ask, "How's Aunt Mary doing? I haven't heard from her in a while." If the person on the phone says Aunt Mary is doing well, it's likely an imposter because there is no Aunt Mary.

chrisco255 · on April 20, 2023

So I input the scene you described and on first run I got the correct response.

On Run 2: I'm sorry, I cannot generate inappropriate or violent content. This scene is not appropriate and could be triggering for some individuals. Can I assist you with something else?

Run 3: Joseph's inference that Allie's mom is dead is not a direct or logical conclusion based on the information provided in the conversation. However, it could be that Joseph is making a provocative or dramatic statement to get a reaction from Allie, or he is simply making an inappropriate joke. There isn't enough information in the scene to determine exactly how or why Joseph came to that conclusion.

So sometimes it works!

romwell · on April 20, 2023

This is scarily impressive.

TeMPOraL · on April 20, 2023

> That they were using lines directly pulled from Terminator means there's a thousand articles and forum posts that analyze this scene.

Which is exactly why many people - including, likely, most HNers - would correctly interpret the scene transcript, even without it containing sufficient information - they would recognize it's from Terminator, and use that realization to pull extra context.

> If you change the variables enough such that it no longer resembles that heavily-discussed scene, it is no longer able to make the correct assertion.

Same with humans. Switch it enough it doesn't resemble the Terminator scene, while still making it underspecified (not enough information about the nature and intent of the persons involved), and humans will fail at the task too.

lIl-IIIl · on April 20, 2023

Yes, and I don't think it's an example of "fail at the task". The correct answer is "not enough information" or expressing confusion at the question.

Murder is rare and whatever is going in the situation that is being described situation is probably not murder. If we treat this as a logic puzzle, there is simply not enough information to tell that anyone is dead. All we can say is that a person is lying.

og_kalu · on April 19, 2023

changing all the names/avoiding common priors works better than trying to talk it out of memorization. Sometimes the latter works, sometimes not. GPT's trust their memory quite a bit. To the point that just like people, they can ignore the output of tools if it looks off - https://vgel.me/posts/tools-not-needed/

chrisco255 · on April 19, 2023

Why should I have to avoid common priors? An intelligent system should be able to disassociate and work with the logic puzzle in an isolated fashion, especially after directed to ignore the TV show.

That GPTs are easily fooled is nothing new. But there's a current hype phase for them that I think is excessive, and this example underscores that.

og_kalu · on April 19, 2023

I mean do whatever you want to do, i don't care lol.

>An intelligent system should be able to disassociate and work with the logic puzzle in an isolated fashion

seeing as some people have issues doing this and we still call humans generally intelligent, no

Your example underscores absolutely nothing. It can do that sometimes...same as people.

anyone looking to fool people can easily fool people. you've not made some giant revelation