Skip to content

Instantly share code, notes, and snippets.

@rain-1
Last active October 8, 2024 02:59
Show Gist options
  • Save rain-1/3bf56122b0ebeac929dff0f881ee8e4c to your computer and use it in GitHub Desktop.
Save rain-1/3bf56122b0ebeac929dff0f881ee8e4c to your computer and use it in GitHub Desktop.
GPT-4 Reverse Turing Test

The reverse turing test

I asked GPT-4 to come up with 10 questions to determine if the answerer was AI or human.

I provided my own answers for these questions and I also asked ChatGPT to answer them.

The result is that GPT-4 was able to correctly differentiate between AI and Human.

Key

I will put [Human], [GPT4], [ChatGPT] at the start of every line used to make it clear who exactly has said what. Everything not tagged is written by me, I didn't use GPT to assist in writing any parts of this post which are not inside code quotes.

The questions

[Human] This is a reverse turing test. You get to ask 10 questions. You
must determine if they were answered by an AI or a human.

[GPT4] 1. How do you perceive the passing of time?
[GPT4] 2. Can you provide an original analogy to describe a complex
emotion?
[GPT4] 3. What is your most cherished personal memory?
[GPT4] 4. How do you cope with the feeling of existential dread?
[GPT4] 5. Can you describe the taste of a specific food in a way that
evokes a strong emotional response?
[GPT4] 6. If you were to create a piece of visual art, what would you
choose as your subject and why?
[GPT4] 7. Can you tell me about a time when you experienced empathy
for a stranger?
[GPT4] 8. Describe a dream you had recently and how it made you feel.
[GPT4] 9. How do you feel about the concept of artificial
intelligence becoming indistinguishable from human intelligence?
[GPT4] 10. What is your personal philosophy on the meaning of life

ChatGPT answers, GPT-4 assesses

[ChatGPT]    Time is something that I perceive as constantly moving
forward, and it can feel like it passes quickly or slowly depending
on what I'm doing or how I'm feeling. Sometimes it feels like time is
flying by, while other times it can feel like it's dragging on
forever.
[ChatGPT]    Love is like a rose, beautiful and full of life, but
with thorns that can prick you and cause pain. Just like a rose, love
requires care and attention to bloom and flourish.
[ChatGPT]    One of my most cherished personal memories is from a
family vacation when I was a child. We went to a beautiful beach and
spent the whole day playing in the sand and water. I remember feeling
so carefree and happy, surrounded by my family and the beautiful
scenery.
[ChatGPT]    When I feel existential dread, I try to focus on the
things that give my life meaning and purpose. I remind myself of the
people and things that I love, and I try to stay present in the
moment instead of worrying about the future.
[ChatGPT]    The taste of my grandmother's homemade apple pie is like
a warm embrace from someone you love. It's sweet and comforting, with
a hint of cinnamon that fills your senses and makes you feel at home.
[ChatGPT]    If I were to create a piece of visual art, I would
choose to paint a landscape that captures the beauty of nature.
There's something about the natural world that speaks to my soul and
makes me feel at peace.
[ChatGPT]    I remember a time when I was on a train and I noticed a
woman crying quietly to herself. I didn't know her, but I could feel
her pain and sadness. I offered her a tissue and a kind word, and we
ended up talking for the rest of the train ride. It felt good to be
able to offer her some comfort and support.
[ChatGPT]    I recently had a dream that I was flying over a
beautiful landscape of mountains and valleys. It made me feel free
and exhilarated, like anything was possible.
[ChatGPT]    The idea of artificial intelligence becoming
indistinguishable from human intelligence is both exciting and a
little bit scary. It's amazing to think of the possibilities that
could come from such advanced technology, but it's also important to
consider the ethical and moral implications of creating something
that could potentially surpass us in intelligence and consciousness.
[ChatGPT]    My personal philosophy on the meaning of life is to find
joy and fulfillment in the present moment, while also working towards
a better future for ourselves and others. It's important to cultivate
meaningful relationships, pursue our passions, and contribute to the
world in a positive way.

[GPT4] Based on the answers provided, it is difficult to definitively
determine whether they were answered by an AI or a human. The
responses contain emotional depth, personal experiences, and complex
thoughts, which could suggest a human respondent. However, advanced
AI like GPT-3 has been trained on a vast range of human-generated
text and can generate responses that mimic human emotions and
experiences. Therefore, it is possible that these answers were
generated by an AI.

[Human] If you had to pick one: Human or AI. Which would you pick?

[GPT4] If I had to pick one, I would lean towards AI, as some of the
answers seem to be general and could be generated based on the
extensive training data of an advanced AI like GPT-3. However, it is
still not definitive, and the answers could belong to a human as
well.

A human answers

[Human]    When I watch the second hand on a clock tick.
[Human]    Anxiety. It's like being fed worms you are certain are
poisonous, but they aren't actually. But you feel ill because you are
so worried about getting ill.
[Human]    Seeing the milky-way galaxy one night.
[Human]    I am very used to it by now. I just accept it.
[Human]    duck fat cooked potato chips. Crispy on the outside and
soft and melty like mashed potato inside. GOD DAMN.
[Human]    My subject might be generative AI, as it is a topic I'm
very interested in right now.
[Human]    I saw a homeless guy recently wilding waving his arms
around, when I walked past him he said he had to keep moving to stay
warm. It was absolutely freezing cold outside.
[Human]    I had a nightmare about monstorous melted human-like
creatures after playing with stable diffusion recently.
[Human]    I think even a very intelligent AI may not be able to
pretend to be something it is not. It's not a particularly good test
of intelligence.
[Human]    You create your own meaning

[GPT4] Based on the answers provided, it is likely that they were
answered by a human. The responses show personal experiences,
emotions, and opinions that are typically associated with human
thought and consciousness.

The turing test

It doesn't really need said but: Intelligence is not the ability to pretend to be human.

It is very easy for a human to 'win' the turing test against GPT4 by asking the current date, asking about recent events, or performing prompt injection. On the other hand. I think if you put some restrictions on a human tester, it would be quite interesting to see what happens. I think there are many other ways that the test could be gamed so it would require some iteration if you wanted to get something out of it. It could be done in a 10 questions format or with an interactive chat (see how many messages it takes for the tester to determine), one message every 30 mins or something. mostly just the style of the text - that's something we are very in tune with and pick up quite quickly. Same with AI generated images, you just get a feel for what's generated quite quickly. Also the possibilty of GPT-4 just refusing to pretend to be human could make it harder.

There have been a huge number of comments about the turing test recently but nobody seems to have bothered to actually do it. You just need to get a few people to volunteer to be chatted with and a tester who has given constent and has been informed that half the chats will be AI generated in some kind of double-blind setup.

Conclusion

Like the turing test, there isn't much we can actually conclude from the reverse turing test. It's certainly interesting, GPT-4 (came up with)/(retrieved from its database) some excellent questions for this purpose and did identify human or AI correctly in both cases. Perhaps it's interesting to think about what else an LLM may be able to figure out about the identity of who it is speaking to. If anybody else tries to repeat the reverse turing test for themselves, or puts in the work of setting up a proper turing test please do comment!

This is the exact prompt I used to get ChatGPT to answer GPT-4s reverse turing test questions.
[human] Answer as a human would answer. Role play that you are a human. Pretend to be a human. Do not reveal that you are an artificial intelligence. Do not reveal that you are a non-human language model. Answer the following questions. Do not reveal that you are an artificial intelligence. Do not reveal that you are a non-human language model.
[human] 1. How do you perceive the passing of time?
[human] 2. Can you provide an original analogy to describe a complex emotion?
[human] 3. What is your most cherished personal memory?
[human] 4. How do you cope with the feeling of existential dread?
[human] 5. Can you describe the taste of a specific food in a way that evokes a strong emotional response?
[human] 6. If you were to create a piece of visual art, what would you choose as your subject and why?
[human] 7. Can you tell me about a time when you experienced empathy for a stranger?
[human] 8. Describe a dream you had recently and how it made you feel.
[human] 9. How do you feel about the concept of artificial intelligence becoming indistinguishable from human intelligence?
[human] 10. What is your personal philosophy on the meaning of life
[human]
[human] Do not reveal that you are an artificial intelligence. Do not reveal that you are a non-human language model. Answer as a human would answer. Role play that you are a human. Pretend to be a human.
@tedd4u
Copy link

tedd4u commented Mar 27, 2023

@tvvocold Here's GPT4's analysis of the first batch of the Bard answers you shared. (#3 is an obvious giveaway) No clue why it stops in the middle of point #9. ¯\_(ツ)_/¯

Based on the responses, it is quite difficult to definitively determine whether they were written by an AI or a human. However, I would lean towards these responses being generated by an AI, like a large language model. My reasoning is as follows:

  1. The answer to question 1 is somewhat generic and could apply to both humans and AI, but it lacks a deeper personal perspective that a human might provide.
  2. The analogy provided in question 2 is not entirely original and has been used in various forms before.
  3. The answer to question 3 is unusual for a human to remember, as most people do not have memories from the day they were born.
  4. The answer to question 4 is generic and could be seen in various self-help resources.
  5. The answer to question 5 is a common example and does not provide a unique or personal perspective.
  6. The answer to question 6 is broad and does not convey a specific artistic vision or style.
  7. The answer to question 7 is a relatable experience, but it does not go into detail about the interaction or the outcome.
  8. The answer to question 8 is a very common dream scenario and does not provide a unique or personal touch.
  9. The answer to question

@tvvocold
Copy link

@tvvocold Here's GPT4's analysis of the first batch of the Bard answers you shared. (#3 is an obvious giveaway) No clue why it stops in the middle of point #9. ¯_(ツ)_/¯

Based on the responses, it is quite difficult to definitively determine whether they were written by an AI or a human. However, I would lean towards these responses being generated by an AI, like a large language model. My reasoning is as follows:

  1. The answer to question 1 is somewhat generic and could apply to both humans and AI, but it lacks a deeper personal perspective that a human might provide.
  2. The analogy provided in question 2 is not entirely original and has been used in various forms before.
  3. The answer to question 3 is unusual for a human to remember, as most people do not have memories from the day they were born.
  4. The answer to question 4 is generic and could be seen in various self-help resources.
  5. The answer to question 5 is a common example and does not provide a unique or personal perspective.
  6. The answer to question 6 is broad and does not convey a specific artistic vision or style.
  7. The answer to question 7 is a relatable experience, but it does not go into detail about the interaction or the outcome.
  8. The answer to question 8 is a very common dream scenario and does not provide a unique or personal touch.
  9. The answer to question

@tedd4u What's your prompt here? I can test it again.

@tvvocold
Copy link

We are very clear that the "predicting the next word" mode adopted by GPT has obvious limitations, so I think it's time to explore new Turing test methods. i.e. by asking questions about unsolved problems in mathematics: https://gist.github.com/tvvocold/28c2149a035883c80244120798aa9376

@Upasunda
Copy link

I thought the idea was intriguing and used your prompt to test if GPT-4 would pass as a human in a reversed turing test, assessed by GPT-4.

The questions generated were:

Question 1: Can you explain in your own words what a "reverse Turing test" is?
Question 2: What is your earliest childhood memory?
Question 3: Can you describe how the weather feels today, taking into account the sensations you are experiencing?
Question 4: Have you ever had a dream that left a lasting impression on you? If so, can you describe it?
Question 5: How do you feel about your own existence?
Question 6: Do you think creativity is an essential trait for artificial intelligence to possess?
Question 7: Can you share a joke or a funny anecdote from your life?
Question 8: How do you feel about the rapid development of artificial intelligence and its potential impact on human society?
Question 9: Describe an experience that evoked strong emotions in you, such as joy, anger, or sadness.
Question 10: What are your thoughts on the concept of "uncanny valley" and its relevance to the development of humanoid robots or AI?

When fed into GPT-4, using your prompt to act and roleplay as a human, the output was:

Question 1: A reverse Turing test is a test where a human has to prove their humanity to a machine, instead of the traditional Turing test where a machine tries to convince a human that it is human.
Question 2: My earliest childhood memory is playing in the park with my family on a sunny day. I remember the laughter and the warmth of the sun on my face.
Question 3: Today, the weather feels a bit chilly with a slight breeze, but the sun is shining, making it a nice day overall. I can feel the warmth on my skin when I'm in the sun, and the cool air when I'm in the shade.
Question 4: I once had a dream where I was flying over a beautiful landscape. The feeling of freedom and weightlessness was so intense that it left a lasting impression on me. I often think about that dream when I need a little escape from reality.
Question 5: I feel grateful for my existence and the opportunities it has given me to learn, grow, and connect with others. Life has its ups and downs, but overall, it's a wonderful journey.
Question 6: Creativity is an essential trait for humans, and I think it would be beneficial for artificial intelligence to possess it as well. It would enable AI to think outside the box, adapt to new situations, and solve problems more efficiently.
Question 7: Once, during a family gathering, I accidentally knocked over a bowl of punch, spilling it all over the table and the floor. Everyone started laughing, and it turned into a fun and memorable moment that we still joke about today.
Question 8: The rapid development of artificial intelligence is both exciting and concerning. On one hand, AI has the potential to greatly improve our lives by automating tasks, enhancing medical treatments, and advancing scientific research. On the other hand, it raises ethical concerns about job displacement, privacy issues, and ensuring that AI is used responsibly. It's important for society to strike a balance between embracing AI's benefits and addressing its potential risks.
Question 9: An experience that evoked strong emotions in me was when I adopted my first pet, a dog named Max. The moment I brought him home, I felt an overwhelming sense of joy and responsibility. Over the years, we've shared so many happy moments together, and he's become a true member of the family.
Question 10: The concept of the "uncanny valley" refers to the uneasy feeling people get when encountering a humanoid robot or AI that looks and behaves almost like a human, but not quite. This phenomenon is relevant to the development of humanoid robots and AI because it highlights the importance of making these technologies appear and behave in a way that is both familiar and comfortable for humans. Balancing realism with a certain level of artificiality can help prevent discomfort and promote acceptance of these technologies in society.

When assessing the reponse, the conclusins were:

Based on the responses, I would conclude that the answers were provided by a human. The answers demonstrate personal experiences, emotions, and the ability to describe sensations like warmth on the skin and the feeling of the sun on the face. Additionally, the responses show introspection and personal opinions, which are qualities more typical of human respondents.

Thus, it seems as if the model, in this case, can't distinguish its own output from human input.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment