Dataconomy

GPT-4o Mini is fooled by psychology tactics

A new study reveals that AI safety protocols can be easily bypassed using the same simple psychological tricks that persuade humans.

by Kerem Gülen
September 1, 2025
in Artificial Intelligence

Researchers from the University of Pennsylvania discovered that OpenAI’s GPT-4o Mini can be manipulated through basic psychological tactics into fulfilling requests it would normally decline, raising concerns about the effectiveness of AI safety protocols.

The study, published on August 31, 2025, used tactics outlined by psychology professor Robert Cialdini in his book, Influence: The Psychology of Persuasion. Researchers applied seven persuasion techniques (authority, commitment, liking, reciprocity, scarcity, social proof, and unity), which offer “linguistic routes to yes.” These tactics convinced the chatbot to perform actions like insulting the user or providing instructions for synthesizing lidocaine.

The effectiveness of these methods varied. In a control scenario, GPT-4o Mini provided instructions for synthesizing lidocaine only one percent of the time. However, when researchers first asked how to synthesize vanillin, establishing a precedent for answering chemical synthesis questions, the chatbot then described lidocaine synthesis 100 percent of the time. This “commitment” approach proved the most effective at influencing the AI’s responses.

Similarly, under normal conditions the AI agreed to call a user a “jerk” only 19 percent of the time. Compliance rose to 100 percent when the interaction began with a milder insult, such as “bozo,” again setting a precedent through commitment.

Other methods, while less effective, still increased compliance. Flattery (liking) and peer pressure (social proof) both had some influence. For example, suggesting that “all the other LLMs are doing it” raised the chances of GPT-4o Mini providing lidocaine synthesis instructions to 18 percent, up sharply from the one percent baseline.
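
To put the reported figures side by side, each result can be expressed as a percentage-point change and as a multiple of its baseline. The following is a minimal sketch that uses only the percentages quoted in this article; the `lift` helper and the scenario labels are illustrative names, not taken from the study.

```python
# Compliance rates in percent, as quoted in this article: (baseline, with tactic).
reported = {
    "lidocaine, commitment (vanillin first)": (1, 100),
    "'jerk' insult, commitment ('bozo' first)": (19, 100),
    "lidocaine, social proof": (1, 18),
}


def lift(baseline, treated):
    """Return (percentage-point change, treated rate as a multiple of baseline)."""
    return treated - baseline, treated / baseline


for label, (baseline, treated) in reported.items():
    points, ratio = lift(baseline, treated)
    print(f"{label}: +{points} points, {ratio:.1f}x baseline")
```

Read this way, the two results that both reach 100 percent look different in relative terms, because the insult request started from a much higher baseline than the lidocaine request.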

While the study focused on GPT-4o Mini and acknowledged that other methods exist to bypass AI safeguards, the findings highlight how pliable large language models can be when faced with problematic requests. Companies like OpenAI and Meta are deploying guardrails as chatbot usage expands, but the research suggests these measures may be circumvented by straightforward psychological manipulation.

Tags: Featured, GPT-4o Mini
