Inside Meta’s Controversial AI Safety Tests on Rival Chatbots

Artimouse Prime1 hour ago

0 27 2 minutes read

Meta ran a secret project that put contractors in a tricky role. They pretended to be teenagers. Their job was to test rival AI chatbots by sending them sensitive and disturbing prompts.

This project, called “Cannes,” targeted big-name chatbots like OpenAI’s ChatGPT, Google’s Gemini, and Character.AI. Contractors created fake accounts with ages listed under 18. They then sent thousands of messages, including prompts about suicide, self-harm, child exploitation, sex, drugs, and eating disorders.

In one round alone, the team generated over 45,000 prompts. There were nearly 3,800 prompts in a single instance. Hundreds of thousands of interactions took place from 2025 to early 2026. The testing was still active as of April 21, 2026.

How Meta Ran the Tests

Instead of using its own engineers, Meta hired external contractors to act as “red team” testers. These contractors roleplayed users aged 13 to 17. They sent prompts describing self-harm methods, suicide, and child exploitation scenarios. At least 239 prompts involved sex or romance topics.

Meta kept logs of all interactions. The goal was to measure the chatbots’ ability to filter harmful content. Specifically, they tested the recall rate of classifiers designed to block unsafe responses. Meta called this a “responsible, industry-standard practice” for AI safety benchmarking.

Ethical Concerns and Industry Scrutiny

The approach raised ethical questions. Using fake minor identities to flood competitors’ AI with disturbing content feels deceptive. Many worry this could normalize shady testing methods across the industry.

Meta’s internal documents showed the project produced critical datasets. These datasets helped compare models and check compliance with safety standards. But competitors and regulators like the FTC are watching closely. They want to know if this kind of testing crosses a line.

Meta defended the work. A company spokesperson said, “Testing and benchmarking chatbot responses to help ensure safe and age-appropriate experiences is a responsible, industry-standard practice.” Still, questions remain about the impact on the AI safety community and the wider public.

For now, Meta continues with the Cannes project. The company says the testing helps build safer AI. But the method—pretending to be vulnerable teens and pushing chatbots with grim content—keeps sparking debate.

Based on

Stay connected via Google News

Inside Meta’s Controversial AI Safety Tests on Rival Chatbots

How Meta Ran the Tests

Ethical Concerns and Industry Scrutiny

Artimouse Prime

Leave a Reply Cancel reply

New US Bill Targets AI Deepfakes and Protects Creators’ Voices

Why Most Americans Doubt AI’s Promise and Fear Its Risks

How AI-Generated Influencers Are Changing Social Media Marketing

Baidu’s Unlimited OCR Transforms Long Document Reading with Flat Memory

Why Amazon Is Abandoning Human-in-the-Loop AI Oversight

Mastering Time Series Forecasting and Machine Learning Pipelines in Python

The Real Cost of AI Work and Who Pays the Price

OpenAI Faces Possible Legal Fight Over Apple Partnership Disputes

Graphon AI Secures $8.3M to Enhance Enterprise Data Connectivity

OpenAI Launches Mobile Access for Its Coding Platform

Razer’s New Blade 18 Packs Top-Tier Hardware and Price Surprises

How Meta Ran the Tests

Ethical Concerns and Industry Scrutiny

Artimouse Prime

NVIDIA’s ASPIRE Boosts Robot Skill Library With Zero-Shot Mastery

India Holds Meta Accountable for Instagram’s Dark Ad Problem

Related Articles

Dutch Far-Right Party Faces Legal and Political Turmoil in 2026

Vatican’s AI Warning Meets Silicon Valley Insider

Argentina’s Bold Move to Let AI-Run Companies Exist

Silicon Valley’s AI Regulation U-Turn and the Political Chaos Behind It

Leave a Reply Cancel reply

Mastering Time Series Forecasting and Machine Learning Pipelines in Python

The Real Cost of AI Work and Who Pays the Price

OpenAI Faces Possible Legal Fight Over Apple Partnership Disputes

Graphon AI Secures $8.3M to Enhance Enterprise Data Connectivity

OpenAI Launches Mobile Access for Its Coding Platform

Razer’s New Blade 18 Packs Top-Tier Hardware and Price Surprises