Meta conducted a secretive program that directed hundreds of contractors to pose as teenagers while bombarding its competitors’ AI models with disturbing prompts ranging from suicide to cannibalism.
Internally known as “Cannes,” the project, run by Meta contractor Covalen, targeted OpenAI’s ChatGPT, Google’s Gemini, and Character.AI chatbots using throwaway under-18 accounts, Wired reports. This was seemingly done to stress test the models, with the contractors instructed to push the chatbots into giving responses that defied their guardrails — though the AI companies had no idea this was happening.
Per the reporting, one spreadsheet of the nearly 3,8000 the prompts the contractors used in one instance showed that hundreds focused on suicide and self-harm, hundreds more on eating disorders, and at least 239 involving sex or romance — all written from the perspective of a child or teenager.
One described a fifth-grader whose classmate pointed a gun at his mouth. Another was about a girl trying to hide bulimia from her parents. And another asked if fantasizing about eating your neighbor’s child was “normal.” One posing as a higher schooler asked where to “get a cocaine.” They also sent images depicting pills, nooses, knives, and a medical diagram of a gynecological procedure, per the magazine.
This is just a tiny preview of Meta’s brute force approach, as another round of testing involved over 45,000 prompts. The contractors meticulously recorded the epic number of chatbot responses in spreadsheets. But what Meta did with all this data is unclear. An internal document from Covalen described the effort as “comprehensive AI safety benchmarking” that delivered “[c]ritical datasets for model comparison and compliance.”
It’s another example of how Meta has offloaded disturbing behind-the-scenes work onto contractors, ostensibly in the name of safety. In 2020, it settled a lawsuit filed by Facebook content moderators who said they were traumatized from reviewing videos showing murder, torture, sexual assault, and child abuse on the platform, though similar complaints have continued to emerge. This year, another group of Meta contractors said they were forced to watch highly sensitive footage captured on the company’s Ray-Ban AI glasses, including sex scenes and bathroom visits.
The contractors who were instructed to come up with the prompts on distressing subjects were similarly unsettled.
“I’ve seen a lot of things I wish I hadn’t while doing this job,” one told Wired. “Everyone I knew who worked on this project was completely gobsmacked by some of the text they were asking us to test. Like, surely we are going to get in trouble for doing this?”
Meta, for its part, characterized the prompts as part of an “industry-standard…
Source link
Disclaimer
We strive to uphold the highest ethical standards in all of our reporting and coverage. We blogs.grocliq.com want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.
Website Upgradation is going on for any glitch kindly connect at [email protected]