In a recent interview, Nenad Tomašev, Senior Staff Research Scientist at Google DeepMind, described the sorts of traps that malicious actors are setting in order to take control of systems, take money, and jailbreak models without any of it being visible to the average user. Tomašev said this is already happening.
Agentic AI Agents At Scale Tips Them Toward Failure
Host Hannah Fry asked about traps that malicious actors are setting for AI agents and Tomašev responded that it’s true, people are setting traps for AI agents in order to take advantage of them for criminal purposes. He remarked that complete reliability of every interaction is necessary but that the scale of what’s happening tips it statistically toward failure.
Fry asked:
“Just looking at the other side of this, I also wanna think about the sort of cybersecurity element of this, because as more and more agents are out there interacting in the world on the internet and so on, there are inevitably gonna be people who are trying to exploit the vulnerabilities of agents.
Tell me a little bit about agentic traps that people are laying.”
Nenad Tomašev answered that the topic is both scary and fascinating:
“This is a scary and a fascinating topic at the same time, I would say. And I think it’s one of the main reasons why these kinds of deployments at scale cannot work, right?
Because as we said, if there is not complete reliability of individual interactions, any system at scale that has many interactions is naturally going to statistically fail.
And because these systems take a lot of compute and therefore energy and money to run, if they’re not reliable, it’s just a non-starter.
And agentic traps are something that we have been thinking about for quite a while now. They can manifest in different ways.
There are many types of traps, but it boils down to agents operate within an environment. And in this context, the environment is the web.
If the environment itself is poisoned, if the traps are laid, agents may stumble upon them when interacting with the web.
And then yes, malicious people or malicious agents deployed by malicious people can place those traps and then compromise systems really.”
Kinds Of Agentic Traps To Beware Of
Host Hannah Fry then asked Tomašev how these traps are set and Tomašev provided examples, remarking that the traps aren’t going to be visible on a website but are nonetheless available to AI agents. Some of what he described will sound familiar to old-school SEOs who engaged in things like cloaking in the early days of search engines.
Tomašev said that hidden tokens could be hidden for AI agents to consume. Tokens in this context is a reference to how AI breaks words into representations of words. When an AI reads words on a page what it does is to break it down into tokens. Hidden tokens could be completely invisible to humans.
He mentioned three ways that traps could be set for AI agents:
- Hidden tokens
- Dynamic cloaking
- Content…
Source link
Disclaimer
We strive to uphold the highest ethical standards in all of our reporting and coverage. We blogs.grocliq.com want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.
Website Upgradation is going on for any glitch kindly connect at [email protected]