Study warnings before security risks as "OS agents" receive control over computers and telephones

August 12, 2025

216

Researchers have published them Most extensive survey So -called so -called “OS agent”Systems for artificial intelligence that may autonomously control computers, mobile phones and web browsers by interacting directly with their interfaces. The 30-page academic review, which is accepted for publication on the distinguished publications Association for Computer Linguistics The conference is a rapidly developing area on which billions of investments of huge technology corporations were attracted.

“The dream of making AI assistants who’re as capable and versatile because the fictional Jarvis by Iron Man has long fascinated ideas,” the researchers write. “With the event of (multimodal) major language models ((m) llms), this dream is closer to reality.”

The survey led by researchers from Zhejiang University And OPPO AI CenterIt happens that enormous technology corporations use AI agents who can perform complex digital tasks. Openai recently began “operator“Anthropic published”Computer use“Apple conducted prolonged AI functions in” one “Apple Intelligence“And Google reveals”Project Seepfahrer” – All systems for automation of computer interactions.

OS agents work by observing computer screens and system data after which performing actions resembling clicks and wiping via mobile, desktop and web platforms. The systems must understand interfaces, plan multi -stage tasks and translate these plans into executable code. (Credit: Github)

Tech giants hurry to supply Ki who controls your desktop

The speed at which academic research has become consumer products is unexplored even in accordance with Silicon Valley standards. The Opinion poll Shows a research explosion: over 60 Foundation models and 50 agents frameworks, which have been specially developed for computer control, with the publication rates have been dramatically accelerating since 2023.

This will not be just incremental progress. We are witnesses to the event of AI systems that may understand and manipulate the digital world as people do. Current systems work by taking out screenshots of computer screens, using prolonged computer vision to know what’s displayed, after which precise actions resembling click on buttons, filling out forms and navigation between applications.

“OS agents can do tasks autonomously and have the potential to significantly improve the lifetime of billions of users worldwide,” said the researchers. “Imagine a world wherein tasks resembling online shopping, travel preparations and other every day activities may be carried out seamlessly by these agents.”

The most demanding systems can process complex multi-level workflows that include various applications-a restaurant reserve, then robotically added to your calendar after which a memory to depart the traffic at an early stage. What people needed and typed for minutes can now happen in seconds without human intervention.

The development of AI agents requires a posh training pipeline, which mixes several approaches, from the initial preliminary formation to screen data and even reinforcement learning, which optimize the performance through experiments and errors. (Credit: arxiv.org)

Why security experts grant alarms via AI-controlled company systems

For company leaders for corporate technology, the promise of productivity gains is related to a sobering reality: these systems represent a totally recent area of attack that almost all organizations don’t provide for defense.

The researchers dedicate essential attention to what they describe diplomatically “Security and privacy“Consideration, but the consequences are more alarming than their academic language suggests.

The attack methods that you just document read like a cyber security album. “Web indirect prompt injection”Enables malicious actors to cover the hidden instructions in web sites that may kidnap the behavior of a AI agent. Even more worrying are“ environmental injection attacks ”, wherein apparently harmless web content may cause agents to steal user data or perform non -authorized actions.

Take under consideration the consequences: A AI agent with access to your organization -e -e emails, financial systems and customer databases may very well be manipulated by a rigorously designed website to peel sensitive information. Traditional security models based on human users who can discover obvious phishing tests collapse when the “user” is a AI system that processes information otherwise.

The survey shows a relevant gap in preparation. While general security framework for AI agents exist, “studies on defendings which might be specific for OS agents remain. This will not be just a tutorial concern, but a right away challenge for each organization that’s considering the availability of those systems.

The reality test: Current AI agents still have problems with complex digital tasks

Despite the hype about these systems, the evaluation of the performance benchmarks through the survey shows significant restrictions that report expectations for immediate widespread acceptance.

The success rates vary dramatically from various tasks and platforms. Some industrial systems achieve success rates of over 50% for certain benchmarks – impressive for an up -and -coming technology – but fight with others. The researchers categorize evaluation tasks in three types: basic “GUI oral” (understanding of the interface elements), “information call” (finding and extracting data) and sophisticated “agent tasks” (multi-level autonomous operations).

The pattern is meaningful: Current systems are characterised by easy, well -defined tasks, but have faltered in the event that they are confronted with the kind of complex, context -dependent workflows that outline a big part of contemporary knowledge work. You can reliably click on a particular button or fill out a regular form, but should take care of tasks that require sustainable considering or adapting to unexpected changes to the interface.

These performance gap explains why early deployments concentrate on close, highly volume tasks than on general automation. The technology will not be yet ready to switch human judgment in complex scenarios, nevertheless it is increasingly capable of deal with the routine digital employees.

OS agents depend on interconnected systems for perception, planning, storage and motion execution. The complexity of coordinating these components explains why current systems still should take care of highly developed tasks. (Credit: arxiv.org)

What happens if AI agents learn to adapt to each user?

Perhaps probably the most fascinating and potentially transformative challenge within the survey includes what researchers call “personalization and self -evaluation”. In contrast to today's stateless AI assistants who treat any interaction as independent, future operating system supply means from user interactions and adapt to individual preferences over time.

“The development of personalized OS agents was a long-term goal in AI research,” the authors write. “A private assistant is predicted to adapt repeatedly and offer improved experiences based on individual user preferences.”

This ability could fundamentally change the best way we interact with technology. Imagine an AI agent who learns your e -mail style, understands your calendar preferences, knows which restaurants you like and might make increasingly demanding decisions in your name. The potential productivity gains are enormous, but in addition the consequences on privacy.

The technical challenges are considerable. The survey refers back to the need for higher multimodal storage systems that can’t only treat text, but in addition images and language, whereby “essential challenges” for current technology are presented. How do you create a system that remembers your preferences without making a comprehensive variety of monitoring data of your digital life?

For technological managers who evaluate these systems, this personalization challenge is each the best opportunity and the best risk. The organizations that solve it first will achieve significant competitive benefits, but the consequences on privacy and security may very well be serious if handled.

The race for the development of AI assistants who can really work like human users increases quickly. While basic challenges with regard to security, reliability and personalization remain unsolved, the trajectory is evident. The researchers keep an open source repository tracking developments and recognize that “OS agents are still of their early developmental stages” with “quick progress that proceed to introduce recent methods and applications”.

The query will not be whether AI agents will change our interaction with computers – it is whether or not we’re ready for the results in the event that they do that. The window for the appropriate safety and data protection frameworks narrow as quickly because the technology progresses.

Study warnings before security risks as “OS agents” receive control over computers and telephones

Tech giants hurry to supply Ki who controls your desktop

Why security experts grant alarms via AI-controlled company systems

The reality test: Current AI agents still have problems with complex digital tasks

What happens if AI agents learn to adapt to each user?

LEAVE A REPLY Cancel reply

Must Read

Moxie Marlinspike offers a privacy-conscious alternative to ChatGPT

Microsoft's AI deal guarantees digital sovereignty for Canada, but is that a promise the country can keep?

Microsoft's AI deal guarantees digital sovereignty for Canada, but is that a promise they’ll keep?

AI could make dead people talk – why doesn’t that comfort us?

The world's first social media wargame shows how AI bots can influence elections

Friday Essay: In our age of AI and constant crisis, real-world community is powerful and helpful

Reports of “AI psychosis” are emerging – here’s what a psychiatric doctor has to say

Latest articles

Moxie Marlinspike offers a privacy-conscious alternative to ChatGPT

Microsoft's AI deal guarantees digital sovereignty for Canada, but is that a promise the country can keep?

Microsoft's AI deal guarantees digital sovereignty for Canada, but is that a promise they’ll keep?

Our Newsletter

Study warnings before security risks as “OS agents” receive control over computers and telephones

Tech giants hurry to supply Ki who controls your desktop

Why security experts grant alarms via AI-controlled company systems

The reality test: Current AI agents still have problems with complex digital tasks

What happens if AI agents learn to adapt to each user?

RELATED ARTICLES

LEAVE A REPLY Cancel reply

Must Read

Latest articles

Our Newsletter