← Home

OpenAGI's "Lux" AI Agent Claims to Outperform OpenAI and Anthropic

Published: December 03, 2025 | Source articles

The Essentials: OpenAGI's Ambitious Arrival

OpenAGI, founded by MIT researchers, has officially launched with the unveiling of Lux, an AI agent designed to automate computer tasks. According to a press release by OpenAGI, Lux aims to make human-like agents more accessible, opening the door for broader applications of artificial general intelligence (AGI). CEO Qin Zengyi stated that Lux is a foundational model capable of automatically performing operations on desktop applications by analyzing computer screenshots. As an example of its capabilities, Lux achieved an 83.6% success rate on the Online-Mind2Web benchmark test, outperforming OpenAI's Operator (61.3%) and Anthropic's Claude Computer Use (56.3%). OpenAGI is also open-sourcing OSGym, the data engine and infrastructure used to train these adaptable computer-use agents.

Beyond the Headlines: Diving into Lux's Capabilities

Nerd Alert ⚡

Lux distinguishes itself through its architecture and training methodology. Unlike competitors limited to browser-based tasks, Lux can fully control desktop applications like Excel and Slack. Imagine a flock of digital seagulls trained to recognize spreadsheets - that's Lux, diving in, pecking at the data, and flying away with the insights. This is achieved through "Agent-based pre-training," a process where the AI learns from screenshots and action sequences in a self-reinforcing cycle.

Lux operates in three distinct modes: "Actor" for quick tasks, "Thinker" for multi-step goals, and "Tasker" for user-controlled task breakdown. OpenAGI also claims that Lux operates at approximately one-tenth the cost of models from OpenAI and Anthropic, while also being faster, completing tasks in about one second per step compared to three seconds for OpenAI's Operator. Furthermore, OpenAGI provides a software development kit (SDK) for developers to build applications based on Lux. Could this open-source approach democratize AI development and foster innovation?

How Is This Different (Or Not)?: A Competitive Glance

While OpenAI and Anthropic have primarily focused on browser-based automation, Lux's ability to control full desktop applications marks a significant departure. This broader scope opens up a wider range of potential applications, from software QA to data entry and social media management. It’s worth noting that OpenAI and Anthropic recently published results of their first joint AI safety evaluation, testing each other’s models using internal safety protocols, according to *AIMagazine*. Whether OpenAGI's Lux incorporates similar rigorous safety measures remains to be seen.

Lesson Learnt / What It Means For Us

OpenAGI's Lux presents a compelling vision for the future of AI agents, promising greater automation and cost-effectiveness. The company's open-source approach could accelerate innovation and make AI more accessible. However, as with any emerging technology, careful consideration of safety and ethical implications is crucial. Will Lux truly revolutionize how we interact with computers, or will it become just another footnote in the ever-evolving AI narrative?

References

[6]
📝Introduction | OpenAGI
openagi.aiplanet.com
[10]
aimagazine.com
aimagazine.com