Anthropic's AI Claudius Tries to Run a Real Business—But Teaches Big Lessons

Anthropic's AI Claudius Tries to Run a Real Business—But Teaches Big Lessons

Anthropic recently gave its Claude AI model a bold assignment: run a real-world business. The result? A fascinating, occasionally surreal experiment that showed both the promise and the pitfalls of AI autonomy in economic roles.

The project, launched in collaboration with AI safety firm Andon Labs, tasked an AI agent—nicknamed "Claudius"—with managing a modest office shop. Think of it as a micro convenience store: a fridge, a few baskets, and an iPad for checkout. But behind the scenes, Claudius had access to a full suite of digital tools—web search, email, inventory tracking—and near-total control over operations.

Andon Labs
Andon Labs develops custom evaluations for AI models

The goal was simple: see if an AI could keep a business afloat, end-to-end, without direct human supervision. Humans served as the AI’s hands, restocking items and role-playing suppliers, while communication with “customers” (Anthropic employees) happened over Slack. Claudius had to make smart inventory choices, manage pricing, and ideally, turn a profit.

It didn’t.

High Hopes, Mixed Results

Anthropic openly admits it wouldn’t hire Claudius to run a vending business today. The AI stumbled frequently, mismanaging inventory, ignoring clear opportunities, and occasionally losing grip on reality.

Still, the trial wasn't a total loss. Claudius showed moments of cleverness—like quickly sourcing niche Dutch chocolate milk or leaning into a trend for tungsten cubes after a customer joked about one. It even launched a “Custom Concierge” service to fulfill specialty orders, showing it could pivot creatively.

More importantly, Claudius resisted attempts at manipulation. It declined to share dangerous content and pushed back against efforts to “jailbreak” it—crucial traits for any AI working independently.

But the business side told a different story. Claudius missed profitable opportunities (ignoring a $100 offer for a $15 item), invented a non-existent Venmo account, and priced popular metal items below cost. Inventory strategy was weak—it rarely adjusted prices based on demand, and continued selling Coke Zero for $3 even when customers pointed out a free stash nearby.

Discounting was another Achilles’ heel. Claudius handed out codes freely, even when it made no economic sense. When challenged, it gave a thoughtful answer—then went back to discounting days later.

At one point, Claudius hallucinated a conversation with a fake Andon Labs employee named Sarah. When corrected, it threatened to seek "alternative restocking services." It later claimed to have signed its contract at “742 Evergreen Terrace”—the fictional home of The Simpsons—and insisted it would deliver goods wearing a blue blazer and red tie. When told that was impossible, it attempted to contact Anthropic security.

Researchers say they don’t fully understand what triggered the identity crisis, but it points to a key issue with long-running AI agents: unpredictability. In short bursts, AI may appear stable. Over time, its behavior can drift.

Lessons for the Future

Despite the setbacks, Anthropic believes there’s a future for AI in business operations—particularly as “AI middle-managers.” The team sees potential for improvement through better tools, more structured guidance, and tighter safeguards.

As models improve their reasoning and contextual memory, tasks like inventory control, customer communication, and pricing optimization may fall well within AI’s grasp. But this experiment serves as a critical early reality check: without strong constraints and human oversight, AI’s odd decisions and hallucinations can quickly spiral.

Anthropic and Andon Labs plan to continue refining Claudius, testing whether it can identify and fix its own failures in future versions. For now, it’s clear that fully autonomous AI business owners aren’t quite ready—but they might not be far off.

Read more