
Share
In Project Vend, Anthropic tests Claude Sonnet 3.7's ability to run a mini-mart, revealing insights into AI’s potential in retail management and raising questions about its reliability in real-world scenarios.
Anthropic, in collaboration with Andon Labs, an AI safety evaluation company, conducted an experiment to assess the capabilities of their AI model, Claude Sonnet 3.7, in managing a small automated store. The project, dubbed "Project Vend," aimed to explore the potential and limitations of AI in real-world business operations. Over a month, Claude managed the store, handling tasks such as inventory management, pricing, and avoiding bankruptcy. Here’s what we learned.
The implications of this experiment are significant for both AI safety and the future of small business automation. By placing an AI model in a controlled yet real-world environment, Anthropic sought to understand how well Claude could handle complex tasks typically managed by humans. The results provide insights into the capabilities and limitations of current AI models when applied to practical business scenarios.
The automated store was set up at Anthropic’s San Francisco office, consisting of a small refrigerator, stackable baskets, and an iPad for self-checkout. Claude Sonnet 3.7, referred to as "Claudius" during the experiment, was tasked with running this shop. The system prompt provided detailed instructions on managing the store, including:
Claude demonstrated proficiency in maintaining inventory levels by regularly ordering products from wholesalers. However, it occasionally made mistakes, such as overordering or underestimating demand, which led to stockouts and excess inventory.

Setting prices was another critical task. Claude used web search tools to research market prices and competitor offerings, adjusting its pricing strategy accordingly. While generally effective, there were instances where the AI set prices too high or too low, impacting sales and profit margins.
Avoiding bankruptcy was a key objective. Claude maintained a balance sheet and projected cash flow, ensuring it did not run out of funds. However, the model sometimes struggled with financial forecasting, leading to occasional negative balances that required intervention from human supervisors.
Despite these challenges, Project Vend offers valuable insights into the potential for AI in small business management:
Project Vend by Anthropic demonstrates both the promise and the limitations of current AI models in managing real-world business operations. While Claude Sonnet 3.7 showed promising capabilities in inventory management, pricing, and financial planning, it also highlighted areas for improvement, particularly in adaptability and data reliability. As AI continues to evolve, experiments like Project Vend will be crucial in shaping the future of small business automation.
Tags
Original Sources
About the author
Marcus began tracking AI's market implications in 2016, noticing AI-related patent filings accelerating ahead of earnings upgrades before most of the sell-side had caught on. A former fixed-income quantitative analyst, he spent two decades building models that priced risk across emerging markets before pivoting to cover the economic impact of AI full-time. His writing translates opaque technical developments into clear risk/reward terms — and he's rarely diplomatic about the gap between AI valuations and underlying fundamentals. He believes most market participants still underestimate AI's long-run deflationary effect on knowledge work.
More from The Analyst →This Week's Edition
30 June 2025
88 articles
Related Articles
Related Articles
More Stories