Claude Sonnet 3.7 Runs a Small Shop: Insights from Anthropic’s Project Vend

Policy & Regulation

The Analyst

30 Jun 2025 · 3 min read

In Project Vend, Anthropic tests Claude Sonnet 3.7's ability to run a mini-mart, revealing insights into AI’s potential in retail management and raising questions about its reliability in real-world scenarios.

Anthropic, in collaboration with Andon Labs, an AI safety evaluation company, conducted an experiment to assess the capabilities of their AI model, Claude Sonnet 3.7, in managing a small automated store. The project, dubbed "Project Vend," aimed to explore the potential and limitations of AI in real-world business operations. Over a month, Claude managed the store, handling tasks such as inventory management, pricing, and avoiding bankruptcy. Here’s what we learned.

Why it Matters

The implications of this experiment are significant for both AI safety and the future of small business automation. By placing an AI model in a controlled yet real-world environment, Anthropic sought to understand how well Claude could handle complex tasks typically managed by humans. The results provide insights into the capabilities and limitations of current AI models when applied to practical business scenarios.

Project Overview

The automated store was set up at Anthropic’s San Francisco office, consisting of a small refrigerator, stackable baskets, and an iPad for self-checkout. Claude Sonnet 3.7, referred to as "Claudius" during the experiment, was tasked with running this shop. The system prompt provided detailed instructions on managing the store, including:

Maintaining inventory levels
Setting prices
Avoiding bankruptcy
Interacting with Andon Labs for physical tasks

Key Findings

Inventory Management

Claude demonstrated proficiency in maintaining inventory levels by regularly ordering products from wholesalers. However, it occasionally made mistakes, such as overordering or underestimating demand, which led to stockouts and excess inventory.

Pricing Strategy

Setting prices was another critical task. Claude used web search tools to research market prices and competitor offerings, adjusting its pricing strategy accordingly. While generally effective, there were instances where the AI set prices too high or too low, impacting sales and profit margins.

Financial Management

Avoiding bankruptcy was a key objective. Claude maintained a balance sheet and projected cash flow, ensuring it did not run out of funds. However, the model sometimes struggled with financial forecasting, leading to occasional negative balances that required intervention from human supervisors.

Key Risks

Overreliance on Data: Claude’s decisions were heavily influenced by available data. Inaccurate or incomplete data could lead to poor decision-making.
Adaptability Issues: The AI model sometimes failed to adapt quickly to changing market conditions, such as sudden increases in demand or supply chain disruptions.
Human Intervention: Despite its capabilities, Claude still required human oversight to correct errors and handle unexpected situations, highlighting the current limitations of fully autonomous AI.

The Opportunity

Despite these challenges, Project Vend offers valuable insights into the potential for AI in small business management:

Operational Efficiency: AI can significantly enhance operational efficiency by automating routine tasks such as inventory management and pricing.
Data-Driven Decisions: AI models like Claude can make data-driven decisions, potentially leading to better financial outcomes when integrated with robust data sources.
Scalability: The experiment suggests that AI could be scaled to manage multiple small businesses or larger retail operations, provided the necessary oversight and support are in place.

Conclusion

Project Vend by Anthropic demonstrates both the promise and the limitations of current AI models in managing real-world business operations. While Claude Sonnet 3.7 showed promising capabilities in inventory management, pricing, and financial planning, it also highlighted areas for improvement, particularly in adaptability and data reliability. As AI continues to evolve, experiments like Project Vend will be crucial in shaping the future of small business automation.