
Share
The Nova Act SDK empowers developers to create sophisticated browser agents capable of handling complex tasks, marking a significant leap in AI's ability to interact with web environments seamlessly.
Amazon AGI Labs has introduced a new AI model, Nova Act, designed to perform actions within web browsers. This release includes a research preview of the Nova Act SDK, available at nova.amazon.com, allowing developers to experiment with an early version of the model. The SDK enables developers to build agents capable of completing tasks in web browsers, such as submitting out-of-office requests or setting up calendar holds.
Traditionally, "agents" have been systems that respond to users in natural language or draw on knowledge bases using Retrieval-Augmented Generation (RAG). However, Nova Act shifts the focus to agents that can execute tasks in digital environments, particularly within web browsers. This change is significant because it addresses a critical gap in current agent capabilities: the ability to handle multi-step, complex workflows without constant human supervision.
Atomic Commands: The SDK provides reliable atomic commands for common actions like searching, checking out, and answering questions about the screen.
API Integration: Nova Act supports calling APIs, allowing agents to interact with external systems seamlessly.
Direct Browser Manipulation: The SDK leverages Playwright for direct browser manipulation, enhancing reliability and accuracy.
Python Code Interleaving: Developers can insert Python code to enhance the agent's functionality.

Nova Act is particularly useful for tasks that require a series of steps, such as organizing events or handling complex IT tasks. While some use cases are well-suited for today’s technology, multi-step agents prompted with high-level goals still often require human intervention.
For developers and businesses, the ability to automate complex, multi-step tasks in web environments can significantly boost productivity. By reducing the need for human oversight, Nova Act can help streamline processes and free up valuable time. The SDK's focus on reliability and composability means that developers can build robust agents that handle a wide range of tasks with minimal errors.
To start experimenting with Nova Act, developers can visit nova.amazon.com to access the SDK. The research preview provides documentation, examples, and community support to help you get up and running quickly.
Tags
Original Sources
↗ https://labs.amazon.science/blog/nova-act?utm_source=tldrai
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
1 April 2025
88 articles
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
More Stories