launched by alibaba open sandboxis an open-source device designed to offer AI brokers with a safe and remoted surroundings for code execution, net shopping, and mannequin coaching. was launched below Apache 2.0 licensethe proposed system goals to standardize the “execution layer” of AI agent stacks, offering a unified API that works throughout totally different programming languages and infrastructure suppliers. The device is constructed on the identical inside infrastructure that Alibaba leverages for large-scale AI workloads.
Expertise gaps in agent workflows
Constructing an autonomous agent sometimes requires two parts: a “mind” (often a big language mannequin) and a “device” (code execution, net entry, or file manipulation). Offering a safe surroundings for these instruments required builders to manually configure Docker containers, handle advanced community isolation, and depend on third-party APIs.
OpenSandbox addresses this downside by offering a standardized, safe surroundings wherein brokers can execute arbitrary code and work together with interfaces with out risking the integrity of the host system. Abstracts the underlying infrastructure and permits builders to make use of a single API to maneuver from native improvement to production-scale deployment.
structure
The structure of OpenSandbox is as follows: Constructed on a modular 4-layer stack— together with SDK layer, spec layer, runtime layer, and sandbox occasion layer– Designed to separate shopper logic from the execution surroundings. On the core of this technique, we leverage FastAPI-based servers to handle the sandbox lifecycle via Docker or Kubernetes runtimes, and communication is standardized via the OpenAPI specification (Sandbox Lifecycle and Execution Specification). OpenSandbox inserts a high-performance Go-based execution daemon (execd) inside every remoted container that works with an inside Jupyter kernel to offer stateful code execution, real-time output streaming through Server-Despatched Occasions (SSE), and complete filesystem administration, making certain a “protocol-first” strategy that’s constant throughout any base container picture.

Core expertise
OpenSandbox is designed to be surroundings unbiased. I help docker regional improvement and Kubernetes For distributed production-grade execution. The platform provides 4 major forms of sandboxes:
- Coding agent: An surroundings optimized for software program improvement duties. Brokers can write, take a look at, and debug code.
- GUI agent: full help VNC desktoppermits the agent to work together with a graphical person interface.
- Executing the code: A high-performance runtime for executing particular scripts or computational duties.
- RL coaching: An remoted surroundings tailor-made for reinforcement studying (RL) workloads allows secure iterative coaching.
On this system, Integration APIThis retains interplay patterns constant whatever the underlying language or runtime. OpenSandbox at present provides the next SDKs: Python, TypeScript, Java/Kotlinand C# and Go It’s listed on the event roadmap.
Integration and ecosystem help
A key characteristic of OpenSandbox is its native compatibility with present AI frameworks and developer instruments. Offering a safe execution layer permits brokers constructed on a wide range of platforms to carry out “actual world” actions. Presently supported integrations embody::
- Mannequin interface: Claude Code, Gemini CLI, and OpenAI Codex.
- Orchestration framework: LangGraph and Google ADK (Agent Improvement Equipment).
- Automation instruments: Chrome and Playwright for browser-based duties.
- Visualization: Full VNC help for visible monitoring and interplay.
Which means that the agent can carry out the duties of “web site scraping and linear regression mannequin coaching” inside a single remoted session. The agent makes use of Playwright to navigate the online, obtain information to the sandbox’s native file system, and run Python code to course of that information. You are able to do all this with out leaving the safe OpenSandbox surroundings.
Set up and configuration
This undertaking prioritizes a streamlined developer expertise (DX). Organising a neighborhood execution server requires three major instructions via the command-line interface.
pip set up opensandbox-server— Set up server parts.opensandbox-server init-config— Generate the configuration information wanted on your surroundings.opensandbox-server— Begins the server and exposes an API for interplay with the agent.
As soon as the server is working, builders can use the offered SDK to create, handle, and terminate sandboxes programmatically. This reduces the overhead of “stitching collectively” a number of instruments for file administration, course of isolation, and community proxies.
Necessary factors
- Built-in language-independent execution: OpenSandbox gives a constant API for AI brokers to run code, browse the online, and work together with GUIs. Though at present supported, Python, TypeScript, Java/KotlinSDK C# and Go It is on the roadmap.
- Infrastructure flexibility (Docker and Kubernetes): The device is designed to scale seamlessly from a developer’s native machine to an enterprise-grade manufacturing surroundings. I’ll make use of it docker native isolation and Kubernetes Remove “environmental drift” generally seen when transferring brokers from improvement to the cloud for distributed, large-scale deployments.
- Broad ecosystem integration: It’s designed to attach on to main AI frameworks and instruments, together with: LangGraph, Claude Code, Gemini CLI, OpenAI Codex, and Google ADKand automation libraries corresponding to playwright and chrome.
- Eliminating “sandbox dependencies”: By offering a free and open supply different based mostly on Apache 2.0 licenseAlibaba eliminates dependence on costly managed sandbox providers that cost by the minute or impose vendor lock-in.
- Excessive-fidelity interactions (VNC and net): OpenSandbox goes past easy script execution. Full VNC desktop and browser automation. This permits brokers to carry out advanced multimodal duties, corresponding to interacting with net interfaces and utilizing desktop functions, inside a safe, “blast-proof” surroundings.
Please examine lipo, document and example. Additionally, be happy to comply with us Twitter Do not forget to affix us 120,000+ ML subreddits and subscribe our newsletter. cling on! Are you on telegram? You can now also participate by telegram.

