Sandboxing

Give the agent a playground, not your production environment.

The simple explanation

Imagine hiring a new intern and giving them full admin access to your production servers on day one. Probably not a great idea. Instead, you'd give them a test environment where they can experiment, make mistakes, and learn without any real consequences. That's sandboxing.

Sandboxing means running an AI agent in an isolated environment - a container, a virtual machine, a separate branch, or a disposable workspace - where it can read files, write code, run commands, and even break things without affecting your real systems.

If the agent accidentally deletes a directory, corrupts a config file, or installs a bad dependency, it only happens in the sandbox. Your real codebase stays untouched.

Why it matters for agentic engineering

Agents are capable but imperfect. They can misinterpret instructions, hallucinate solutions, or execute commands that have unintended side effects. The more autonomy you give an agent, the more important sandboxing becomes.

Sandboxing creates a safe space for agents to be autonomous. Without it, every action the agent takes carries real risk, which means you need to review and approve every command. With sandboxing, you can let agents experiment freely and only review the final result - dramatically speeding up workflows.

In practice

Git worktrees: Create an isolated copy of the repo for the agent to work in. If the result is good, merge it. If not, delete the worktree
Containers: Run agents inside Docker containers with limited network access and filesystem isolation
Cloud sandboxes: Services that provide ephemeral dev environments (like GitHub Codespaces or Gitpod) where agents can run freely
Branch-based isolation: Agents work on separate branches. CI runs verify the changes before any human reviews them
Network restrictions: Sandbox the agent's network access so it can't make external API calls, send emails, or access production services
Permission scoping: Even without full sandboxing, limiting file access (read-only for most directories, write access only to specific paths) provides meaningful isolation

The gold standard is treating agent output like an untrusted PR - let it work in isolation, review the result, and merge only after verification.

← Back to Glossary

Sandboxing

The simple explanation

Why it matters for agentic engineering

In practice

Related Terms