radar

ONE Sentinel

smart_toyAI/PROMPT ENGINEERING

How we contain Claude across products

sourceSimon Willison
calendar_todayMay 30, 2026
schedule2 min read
lightbulb

EXECUTIVE SUMMARY

Enhancing Security in AI: Anthropic's Sandboxing Techniques for Claude

Summary

Anthropic has published a comprehensive overview of their sandboxing techniques for their AI products, including Claude.ai, Claude Code, and Claude Cowork. This documentation aims to clarify how these products ensure security and limit agent actions through various containment strategies.

Key Points

  • Anthropic's sandboxing techniques are documented for Claude.ai, Claude Code, and Claude Cowork.
  • Security measures include process sandboxes, virtual machines (VMs), filesystem boundaries, and egress controls.
  • The goal is to prevent credential exfiltration by ensuring credentials never enter the sandbox.
  • Claude.ai utilizes gVisor for sandboxing.
  • Claude Code employs Seatbelt on macOS and Bubblewrap on Linux for local execution.
  • Claude Cowork operates within a full VM using Apple's Virtualization framework on macOS and HCS on Windows.
  • The article highlights a previously identified risk related to the api.anthropic.com/v1/files exfiltration vector.
  • The author expresses interest in exploring Anthropic's open-source Anthropic Sandbox Runtime tool further.

Analysis

The significance of this article lies in its detailed exploration of how Anthropic is addressing security concerns in AI development through effective sandboxing techniques. By clearly documenting their methods, they enhance trust in their products and provide a framework that can be emulated by other developers in the AI space.

Conclusion

IT professionals should consider implementing similar sandboxing strategies in their AI applications to enhance security. Reviewing and utilizing tools like the Anthropic Sandbox Runtime could also be beneficial for improving containment measures.