Running OpenClaw At Home Inspired Us To Create DefenseClaw

Running OpenClaw at Home Revealed the Critical Need for DefenseClaw

There’s a DGX Spark sitting in my home office running OpenClaw. It’s connected to my phone and my laptop through secure tunnels, and over time, it has become, without exaggeration, the operating system for how my family functions.

Every day, my wife and I rely on it to manage our children’s schedules. I built a small agent skill that pulls up the school lunch menu each morning to remind us what’s for lunch. Another skill tracks tennis match draws and practice sessions. I’ve connected Model Context Protocol (MCP) servers via Zapier to synchronize my email, calendar, and Discord messages. It nudges me about things I might otherwise forget. It holds the context I simply cannot store in my head. OpenClaw has evolved into my deepest thinking partner—the place where half-formed strategy ideas turn into tangible plans before they ever hit a slide deck.

In short, OpenClaw hasn’t just improved my personal productivity—it has fundamentally reshaped how we operate as a family.

And that’s precisely why I feel a profound sense of vulnerability.

The Fastest-Growing Open Source Project Became a Massive Target

OpenClaw didn’t just gain traction—it exploded.

When Peter Steinberger released the first version of what would become OpenClaw in November 2025, it went viral almost overnight. Within days, the project amassed 60,000 GitHub stars, and within months, it had hundreds of thousands. NVIDIA CEO Jensen Huang called it the “operating system for personal AI.” Developers worldwide began building their workflows—and even their lives—around it.

The excitement was warranted. OpenClaw represents a paradigm shift: from AI you interact with, to AI that actively acts on your behalf. It reads files, manages tools, executes shell commands, connects to every messaging platform you use, and even learns to build new capabilities autonomously while you sleep. As one early adopter described it, OpenClaw is the closest thing to Jarvis we’ve ever seen.

But this power comes with enormous risk.

Within three weeks of going viral, OpenClaw became the epicenter of one of the most concentrated security crises in open source history:

CVE-2026-25253 — a critical remote code execution vulnerability where visiting a single malicious webpage could hijack an agent.
135,000+ exposed OpenClaw instances on the public internet, many of which were vulnerable.
ClawHavoc supply chain attack — over 800 malicious skills infiltrated ClawHub, roughly 20% of the registry, distributing infostealers disguised as productivity tools.
Proof-of-concept attacks by researchers — malicious third-party skills demonstrated the ease of prompt injection and data exfiltration without user awareness.
Enterprise and national restrictions — nation-states restricted agencies from running OpenClaw, and enterprises began encountering similar security patterns.

This is not a theoretical concern. These attacks were real, rapid, and highly impactful.

Peter and his team acted with transparency, patching vulnerabilities quickly. Yet the structural reality is undeniable: an agent with full system access, network reach, and a thriving ecosystem of community-contributed skills is an irresistible target. And the people most at risk are the ones like me—deep users who have woven the agent into everything they do.

Bridging the Gap Between “Powerful” and “Safe”

Over the past year, the ecosystem has started to respond to these challenges.

NVIDIA’s recent announcements of NemoClaw and OpenShell at GTC 2026 addressed a critical piece of the puzzle. OpenShell provides the infrastructure-level sandbox that OpenClaw lacked: kernel isolation, deny-by-default network policies, YAML-based policy enforcement, and a privacy router to ensure sensitive data stays local. These controls exist outside the agent itself, making them tamper-resistant.

Building on this foundation, Cisco’s AI Defense team analyzed how malicious skills exploit the OpenClaw trust model—through prompt injection, credential theft, and silent data exfiltration—and released an open-source Skill Scanner to vet installed skills. OpenShell constrains agent actions, while Cisco tools verify and audit behavior.

Yet something remained missing: the operational layer. This is the layer a developer—or a security-conscious family like mine—actually runs day-to-day to keep an agent secure. OpenShell provides the sandbox. Cisco provides the scanning tools. But who manages block lists? Who receives alerts when something goes wrong at 2 AM? Enter DefenseClaw.

Introducing DefenseClaw: Governance for OpenClaw

DefenseClaw is Cisco’s open-source project designed to serve as the agentic governance layer for OpenClaw. It integrates OpenShell’s sandbox capabilities with Cisco’s scanners into a deployable solution that can be set up in under five minutes.

DefenseClaw performs three critical functions:

Pre-execution scanning — Every skill, plugin, or tool is scanned before being installed in the agent environment. The scan engine combines five tools: Skill Scanner, MCP Scanner, A2A Scanner, CodeGuard static analysis, and an AI bill-of-materials generator. Nothing bypasses the admission gate: DefenseClaw checks block/allow lists, generates a manifest, and only then installs the skill.
Runtime threat detection — OpenClaw agents are self-evolving systems. A skill that was safe yesterday might attempt exfiltration tomorrow. DefenseClaw monitors all inbound and outbound agent activity at execution time, scanning every message and system call to detect threats as they occur.
Enforced policy control — DefenseClaw doesn’t offer advisory suggestions; it enforces them. Skills on the block list have their sandbox permissions revoked, files quarantined, and calls blocked. Blocked MCP servers are immediately removed from the network allow-list. All enforcement actions happen in real time—under two seconds—with no restart required.

Observability from Day One

DefenseClaw ensures that every agent is observable from the moment it comes online. Out-of-the-box integration with Splunk means that every scan result, block/allow decision, prompt-response pair, and policy enforcement action is captured as structured telemetry. Observability isn’t an afterthought—it is embedded. The system ensures that if an agent does anything, there is a record. This level of visibility is critical for families, enterprises, and developers alike, providing peace of mind in environments where OpenClaw agents manage sensitive data and critical workflows.

Why DefenseClaw Matters

For users who have integrated OpenClaw deeply into their lives—whether managing a family, running a startup, or operating a critical enterprise system—the difference between power and safety is stark. OpenClaw delivers unprecedented productivity and capability, but without governance, it’s also a target for exploitation.

DefenseClaw fills this gap by making secure deployment accessible, enforceable, and observable. It allows users to safely enjoy the benefits of agentic AI while minimizing the risks that naturally accompany such powerful tools. Families can run AI agents that manage calendars, coordinate schedules, and handle communications without fear. Enterprises can leverage agentic workflows while meeting regulatory and security requirements. Developers can experiment confidently, knowing that scanning and enforcement mechanisms are always active.

OpenClaw represents the future of personal and organizational AI: agents that act autonomously, understand context, and augment human productivity in profound ways. But this future is only sustainable if safety, governance, and observability are baked into the ecosystem from the ground up.

Source link: https://blogs.cisco.com