Blog
Thoughts, updates, and insights from the Superagent team.
Frontier models miss 57% of threats in agent context
We ran 485 real artifacts through Claude 4.6 Opus with a security-focused system prompt. The model missed 57% of the threats brin had already identified. Here's the full breakdown.
The Cline Incidents and the Broken Security Model
Two Cline security incidents in two months expose the same underlying problem: AI agents treat untrusted content as instructions. The npm supply-chain and prompt-injection attacks reveal why the current security model is fundamentally broken.
Launching brin.sh — the universal allowlist for agents
brin pre-scans packages, MCP servers, repositories, skills, web pages, and contributors for malware, prompt injection, and supply chain attacks. One GET request, no auth, no SDK.
What Can Go Wrong with AI Agents
AI agents fail in ways traditional software doesn't: data leaks, compliance violations, unauthorized actions. Here's what to watch for.
We Bypassed Grok Imagine's NSFW Filters With Artistic Framing
Text-to-image safety is broken. Using basic compositional tricks, we generated explicit content depicting a real person. Here's what we found, why it worked, and what it means for AI safety systems.
AI Code Sandbox Benchmark 2026: Modal vs E2B vs Daytona vs Cloudflare vs Vercel vs Beam vs Blaxel
We evaluate seven leading AI code sandbox providers across developer experience and pricing to help you choose the right environment for executing AI-generated code.