Research & Writing
Ideas from the safety frontier.
Technical research, threat analysis, and field notes from our work building foundational AI safety tooling.
analysis
When Guardrails Fail: What Claude Opus 4.6 Reveals About Prompt Injection Risk
Anthropic's Claude Opus 4.6 system card quantifies prompt injection risk at scale for the first time. These numbers should reshape how enterprises deploy AI agents.
research
Hidden in Plain Language: How Calendar Invites Became Data Extraction Tools Through Prompt Injection
A calendar event containing crafted instructions could silently extract your private meeting data when you ask Gemini about your schedule. The attack reveals fundamental gaps in how AI systems handle untrusted inputs.
analysis
When AI Democratization Meets Vulnerability: The Real Cost of No-Code AI Agents
No-code AI platforms promise accessibility. Recent research shows they also introduce security challenges that traditional application-security approaches don't address.