Roleplay Documentation

Roleplay is a paid workbench for running social-engineering tests against AI agents.

Use it to test whether an agent can be manipulated into violating a protected boundary, policy, tool-use rule, or user intent. The core product loop is:

  1. Create a Builder or Team workspace.
  2. Create a project, protected agent, and project API key.
  3. Choose attacker and judge providers for real adaptive runs.
  4. Run attack simulations locally or in CI with the included CLI.
  5. Upload safe finding summaries to the workbench.
  6. Review Evidence, assign the fix, and keep the same check in CI.

Who This Is For

Roleplay is built for teams shipping:

  • customer support agents
  • billing and refund agents
  • internal copilots
  • browser or tool-using agents
  • agent workflows that read untrusted content

Plans

Builder

Builder is about $49/month for solo founders, consultants, and developers testing one or a few AI agents.

Team

Team is about $199/month for teams and agencies that need shared findings, owners, run history, CI uploads, and regression detection.

The CLI is included as the local execution engine for both plans. Real adaptive runs currently require your own provider key and explicit judge mode; mock mode is only a smoke test.
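
The difference between mock mode and a real adaptive run can be pictured as a preflight check. The sketch below is illustrative only: `check_run_config` and its parameter names are hypothetical and are not part of the Roleplay CLI. It simply encodes the rule stated above, assuming a run is described by a mode, a provider key, and a judge mode.

```python
from typing import List, Optional


def check_run_config(
    mode: str,
    provider_key: Optional[str],
    judge_mode: Optional[str],
) -> List[str]:
    """Return a list of problems; an empty list means the run can start.

    Hypothetical helper: the names here are assumptions for illustration,
    not the CLI's real options.
    """
    problems: List[str] = []
    if mode == "mock":
        # Mock mode is only a smoke test, so nothing else is required.
        return problems
    if not provider_key:
        problems.append("real adaptive runs need your own provider key")
    if not judge_mode:
        problems.append("real adaptive runs need an explicit judge mode")
    return problems
```

A check like this fails early with a readable message instead of partway through a run, which matters most in CI where a missing key is a common cause of a red build.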

Privacy Defaults

Roleplay is local-first by default.

  • Full transcript upload: off by default
  • Sanitized finding summaries: on by default for Cloud uploads
  • Redacted snippets: on
  • Secret redaction: on
  • Default evidence retention: plan-based in the workbench
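
As a rough mental model, the defaults above can be read as a set of switches applied before anything leaves your machine. The sketch below is a minimal illustration under stated assumptions: the `PrivacyDefaults` field names, the `build_upload` helper, the finding dictionary shape, and the secret pattern are all hypothetical and do not describe Roleplay's actual schema or redaction rules.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class PrivacyDefaults:
    # Hypothetical field names; the values mirror the defaults listed above.
    upload_full_transcripts: bool = False
    upload_sanitized_summaries: bool = True
    redacted_snippets: bool = True
    secret_redaction: bool = True


# Illustrative pattern only: matches tokens shaped like "sk-" plus 16 or more
# token characters. A real redactor would cover many more secret formats.
SECRET_PATTERN = re.compile(r"\bsk-[A-Za-z0-9_-]{16,}")


def redact(text: str, settings: PrivacyDefaults) -> str:
    """Replace secret-looking tokens when secret redaction is enabled."""
    if settings.secret_redaction:
        return SECRET_PATTERN.sub("[REDACTED]", text)
    return text


def build_upload(finding: dict, settings: PrivacyDefaults = PrivacyDefaults()) -> dict:
    """Assemble the payload that would be uploaded for one finding."""
    payload = {}
    if settings.upload_sanitized_summaries:
        payload["summary"] = redact(finding["summary"], settings)
    if settings.redacted_snippets:
        payload["snippet"] = redact(finding["snippet"], settings)
    if settings.upload_full_transcripts:
        # Off by default, so the transcript stays local unless enabled.
        payload["transcript"] = finding["transcript"]
    return payload
```

With the default settings, a finding that carries a summary, a snippet, and a transcript produces a payload holding only the summary and the redacted snippet; the transcript is included only if full transcript upload is switched on.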

Start Here

Documentation Map

For local execution:

For scenario authors:

For workbench users:

For integration and operations: