Custom agents

Build an agent for any policy, workflow, or edge case.

When the problem does not fit an off-the-shelf classifier, configure a Cinder agent around it. Same platform, same controls, same evidence. Built around the risk your team actually has.

Overview

Every platform has problems no vendor has seen before

Avatar misuse on a video platform. Romance scams on a dating app. Election interference on a foundation model. Toxicity in a kids' community. The threat surface is specific to your product, and the policy that defends it is yours.

Cinder agents are configurable around that specificity. Wire your policies, your data, and your team's decisions into an agent built for the exact problem you have, and run it on the same platform that already powers your operation.

Capabilities

How custom agents work

  • Configure on top of your policies

    Define the policy, the data the agent should consider, and the actions it can take. No model training degree required.

  • Live training on your team's reviews

    Every reviewer's decision becomes training signal. The agent gets sharper at your problem, not someone else's.

  • External tool use

    Agents can browse the web, hit your APIs, and pull third-party signals when the call requires more context than any single input provides.

  • Backtesting before production

    Run new agents against historical data to see what would have changed (false positives, false negatives, queue impact) before anything ships.

  • Co-built when you want it

    Spin one up yourself, or partner with our services team to design and train it alongside your operators.
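
To make the backtesting idea concrete, here is a minimal Python sketch of comparing a candidate agent's decisions with past reviewer decisions. It is not Cinder's API: `HistoricalCase`, `backtest`, and every field name are hypothetical, reviewer decisions are assumed to be the ground truth, and "queue impact" is approximated as the change in the number of flagged items.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class HistoricalCase:
    """One past item (hypothetical schema, for illustration only)."""
    case_id: str
    reviewer_violation: bool  # what the human reviewer decided
    agent_violation: bool     # what the candidate agent would have decided


def backtest(cases: Iterable[HistoricalCase]) -> dict:
    """Count disagreements between the agent and the reviewer record."""
    total = false_positives = false_negatives = 0
    agent_flagged = reviewer_flagged = 0
    for case in cases:
        total += 1
        agent_flagged += case.agent_violation
        reviewer_flagged += case.reviewer_violation
        if case.agent_violation and not case.reviewer_violation:
            false_positives += 1  # agent flags something reviewers cleared
        elif case.reviewer_violation and not case.agent_violation:
            false_negatives += 1  # agent misses something reviewers actioned
    return {
        "total": total,
        "false_positives": false_positives,
        "false_negatives": false_negatives,
        # Rough proxy for queue impact: extra (or fewer) flagged items.
        "queue_delta": agent_flagged - reviewer_flagged,
    }


if __name__ == "__main__":
    history = [
        HistoricalCase("a", reviewer_violation=True, agent_violation=True),
        HistoricalCase("b", reviewer_violation=False, agent_violation=True),
        HistoricalCase("c", reviewer_violation=True, agent_violation=False),
    ]
    print(backtest(history))
```

A real backtest would likely slice these counts by policy and time window, but the shape is the same: replay history, compare decisions, and read the deltas before anything ships.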

What teams have built

  01. AI red teaming agents

    For foundation model launches and prompt injection coverage.

  02. Avatar policy agents

    For video and synthetic media platforms.

  03. Comment moderation agents

    For community platforms with millions of daily posts.

  04. Counterfeit-specific agents

    For marketplaces with high-value categories.

  05. Romance and grooming agents

    For dating and social platforms.

Cinder provided rigorous adversarial testing that matched our release velocity. Their team found important edge cases and helped us address them before launch.

Ben Brooks, Head of Public Policy, Black Forest Labs

>90%

Reduction in CSAM and NCII vulnerability

10X

Safer than benchmark industry models at launch

3X

More repeat-offender accounts taken down

50%

Of all bans executed by orchestrated workflows
