PolicyAI

Composable, conflict-aware policies for reliable agents

The Problem: LLMs Can't Handle Conflicting Instructions

When building agents, you quickly discover that large language models have subtle biases that make structured outputs unreliable:

Frequency bias: If "priority": "low" appears 10× more often than "priority": "high" in your prompts, the LLM will default to "low" even when "high" is correct.
Key name bias: The names of your JSON fields influence the model's outputs in unexpected ways (for example, a field named "priority").
Context leakage: Mention "the building is on fire" in your prompt and watch the LLM assign high priority to unrelated low-priority messages.
Conflicting instructions: Tell the LLM to prioritize messages from Alice and deprioritize messages from Bob—then send a message from both. The LLM won't report a conflict; it will silently pick one instruction to follow.

These aren't edge cases. They're fundamental limitations of how LLMs process instructions. Your agent appears to work in testing, then fails unpredictably in production.

What PolicyAI Does

PolicyAI provides a layer on top of structured outputs that makes agent behavior composable and conflict-aware:

Each policy is independent: Write and test policies in isolation, then compose them.
Conflicts are detected: When instructions conflict, you get an error instead of silent bias.
Conflict resolution is explicit: Choose how to handle conflicts (agreement required, largest value wins, or default).
Monotonic overrides: Use the "largest value" strategy to make important values "sticky"—once set high, they stay high.

PolicyAI trades latency and cost for reliability and debuggability. If you're building production agents where correctness matters, that's a trade worth making.

Show Me the Problem

Here's what happens with vanilla structured outputs when you have conflicting policies:

# Your agent instructions
"""
- When Alice sends a message, set priority to HIGH
- When Bob sends a message, set priority to LOW
"""

# Message from: alice@example.com, bob@example.com
# LLM output: {"priority": "LOW"}
# The instructions conflict, so neither value is clearly right, yet the model picked one silently. No error. No warning.

With PolicyAI, this scenario produces a conflict error because two policies disagree on the priority field's value, and you've configured it to require agreement.
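To make "require agreement" concrete, here is a minimal sketch in plain Rust that does not use the PolicyAI crate. The names `Priority`, `Conflict`, and `resolve_agreement` are hypothetical and chosen only for illustration; PolicyAI's real types and error values may look different.

```rust
// Hypothetical illustration; these types are NOT part of the policyai crate.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Priority {
    Low,
    High,
}

/// Returned when two policies propose different values for the same field.
#[derive(Debug, PartialEq, Eq)]
pub struct Conflict {
    pub field: &'static str,
    pub values: Vec<Priority>,
}

/// Require every proposed value to match; otherwise report the disagreement.
/// Returns Ok(None) when no policy proposed a value at all.
pub fn resolve_agreement(
    field: &'static str,
    proposals: &[Priority],
) -> Result<Option<Priority>, Conflict> {
    let first = match proposals.first() {
        Some(&p) => p,
        None => return Ok(None),
    };
    if proposals.iter().all(|&p| p == first) {
        Ok(Some(first))
    } else {
        // Surface every proposed value so the caller can see who disagreed.
        Err(Conflict {
            field,
            values: proposals.to_vec(),
        })
    }
}
```

In the Alice and Bob scenario, the Alice rule proposes `Priority::High` and the Bob rule proposes `Priority::Low`, so `resolve_agreement` returns an `Err` carrying both values instead of quietly choosing one.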

How PolicyAI Works

1. Define a PolicyType

A PolicyType is like a schema, but with conflict resolution strategies:

use policyai::{PolicyType, Field, OnConflict};

let policy_type = PolicyType {
    name: "EmailPolicy".to_string(),
    fields: vec![
        Field::Bool {
            name: "unread".to_string(),
            default: true,
            on_conflict: OnConflict::Default,
        },
        Field::StringEnum {
            name: "priority".to_string(),
            values: vec!["low".to_string(), "medium".to_string(), "high".to_string()],
            default: None,
            on_conflict: OnConflict::LargestValue,  // "high" wins over "low"
        },
        Field::StringArray {
            name: "labels".to_string(),
        },
    ],
};

2. Create Policies with Semantic Injections

A semantic injection is a natural language instruction that generates structured actions:

let policy1 = policy_type
    .with_semantic_injection(
        &client,
        "If the email is about football, mark \"unread\" false with low \"priority\""
    )
    .await?;

let policy2 = policy_type
    .with_semantic_injection(
        &client,
        "If the email is from mom@example.org, set high \"priority\" and add Family \"label\""
    )
    .await?;

let policy3 = policy_type
    .with_semantic_injection(
        &client,
        "If the email is about shopping, add Shopping \"label\""
    )
    .await?;

3. Compose and Apply

let mut manager = Manager::default();
manager.add(policy1);
manager.add(policy2);
manager.add(policy3);

let report = manager.apply(
    &client,
    template,
    "From: mom@example.org\nSubject: Shopping for football gear",
    None
).await?;

// Result: unread=false, priority=high, labels=["Family", "Shopping"]
// - policy1 sets unread=false, priority=low
// - policy2 sets priority=high (wins via LargestValue)
// - policy3 adds Shopping label
// - labels compose (arrays merge)

The policies compose cleanly because:

priority uses OnConflict::LargestValue → "high" overrides "low"
labels is an array → values merge automatically
unread uses OnConflict::Default → only policy1 sets it in this example, so no conflict arises and its value (false) is used
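The merge described above can be sketched in plain Rust. This is an illustrative model only: `Action` and `compose` are hypothetical names, not PolicyAI's types, and treating `unread` as last-writer-wins is an assumption taken from the description of the Default strategy later in this document.

```rust
// Hypothetical model of per-policy outputs; NOT the policyai crate's types.
// Deriving Ord on the enum gives Low < Medium < High from declaration order.
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
pub enum Priority {
    Low,
    Medium,
    High,
}

/// What a single policy asks for; None means "this policy has no opinion".
#[derive(Debug, Clone, Default, PartialEq)]
pub struct Action {
    pub unread: Option<bool>,
    pub priority: Option<Priority>,
    pub labels: Vec<String>,
}

/// Combine the actions of every policy that fired, in order.
pub fn compose(actions: &[Action]) -> Action {
    let mut out = Action::default();
    for a in actions {
        // unread: last writer wins (assumed behavior for this sketch).
        if a.unread.is_some() {
            out.unread = a.unread;
        }
        // priority: largest value wins; None sorts below every Some(_).
        out.priority = out.priority.max(a.priority);
        // labels: arrays append.
        out.labels.extend(a.labels.iter().cloned());
    }
    out
}
```

Feeding in the three actions from the example (unread=false with low priority, high priority with the Family label, and the Shopping label) yields unread=false, priority=High, and labels Family then Shopping, which matches the result shown above.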

Conflict Resolution Strategies

PolicyAI provides three strategies for handling conflicts:

Agreement

All policies must agree on the value, or you get a conflict error. Best for fields where inconsistency indicates a logic error in your policies.

Field::String {
    name: "template".to_string(),
    default: None,
    on_conflict: OnConflict::Agreement,
}

LargestValue

The largest value wins. This makes important values "sticky" and enables monotonic overrides:

For bools: true > false
For numbers: 10 > 5
For strings: longer strings win
For enums: values later in the list win

Field::StringEnum {
    name: "priority".to_string(),
    values: vec!["low".to_string(), "medium".to_string(), "high".to_string()],
    default: None,  // same field set as the EmailPolicy definition above
    on_conflict: OnConflict::LargestValue,  // "high" > "medium" > "low"
}

Why this matters: Once a policy sets priority to "high", no other policy can downgrade it to "low". This prevents surprising interactions between policies.
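The four ordering rules can be written down as a small comparison function. The sketch below is a plain-Rust illustration under stated assumptions: `Value`, `largest`, and `largest_of` are hypothetical names, string length is measured in bytes, and ties between equal-length strings keep the first value. None of this is taken from PolicyAI's implementation.

```rust
// Hypothetical value model for illustration; NOT the policyai crate's types.
#[derive(Debug, Clone, PartialEq)]
pub enum Value {
    Bool(bool),
    Number(f64),
    Text(String),
    /// Index into the field's declared list of enum values.
    EnumIndex(usize),
}

/// Pick the larger of two values of the same kind.
/// Returns None when the kinds differ, since the rules do not cover that case.
pub fn largest(a: Value, b: Value) -> Option<Value> {
    match (a, b) {
        // true > false, so the result is true if either side is true.
        (Value::Bool(x), Value::Bool(y)) => Some(Value::Bool(x || y)),
        (Value::Number(x), Value::Number(y)) => Some(Value::Number(x.max(y))),
        // Longer string wins; byte length and first-wins-on-tie are assumptions.
        (Value::Text(x), Value::Text(y)) => {
            if y.len() > x.len() {
                Some(Value::Text(y))
            } else {
                Some(Value::Text(x))
            }
        }
        // Values later in the declared list win.
        (Value::EnumIndex(x), Value::EnumIndex(y)) => Some(Value::EnumIndex(x.max(y))),
        _ => None,
    }
}

/// Fold a sequence of proposals; adding more proposals never lowers the result.
pub fn largest_of(values: Vec<Value>) -> Option<Value> {
    let mut iter = values.into_iter();
    let first = iter.next()?;
    iter.try_fold(first, largest)
}
```

Because each step keeps the maximum, the order in which policies are applied does not change the outcome for bools, numbers, and enums, which is what makes the override monotonic.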

Default

Use the field type's default behavior (usually last-writer-wins, but arrays append) when conflicts occur. Useful for fields where you want predictable behavior regardless of policy interactions.

Field::Bool {
    name: "unread".to_string(),
    default: true,
    on_conflict: OnConflict::Default,
}

PolicyType Syntax

PolicyAI provides a concise syntax for defining policy types:

type policyai::EmailPolicy {
    unread: bool = true,
    priority: ["low", "medium", "high"] @ highest wins,
    category: ["ai", "distributed systems", "other"] @ agreement = "other",
    template: string @ agreement,
    labels: [string],
}

You can parse this syntax directly:

let policy_type = PolicyType::parse(r#"
    type EmailPolicy {
        priority: ["low", "high"] @ highest wins,
        labels: [string]
    }
"#)?;

Use Cases for Agents

PolicyAI excels when your agent needs to:

Triage emails or notifications: Apply multiple categorization rules that may interact
Process RSS feeds: Extract structured metadata from articles using composable rules
Label documents: Assign categories, tags, priorities based on content
Extract metadata: Any scenario where discrete documents need structured descriptors

PolicyAI is not the right tool when:

You have a single, simple classification task
Latency is more important than correctness
Your policies never interact or conflict

Scaling with Policy Retrieval

For production agents with large policy sets, you don't want to apply every policy to every input. Use vector retrieval to select relevant policies dynamically:

Pattern: PolicyAI + Chroma

(This example is illustrative and will likely need changes before it compiles, because Claude hallucinated some of it.)

use chromadb::{ChromaClient, Collection};
use policyai::{Policy, Manager};

async fn process_with_retrieval(
    client: &Anthropic,
    chroma: &Collection,
    input: &str,
) -> Result<Report, Box<dyn std::error::Error>> {
    // 1. Retrieve relevant policies from vector database
    let results = chroma.query(
        vec![input.to_string()],
        5,  // top 5 most relevant policies
        None,
        None,
        None,
    ).await?;

    // 2. Deserialize policies from metadata
    let policies: Vec<Policy> = results.metadatas
        .into_iter()
        .flatten()
        .filter_map(|meta| {
            serde_json::from_value(meta.get("policy")?.clone()).ok()
        })
        .collect();

    // 3. Apply only relevant policies
    let mut manager = Manager::default();
    for policy in policies {
        manager.add(policy);
    }

    let report = manager.apply(client, template, input, None).await?;
    Ok(report)
}
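Independent of any particular vector database, the retrieval step amounts to ranking stored policies by embedding similarity and keeping the top k. The sketch below shows that idea with an in-memory list and cosine similarity. `StoredPolicy`, `cosine`, and `top_k` are hypothetical names, and the embeddings are assumed to be precomputed elsewhere; this is not Chroma's or PolicyAI's API.

```rust
// Hypothetical in-memory stand-in for a vector database; NOT Chroma's or PolicyAI's API.
#[derive(Debug, Clone)]
pub struct StoredPolicy {
    /// The semantic injection text that was embedded.
    pub prompt: String,
    /// Precomputed embedding of `prompt` (assumed to come from some embedding model).
    pub embedding: Vec<f32>,
}

/// Cosine similarity of two vectors; returns 0.0 if either has zero length.
pub fn cosine(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let na = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let nb = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if na == 0.0 || nb == 0.0 {
        0.0
    } else {
        dot / (na * nb)
    }
}

/// Return up to `k` stored policies, most similar to `query` first.
pub fn top_k<'a>(query: &[f32], store: &'a [StoredPolicy], k: usize) -> Vec<&'a StoredPolicy> {
    let mut scored: Vec<(f32, &StoredPolicy)> = store
        .iter()
        .map(|p| (cosine(query, &p.embedding), p))
        .collect();
    // Sort descending by similarity score.
    scored.sort_by(|a, b| b.0.total_cmp(&a.0));
    scored.into_iter().take(k).map(|(_, p)| p).collect()
}
```

Only the policies returned by `top_k` would then be added to the manager, so the number of policies applied per input stays at k regardless of how many are stored.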

Storing Policies for Retrieval

When adding policies to your vector database, store both the semantic injection and the full policy:

// Create policy
let policy = policy_type
    .with_semantic_injection(
        &client,
        "If email from VIP, set high priority"
    )
    .await?;

// Store in Chroma with embedding of the semantic injection
chroma.add(
    vec![Uuid::new_v4().to_string()],  // id
    vec![policy.prompt.clone()],         // text to embed
    Some(vec![serde_json::json!({
        "policy": policy,
        "type": "email_triage",
    })]),
    None,
).await?;

Why This Works

Semantic injections are natural language: Vector databases embed them naturally
Retrieval filters noise: Only relevant policies are applied, reducing conflicts
Scalable: The policy store can grow to thousands of entries because only the top-k retrieved policies are applied to each input
Dynamic: Add/update policies without redeploying your agent

Retrieval Best Practices

Embed the semantic injection, not the action: The natural language prompt (policy.prompt) captures intent
Store full policy in metadata: Retrieve the complete Policy object for application
Use top-k = 3-10: Start small; more policies = more potential conflicts
Monitor conflict rates: If retrieval pulls conflicting policies, tune your embeddings or increase specificity
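Monitoring conflict rates needs very little machinery. The following is a minimal sketch of a counter you could update after each application; `ConflictStats` and its methods are hypothetical names, and how you detect that a given application ended in a conflict depends on what PolicyAI returns, which is not shown here.

```rust
// Hypothetical bookkeeping helper; NOT part of the policyai crate.
#[derive(Debug, Default, Clone, Copy, PartialEq, Eq)]
pub struct ConflictStats {
    pub applied: u64,
    pub conflicted: u64,
}

impl ConflictStats {
    /// Record one policy application and whether it reported a conflict.
    pub fn record(&mut self, had_conflict: bool) {
        self.applied += 1;
        if had_conflict {
            self.conflicted += 1;
        }
    }

    /// Fraction of recorded applications that ended in a conflict (0.0 when empty).
    pub fn rate(&self) -> f64 {
        if self.applied == 0 {
            0.0
        } else {
            self.conflicted as f64 / self.applied as f64
        }
    }

    /// True when the conflict rate is strictly above `threshold`.
    pub fn exceeds(&self, threshold: f64) -> bool {
        self.rate() > threshold
    }
}
```

A rate that climbs as you raise top-k is a hint that retrieval is pulling in policies that were never meant to apply to the same input.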

This pattern combines the best of both worlds:

Vector retrieval for relevance and scale
PolicyAI for correctness and composability

Tradeoffs

PolicyAI sacrifices performance for reliability:

Metric	vs Vanilla Structured Outputs
Latency	Higher (additional LLM calls)
Token Usage	Higher (policy composition)
Cost	Higher (more tokens)
Reliability	Much higher (conflict detection)
Debuggability	Much higher (isolated policies)

Why it's worth it: In production agents, silent failures are expensive. PolicyAI makes agent behavior predictable and testable. You can verify each policy independently, then compose them with some confidence.

Getting Started

Add PolicyAI to your Cargo.toml:

[dependencies]
policyai = "0.3"

Basic usage:

use policyai::{PolicyType, Field, OnConflict, Manager};
use claudius::Anthropic;

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    let client = Anthropic::new(None)?;

    // Define your policy type
    let policy_type = PolicyType::parse(r#"
        type MyPolicy {
            priority: ["low", "high"] @ highest wins
        }
    "#)?;

    // Create a policy from natural language
    let policy = policy_type
        .with_semantic_injection(&client, "Set high priority for urgent messages")
        .await?;

    // Apply it (note: `template` is not defined in this snippet; supply it before calling apply)
    let mut manager = Manager::default();
    manager.add(policy);

    let report = manager.apply(
        &client,
        template,
        "This is urgent!",
        None
    ).await?;

    println!("{}", report.value());
    Ok(())
}

Tools

PolicyAI includes tools for testing and debugging:

policyai-verify-policies: Verify policies are well-formed
policyai-regression-report: Generate reports on policy behavior
policyai-extract-regressions: Extract failing cases for analysis
policyai-regressions-to-examples: Convert regressions to test examples

Implementation Note

PolicyAI deliberately keeps the order of arguments in tool calls consistent. Agents are surprisingly sensitive to argument order, so the framework fixes the ordering to avoid introducing bias.
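As an illustration of what "consistent ordering" can mean in practice, the sketch below emits arguments in the order fields are declared in a schema, regardless of the order in which they were collected. `ordered_args` is a hypothetical helper written for this example; it does not describe how PolicyAI implements ordering internally.

```rust
use std::collections::HashMap;

/// Hypothetical helper, NOT PolicyAI's implementation: return the arguments
/// that are present, always in the order the schema declares their fields.
pub fn ordered_args<'a>(
    schema_order: &[&'a str],
    args: &'a HashMap<String, String>,
) -> Vec<(&'a str, &'a str)> {
    schema_order
        .iter()
        // Skip fields with no argument; keep the rest in schema order.
        .filter_map(|&name| args.get(name).map(|value| (name, value.as_str())))
        .collect()
}
```

A `HashMap` iterates in an unspecified order, so driving the output from the schema's field list is what keeps the rendered tool call identical from run to run.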

Current Status

Model support: Anthropic Claude only (currently)
License: Apache-2.0
Status: Active development

Examples

See the examples/ directory for:

Generating semantic injections
Creating test data
Evaluating policies

Contributing

Issues and pull requests welcome at https://github.com/rescrv/policyai

License

Apache-2.0

policyai 0.3.0