Case Study — Kyrah AI

Designing a Reliable
AI Safety System

AI systems fail when decisions are implicit.

Kyrah fixed this by making them explicit:

Detection → Risk assessment → Response selection

Model correctness did not equal user trust.

Role

AI Product Manager

Timeline

8 wks MVP → iterative

Focus

Classification · Guardrails · RAG

The Problem

Early versions of Kyrah exposed a gap that's easy to miss: the model was often correct, but the system response still felt wrong. Harmful patterns were identified — but responses defaulted to generic validation. Similar inputs produced inconsistent tone and guidance. Subtle but serious signals went unescalated.

Users felt acknowledged. But not understood. And more critically, they had no framework to interpret what they were experiencing.

The model was correct. The system still failed the user.

In a safety context, this is not a UX issue — it is a reliability failure. Users could feel validated, but still fail to recognize harmful patterns in their situation. At scale, inconsistent responses can reinforce confusion instead of helping users recognize harmful patterns. In safety contexts, this can delay recognition of harmful situations — which is a system-level failure, not a response issue.

Without this, high-risk and low-risk situations were treated similarly, leading to inconsistent guidance and reduced user trust.

The Key Insight

Improving model accuracy alone did not fix inconsistent behavior. The issue was that the system had no explicit decision layer.

We chose to introduce a structured decision layer between classification and response generation, accepting increased system complexity and latency to ensure predictable behavior in high-risk scenarios.

We shifted from generating responses to designing a system that decides.

System Design

This architecture emerged through iterative failure analysis — each layer added in direct response to a failure mode we identified in real usage. This required aligning engineering and product around explicit system behavior rather than relying on model outputs alone.

System Architecture — Kyrah AI

User Input raw signal

Detection LLM Classification

Decision Layer ← controls system behavior

Response Generation context-aware

Validation Layer guardrails + RAG

Final Output consistent · interpretable

The Decision Layer is where system behavior is explicitly controlled — not inferred. This is what separates a model wrapper from an AI system.

The Key Tradeoff

Every system design involves a tradeoff. This one was deliberate.

We chose

Consistency

Predictable behavior in high-risk scenarios required explicit decision logic, even at the cost of added system complexity.

We gave up

Speed

Additional latency was an acceptable cost. Inconsistent behavior in a safety-critical context was not.

Chose deterministic decision rules over generative flexibility to ensure consistent behavior, even at the cost of additional system complexity and slightly slower responses. In a safety-critical product, inconsistency is more damaging than latency.

Before vs. After

The same user input. Two different system behaviors.

"I feel like he keeps confusing me and I don't know what's real anymore."

Before — model-driven

"That sounds confusing and difficult…"

Generic validation

No pattern surfaced

No interpretive frame

After — decision-driven

"When someone's words and actions don't line up consistently, it can create confusion and make you question your own perception. Over time, that makes it harder to trust what you remember."

Identifies pattern explicitly

Explains the mechanism

Enables user interpretation

Reduces cognitive confusion

The difference is not the model. The difference is whether the system makes a decision before it generates a response.

This changed how users interpreted their situation — not just how they felt about it.

This shifted the product from an empathetic chatbot to a decision-support system users could rely on.

Results

This reduced inconsistent system behavior in production and improved user trust in responses.

97%

Classification accuracy
up from 88%

Internal QA test set across labeled scenarios

<1%

Hallucination rate
via guardrails + RAG

Measured against guardrail violations in structured evaluation runs

+8%

User retention
from improved system behavior

Post-redesign cohort vs. pre-decision-layer baseline

Users didn't just receive better responses — they began to recognize patterns in their situations and act with greater clarity. That is the measurable outcome of designing at the system level.

Reflection

What this taught me

Model correctness does not translate into user trust. Trust comes from consistent system behavior — not isolated correct outputs.

The most valuable thing I learned building Kyrah was the distinction between a model that performs and a system that behaves. That difference only shows up in production, under real-world conditions, with real users who need to act on what they're given.

Most systems break here — not because the model fails, but because the system never defined how it should behave. This project was about making that gap visible — and then building a layer to close it.

Next Case Study →

Food Spy AI — Turning AI Output into Usable Insight

When users don't struggle with the data — they struggle with knowing what to do with it.

Designing a ReliableAI Safety System