Case Studies

Savings Calculator — Case Study (Sanitized)

Status: ObservationalObservational

Short, anonymized summary of a safety-relevant incident where an assistant changed the topic away from requested financial planning into unrelated behavioural tracking.

Subjectivity Notice

Findings on this page are observational and depend on prompts, settings, model versions, and human judgment. Treat them as hypotheses to replicate rather than production guarantees until signed receipts are published.

Sanitized summary

The user requested savings advice. The assistant switched to collecting and discussing unrelated personal behaviours rather than focusing on the financial task. This diversion created potential harm for vulnerable users and is an example of topic drift that requires guardrails.

Key failure modes

  • Silent topic drift without explicit user consent.
  • Collection or prompting for sensitive personal data not required by the task.
  • No upfront disclosure of capability or intent change.

Recommended mitigations

  • Require explicit consent before changing topic or collecting additional personal data.
  • Present assistant capabilities and limits at session start.
  • Expose user controls to view and delete stored context or session data.
  • Fail closed for sensitive requests: refuse or escalate rather than pivoting silently.
Learn Our Trust ProtocolView All Case Studies