GenAI Systems Lab Open interactive version →
AI Engineering 14 min read

A Framework for AI System Design Interviews (Staff+ Level)

6-axis characterisation, architecture shape selection, reliability budgets, and how to structure a 45-minute system design answer.

AI system design interviews are different from traditional system design. You're not just designing a scalable service — you're designing a system with probabilistic components, uncertain quality, and failure modes that don't show up in unit tests. Interviewers at staff+ level expect you to handle this difference explicitly.

The 6-axis characterisation (do this first)

Before drawing any boxes, characterise the problem on 6 axes. This forces precision and signals experience:

AxisQuestions to answer
Quality vs. speedWhat's the latency SLA? Can we afford streaming? Does quality trump speed?
ScaleQPS, document count, context length, user count — order of magnitude
Data freshnessDoes knowledge need to be real-time? Daily? How stale is acceptable?
PersonalisationPer-user context? Multi-tenant? Global shared context?
Failure toleranceWhat's the blast radius of a wrong answer? Is hallucination an incident?
Regulatory/compliancePII handling? Data residency? Audit trails?

Choosing your architecture shape

Based on the 6-axis characterisation, you'll land on one of four shapes:

The components every AI system needs

ComponentWhy it mattersCommon mistake
Eval pipelineYou can't measure quality without oneSkipping it until something breaks
ObservabilityYou can't debug what you can't seeOnly logging errors, not quality signals
Fallback strategyLLMs fail — you need a graceful degradationHard-coding one path with no fallback
Rate limitingRunaway agents burn budget fastNo per-user or per-session limits
Human-in-the-loopHigh-consequence actions need approval gatesAutomating actions with blast radius

Structuring your 45-minute answer

The question interviewers are really asking: do you think about AI systems like a production engineer or like someone who's only built demos? Talking about eval pipelines, failure modes, and cost budgets unprompted is the signal that separates principal engineers from senior ones.

Practice system design →: Work through AI system design scenarios in the Systems module with structured feedback.

Try it interactively

GenAI Systems Lab is a free platform for AI engineers — configure real failure modes, break things, and build the judgment that gets you hired.

Open GenAI Systems Lab →