GenAI Systems Lab Open interactive version →
AI Engineering 9 min read

How to Answer 'How Do Agents Work?' in a Technical Screen

ReAct, tool use, memory, multi-agent orchestration — what a senior interviewer wants to hear, what separates a junior answer from a strong one, and a worked example of walking through an agent architecture under time pressure.

Technical screens for ML engineering roles increasingly include agent questions — 'explain how you'd build an agent to do X' or 'how does the ReAct pattern work?' Most candidates know the buzzwords. Senior interviewers test whether you understand the failure modes.

The baseline answer most candidates give

'An agent is an LLM that can use tools. It reasons about what to do, calls a tool, gets back a result, and uses that result in the next step.' This is correct but shallow. It won't differentiate you.

The ReAct loop — show you know the mechanism

ReAct (Yao et al., 2022) is the foundational agent pattern. The loop is: Thought → Action → Observation, repeated until the agent produces a final answer.

The key insight in ReAct is that the Thought step externalizes the model's reasoning into the context window, where it can be inspected, traced, and used in subsequent steps. This is what makes agents debuggable.

Tool design — where most candidates are weak

Good interviewers probe tool design. The questions to answer for each tool: what are its side effects? Is it idempotent? What's the error contract? How does the agent recover from tool failure?

Memory architecture

For complex agents, mention the 6 memory types: in-context (what's in the current prompt), episodic (past interaction summaries), semantic (facts about the user/domain), procedural (how to perform tasks), working memory (scratch space during a task), and external (databases, vector stores). Most production agents use in-context + external. Episodic and semantic matter for personalized agents.

Multi-agent patterns

If the question involves complex workflows, mention orchestration patterns: supervisor (one agent delegates to specialized subagents), pipeline (agents in sequence, each transforms the output), and mesh (agents communicate peer-to-peer for collaborative tasks). Each has different failure semantics — a supervisor failure is catastrophic, a pipeline failure is recoverable if intermediate state is persisted.

Failure modes — what separates senior answers

Interactive lab:

Try it interactively

GenAI Systems Lab is a free platform for AI engineers — configure real failure modes, break things, and build the judgment that gets you hired.

Open GenAI Systems Lab →