GenAI Systems Lab Open interactive version →
AI Engineering 12 min read

The Research Engineer Interview: Paper Implementation, Research Taste, Experimental Design, Open Problems

Four rounds, all different from MLE: implement a paper contribution from scratch in 45 minutes, critique an evaluation setup, design a rigorous experiment, and name an important open problem with a research direction. What each round tests and how to prepare.

The Research Engineer Interview: What It Actually Tests

Research Engineer is the most misunderstood role in AI. Candidates prepare like it's a senior MLE role with a paper reading component. It's not. The interview probes a specific combination: deep mathematical understanding, the ability to implement a paper from scratch under time pressure, research taste to critique methodology, and enough engineering rigor to make research code reliable in production.

How It Differs from MLE and SWE Interviews

Round 1: Implement the Paper

You're given a 2-3 page excerpt from a paper — just the method section, not the results. You have 45-60 minutes to implement the core contribution in Python/NumPy. No framework. Tests provided.

Round 2: Research Taste and Critique

You're handed a paper — sometimes a real published paper, sometimes a fabricated one with deliberate flaws — and asked: 'What do you think of this?' The answer they want is not a summary. It's a structured critique.

Round 3: Experimental Design

Given a research hypothesis, design the experiment that would test it rigorously. This is the inverse of the paper critique round — instead of finding what's missing, you're designing from scratch.

Round 4: Open Problems

'What do you think is the most important unsolved problem in [retrieval / alignment / efficient inference / multimodal learning]?' This round has no right answer. It's testing whether you've thought seriously about the field.

The answer structure that works: name the problem, say why current approaches fail to solve it (be specific — name the papers), say what a solution would look like and what would need to be true for it to exist, name the experiment you'd run if you had 3 months. The worst answer: names a problem and immediately says 'it's really hard.' That tells them nothing about how you think.

What to Build to Prepare

Try it interactively

GenAI Systems Lab is a free platform for AI engineers — configure real failure modes, break things, and build the judgment that gets you hired.

Open GenAI Systems Lab →