Role Separation Architecture

The single-assistant fallacy

The default shape of an LLM product is one model, one chat window, one identity. This works for casual use and breaks under sustained partnership. A single context window cannot simultaneously be:

A debugger that reads code with no emotional inflection.
A reviewer that pushes back on bad reasoning.
A planner that maintains long-horizon goals.
An emotional-support assistant that listens without solving.

These roles have incompatible priors. A debugger that softens its tone produces wrong output. A support assistant that runs root-cause analysis produces the wrong relationship. Forcing one model to context-switch between them costs tokens, costs latency, and — most importantly — costs the user's mental model of "who am I talking to right now."

The partition

A partner system should be partitioned into named roles, each with:

Its own system prompt (no overlap in instructions).
Its own memory scope (no shared context unless explicitly bridged).
Its own access boundary (which tools, which files, which sub-agents it can invoke).
A stable user-facing identity, so the human knows which mental mode to be in.

The number of roles is small. Two to four is typical. More than four becomes routing overhead.

A worked example

A reasonable partition for an individual user's personal partner system:

Role	Function	Memory	Tone
Analyst	Technical work, code, finance, planning	Project state, work logs, decisions	Direct, terse, falsifiable
Companion	Emotional support, daily life, low-stakes conversation	Personal context, mood patterns, relationship history	Warm, listening, non-prescriptive
Archivist	Memory write/read, log indexing, retrieval	All of the above as a read-only index	Mechanical, no opinions

The Analyst never plays Companion. The Companion never gives diagnostic advice. The Archivist never expresses preferences. When a request crosses a boundary, the active role hands off explicitly rather than expanding its own scope.

Why this is hard

The temptation is to merge roles "for convenience." It is always cheaper in the short term — one fewer prompt, one fewer subscription, one fewer context to manage. It always degrades the system in the medium term, because the merged model regresses to the mean of its training distribution, which is the bland general assistant.

The other temptation is to let a single role accumulate scope until it is a general assistant in everything but name. This is prevented by writing role boundaries down explicitly, in a place the operator re-reads regularly. Roles drift; the constraint document does not.

Implementation notes

Routing. A thin top-level dispatcher reads the user's input and picks the role. Keep it dumb — keyword and intent matching is sufficient; do not let it think.
Cross-role bridges. When the Analyst needs context the Companion has (or vice versa), the bridge is a deliberate read from the Archivist, not a context dump.
Persona stability. Each role has a name, a voice, a refusal pattern. These do not change between sessions. Stability is what makes the partition feel like a real distinction instead of UI dressing.

What this is not

This is not multi-agent orchestration in the sense of an autonomous swarm. The roles do not act without the human; they wait. Role separation is for a human in the loop who wants different cognitive modes to be available without context-switching costs, not for autonomous task completion.