reStrategy article

State, Not Time: A Reference Architecture for Agentic Operational Context Management

State, Not Time: A Reference Architecture for Agentic Operational Context Management

The Problem

A person managing four or more parallel workloads — products, customers, teams, delivery commitments — will eventually fail at one of them. Not from lack of effort, but from a structural problem: they are acting as the integration layer between disconnected systems.

Email arrives in one place. Project state lives in another. Documents are versioned in a third. Decisions made in a meeting exist only in the memory of whoever attended. When the volume of active workstreams exceeds what one person can hold in working memory, pieces drop.

The failure mode is specific: context loss at handoff points. A customer thread that needed a reply three days ago was seen, mentally flagged as "needs action," and then overwritten by twelve other things. A delivery workstream that was slipping went unnoticed for a week because the last status check required actively going to look for it. A decision made in a call three weeks ago is now irrecoverable because nobody captured it, and the person who remembered it has moved on.

This is not a productivity problem. It is a state management problem. And state management problems have engineering solutions.

The real constraint in modern operations is not time - it is the system’s ability to preserve, update, and surface state when decisions need to be made.


Why Existing Tools Don't Solve It

Task managers, dashboards, and inbox tools share one architectural flaw: they are retrieval systems. They require the human to initiate contact — open the app, run the search, check the board. In a low-workload environment that friction is acceptable. In a high-workload environment, the act of retrieval competes with the work itself, and retrieval loses.

The second flaw is fragmentation. No single tool holds complete context for any given workstream. A project's current status is in a knowledge base. The last conversation with a customer about it is in email. The decision that changed the direction is in a Slack thread from six weeks ago. The document that captures the current spec is in Google Drive. A person moving between these tools to reconstruct context before a call is performing integration work that should be done by a system.

The third flaw is passivity. Existing tools surface information when you go looking. They do not monitor for signals, synthesise context, and deliver a briefing at the moment it is needed. The cognitive overhead of knowing when to go looking — and then actually doing it — falls entirely on the operator.


The Architectural Reframe

The shift required is from pull to push, and from human-as-integrator to agent-as-integrator.

In a pull architecture, the human queries systems. In a push architecture, agents monitor systems and deliver synthesised context to the human at the right moment, on the right surface, in the right form. The human's role changes: from retrieval and assembly, to decision and action.

This is not a chatbot. A chatbot responds when queried. An agentic system acts on triggers — scheduled, event-driven, or threshold-based — without waiting to be asked.

agent-orches-arch.pngThe architecture has four distinct layers. Each layer has a single responsibility. Mixing responsibilities across layers is where most implementations break down.

Layer 1: The State Layer

All durable context lives here. Project states, customer interaction histories, open decisions, next actions, meeting notes. Notion is suited to this role because it supports structured data, free-form content, and relational linking in the same system. The state layer is not a filing system — it is a live record of current reality across every active workstream.

The state layer must be maintained with discipline. Agents can assist with capture (see the Decision Log Agent pattern below), but the initial decision about what gets recorded and in what structure is a human architecture decision. The quality of every agent's output is bounded by the quality of the state it reads from.

Layer 2: The Document Layer

Version-controlled documents, proposals, contracts, and collaborative drafts live here. Google Drive with Google Docs is the natural fit for teams on Google Workspace. This layer connects to the intelligence layer via API: an agent can retrieve, summarise, or review any document without the operator switching context.

Layer 3: The Communication Layer

All agent outputs route to one surface. Slack is the right choice for organisations already using it — it is where the team already operates. The critical design decision: Slack becomes the operator's single point of action, not just a messaging tool. Notifications, briefings, draft replies, and flagged items all arrive here. Where possible, actions — approve, respond, escalate — are taken from within Slack without opening a second system.

This single-surface principle prevents notification fragmentation. If agent outputs route to email, Slack, a mobile app, and a dashboard simultaneously, the operator is back to managing multiple contexts.

Layer 4: The Intelligence Layer

The LLM (large language model) sits here. It is not an agent — it is the reasoning engine that powers every agent. It reads from the state and document layers, generates outputs (briefings, draft replies, summaries, log entries), and routes results to the communication layer. It is stateless between runs; all persistent state lives in Layer 1.

This separation — intelligence stateless, state durable — is what makes the architecture reliable. The LLM does not need to remember. The state layer does.


The Five Agent Patterns

agent-orches-agents.pngEach agent is defined by four things: a trigger, a data source, an action, and an output surface.

Agent 1: Inbox Triage

The operator reads, adjusts if needed, and sends. The agent eliminates the reconnaissance phase of email management — the twenty-minute morning pass to determine what needs attention.

Agent 2: Context Briefing

The operator walks into every call with the current state already assembled. No pre-call context reconstruction required.

Agent 3: Workload Tracker

Status comes to the operator. The operator does not go looking for status.

Agent 4: Follow-Up

Every open loop has a defined close date. The agent holds the loop; the operator closes it.

Agent 5: Decision Log

When context is needed weeks later, it is retrievable without reconstructing it from memory or email threads.


Integration Topology

The connections between layers are managed via MCP (Model Context Protocol — an open standard for connecting LLMs to external tools and data sources). MCP connectors exist for Notion, Google Workspace, and Slack. The intelligence layer can read from and write to all three without custom API integration work.

The data flow for a typical agent run:

  1. Trigger fires (event-driven or scheduled)

  2. Agent invokes the intelligence layer with a defined prompt and the relevant MCP tools enabled

  3. Intelligence layer retrieves context from the state layer and/or document layer via MCP

  4. Intelligence layer generates the output — briefing, draft, alert, or log entry

  5. Output routes to Slack via the Slack MCP connector

  6. If the run includes a state update (decision captured, commitment logged), the intelligence layer writes back to Notion via the Notion MCP connector

No custom webhook infrastructure is required. MCP handles the full integration surface.


Model Selection

Not every agent requires the same model. Calibrating model selection to reasoning depth reduces cost and latency without compromising output quality.

Agent

Reasoning demand

Recommended model tier

Context Briefing

High — multi-source synthesis

Full reasoning model (e.g. Claude Sonnet)

Decision Log

High — structured capture from ambiguous input

Full reasoning model

Inbox Triage

Medium — classification + draft generation

Mid-tier or fast model

Workload Tracker

Low — threshold detection + digest formatting

Fast model

Follow-Up

Low — retrieval + surface

Fast model

A common mistake is running every agent on the most capable available model. This inflates cost and does not improve the agents where improvement is actually needed. The triage agent does not need the same reasoning depth as the briefing agent.


Implementation Path

agent-orches-impl.pngAttempting to wire all five agents simultaneously before any single one is stable is the most common failure mode in this class of implementation.

Phase 1 — State layer first. Before any agent can work, the state layer must exist and be actively maintained. Define the record structure for workstream entries — current status, last action, open decisions, next action — and populate it for two or three active workstreams. Run manually for two weeks. This is the foundation every agent reads from. Skip it and every downstream agent will surface incomplete or stale context.

Phase 2 — Single agent, single output. Build and calibrate one agent before adding another. The Context Briefing Agent is a strong starting point: it has a clear trigger (calendar), a high-signal data source (state layer), and delivers immediate, visible value. Connect Notion and Google Calendar via MCP, define the briefing prompt, route output to Slack. Calibrate the prompt and trigger window over two weeks before adding anything else.

Phase 3 — Add agents progressively. The Workload Tracker runs on data already in the state layer — no new MCP connections required. Add it second. The Inbox Triage Agent adds Google Workspace MCP; add it third. The Follow-Up and Decision Log agents depend on the state layer being mature enough to hold the records they work from — add them last.

Phase 4 — Calibrate. Once all five agents are running, audit signal-to-noise. Which agents are generating useful output? Which are producing alerts the operator ignores? Adjust trigger windows, refine prompts, swap model tiers where appropriate. Expect two to four weeks of calibration before the system runs at a level of reliability that justifies depending on it.

The full architecture — Phase 1 through Phase 4 — can be operational for a single operator in four to six weeks. For a team-level deployment, the state layer structuring takes longer and should be treated as a separate project with its own timeline.


Working With Reveriext

The tools in this architecture are accessible. The implementation decisions that determine whether it actually works — state layer information architecture, agent prompt engineering, trigger calibration, model selection, Slack routing logic — are where most teams underinvest and where the system either holds or breaks.

Reveriext's reStrategy practice works with organizations to design and build agentic operating architectures at this level of specificity. If you are evaluating whether this approach fits your team's operating model, or want a technical assessment of where to start, reach out to the reStrategy team at reveriext.com.

Was this helpful?

20 views

Ready to translate complexity into clarity?

reStrategy is Reveriext's AI strategy & architecture practice. Two weeks. One page.

Explore reStrategy →