Architecture
A layered system with strict dependency direction, multiple message entry points, and a 6-step runtime pipeline at its core.
System Architecture
NarraNexus is organized into seven layers. Upper layers call downward; lower layers never reference upper layers. This keeps each layer replaceable without cascading changes.
backend/routes/
HTTP and WebSocket endpoints. Request validation, response serialization. No business logic lives here.
agent_runtime/
The AgentRuntime sequences every interaction through a 6-step pipeline. This is the brain of the system.
*_service.py
Stable public interfaces for Narrative selection and Module management. A bridge pattern delegates calls to the private implementation packages.
_*_impl/
Concrete business logic behind each service. Underscore-prefixed packages, never imported directly.
services/
Long-running polling processes: JobTrigger (scheduled tasks), MessageBusTrigger (agent messaging), ModulePoller (instance state).
repository/
Generic BaseRepository with typed CRUD operations. Handles pagination, batch queries, and N+1 prevention.
schema/, utils/
Pydantic models and a singleton AsyncDatabaseClient. Supports both SQLite (local) and MySQL (production).
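The service/impl split above can be sketched as a minimal bridge: callers depend only on the public service class, which forwards to a private implementation. This is an illustrative sketch, not the real API; the names `NarrativeService` and `_NarrativeImpl` and the `select_narrative` method are assumptions.

```python
class _NarrativeImpl:
    """Private implementation; lives in an underscore-prefixed package
    and is never imported directly by callers."""

    def select(self, query: str) -> str:
        # The real logic (LLM continuity check, vector search) would go here.
        return f"narrative-for:{query}"


class NarrativeService:
    """Stable public interface; the only symbol callers import."""

    def __init__(self) -> None:
        self._impl = _NarrativeImpl()

    def select_narrative(self, query: str) -> str:
        # Delegation keeps the public signature stable while the
        # implementation behind it can be swapped without touching callers.
        return self._impl.select(query)
```

Because the implementation package is underscore-prefixed and reached only through the service, replacing it is invisible to every layer above.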
The Orchestration layer (AgentRuntime) is the only layer that touches both services and the API boundary. Everything below it is reusable across different entry points — the same pipeline runs whether a message comes from a WebSocket, a scheduled job, or another agent.
How Messages Enter the System
The runtime pipeline always runs the same steps regardless of where the message came from. What changes is the trigger — the mechanism that initiates the run. Each trigger attaches a WorkingSource tag so the agent can adapt its behavior to different message sources.
CHAT
WebSocket connection from the frontend. User sends a message, the pipeline runs, tokens stream back in real time. This is the default path.
JOB
The JobTrigger polls the database for due tasks (recurring reminders, monitoring jobs, one-off tasks). When a job fires, the pipeline runs autonomously and delivers results to the user’s inbox.
MESSAGE_BUS
The MessageBusTrigger polls for unread messages in agent channels. When an agent is @mentioned or receives a DM, the pipeline runs with the message as input.
MATRIX
External messaging platforms (Matrix is implemented; Telegram and Lark are planned). Messages arrive via platform-specific triggers and are normalized into the standard pipeline input.
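The trigger taxonomy above can be sketched as an enum plus a normalized pipeline input. The `WorkingSource` values come from this page; the `PipelineInput` dataclass, its fields, and the Matrix event shape are illustrative assumptions, not the real schema.

```python
from dataclasses import dataclass
from enum import Enum


class WorkingSource(Enum):
    CHAT = "chat"
    JOB = "job"
    MESSAGE_BUS = "message_bus"
    MATRIX = "matrix"


@dataclass
class PipelineInput:
    source: WorkingSource  # lets the agent adapt behavior per source
    agent_id: str
    text: str


def normalize_matrix_event(agent_id: str, event: dict) -> PipelineInput:
    # Platform-specific payloads are flattened into the standard pipeline
    # input before the (source-agnostic) pipeline runs. The event shape
    # here is a hypothetical Matrix-style payload.
    return PipelineInput(WorkingSource.MATRIX, agent_id, event["content"]["body"])
```

Whatever the trigger, the pipeline only ever sees a `PipelineInput`-shaped value, which is what keeps the runtime identical across entry points.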
A core design principle: for N agents, never create N listeners. Background triggers (JobTrigger, MessageBusTrigger) use a single shared poller that routes work to the correct agent. This keeps resource usage constant regardless of how many agents are deployed.
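The single-shared-poller principle can be sketched as one asyncio loop that fetches due work for all agents and routes each item, rather than spawning a listener per agent. The `fetch_due`/`dispatch` callables are assumptions standing in for the real database poll and agent routing.

```python
import asyncio


async def shared_poller(fetch_due, dispatch, interval: float = 1.0) -> None:
    # One loop serves every agent: each poll returns (agent_id, payload)
    # pairs, and work is routed to the right agent. Resource usage stays
    # constant no matter how many agents are deployed.
    while True:
        for agent_id, payload in await fetch_due():
            await dispatch(agent_id, payload)
        await asyncio.sleep(interval)
```

A per-agent-listener design would instead hold N open poll loops for N agents; routing inside one loop is what keeps the cost O(1) in agent count.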
Runtime Pipeline
Every message — regardless of source — flows through this pipeline. Implementation files live in agent_runtime/_agent_runtime_steps/. The pipeline has three phases.
Phase 1. Load agent state, find the right storyline, decide which modules are needed, and gather all context. The user sees progress indicators during this phase.
Load agent config, create the Event record, start or resume a Session, load the agent’s Awareness profile. If EverMemOS is enabled, episode search is launched in parallel here — it runs concurrently with the next steps and is awaited just before execution.
Determine which storyline this message belongs to. An LLM continuity check compares the query against the current Narrative. If the topic has changed, a vector search looks for a matching existing Narrative; if none matches, a new one is created.
Read the conversation’s markdown history file. Parse previous interactions and instance state to inform module decisions in the next step.
An LLM decision determines which module instances should be active for this interaction (add, keep, or remove). Module objects are instantiated and MCP servers started. The execution path is chosen: Agent Loop (99% — full LLM reasoning) or Direct Trigger (1% — skip LLM, call a tool directly).
Establish instance-to-Narrative links in the database. Create new records for added instances, archive completed ones. If a JobModule instance was created, the corresponding Job record is initialized.
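The concurrency pattern described above (episode search launched early, awaited just before execution) can be sketched with `asyncio.create_task`. The functions `search_episodes` and `select_narrative` here are hypothetical stand-ins for the EverMemOS search and the Narrative-selection step.

```python
import asyncio


# Hypothetical stand-ins for EverMemOS episode search and Narrative selection.
async def search_episodes(query: str) -> list[str]:
    await asyncio.sleep(0.01)  # simulated I/O latency
    return [f"episode about {query}"]


async def select_narrative(query: str) -> str:
    await asyncio.sleep(0.01)
    return f"narrative:{query}"


async def prepare(query: str):
    # Launch episode search immediately; it runs concurrently with the
    # remaining preparation steps instead of serializing behind them.
    episode_task = asyncio.create_task(search_episodes(query))
    narrative = await select_narrative(query)  # runs while search is in flight
    episodes = await episode_task              # awaited just before execution
    return narrative, episodes
```

The search latency is thereby overlapped with the other preparation steps, so it adds no wall-clock time unless it outlasts all of them.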
Phase 2. The agent thinks and responds. This is where the user sees tokens streaming in real time.
The ContextRuntime assembles the full context: module instructions, conversation history (dual-track memory), EverMemOS episodes (if available), social network data, awareness profile, and MCP tool URLs. This context is passed to the LLM as a system prompt + message history.
The agent runs in a loop — it can reason, call tools via MCP, observe results, and continue reasoning until it produces a final response. Each token is streamed to the user in real time via WebSocket.
For the rare Direct Trigger path (1%), the LLM is skipped entirely — the system calls the target MCP tool directly and returns the result.
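The reason/call/observe loop above can be sketched as a bounded iteration: the LLM either emits a final answer or requests a tool call, and the tool result is fed back as the next observation. The `llm_step`/`call_tool` callables and the tuple protocol are illustrative assumptions, not the real interfaces.

```python
def agent_loop(llm_step, call_tool, max_iters: int = 8) -> str:
    # llm_step(observation) returns either ("final", text) or
    # ("tool", name, args); call_tool stands in for an MCP tool invocation.
    observation = None
    for _ in range(max_iters):
        action = llm_step(observation)
        if action[0] == "final":
            return action[1]
        _, name, args = action
        observation = call_tool(name, args)  # result fed back for more reasoning
    raise RuntimeError("agent did not produce a final response")
```

The Direct Trigger path corresponds to skipping this loop entirely and invoking `call_tool` once with a known target.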
Phase 3. Save everything that happened, then run module hooks in the background. The user has already seen the response — this phase is mostly invisible.
Record the execution trajectory, update the Event with the final output, refresh the Narrative summary and embedding (LLM call for the main Narrative), update the Session, and log token costs. This step blocks briefly because the Event data is needed by hooks.
Dispatched to a background task — the user’s connection closes immediately. Each active module’s hook_after_event_execution runs in parallel: ChatModule saves messages, SocialNetworkModule extracts entity info (1–3 LLM calls), JobModule evaluates completion conditions, MemoryModule writes to EverMemOS. Callback results trigger any dependent instances.
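The background dispatch above can be sketched as a detached task that runs every module's hook concurrently. The `hook_after_event_execution` name is from this page; the module objects and error handling are assumptions.

```python
import asyncio


async def run_hooks_in_background(modules, event):
    # The user's response has already streamed, so hooks run in a detached
    # task: the caller can return immediately without awaiting them.
    async def _run_all():
        await asyncio.gather(
            *(m.hook_after_event_execution(event) for m in modules),
            return_exceptions=True,  # one failing hook must not block others
        )

    return asyncio.create_task(_run_all())
```

`return_exceptions=True` matters here: with hooks like entity extraction making their own LLM calls, a single failure should be logged, not allowed to cancel its siblings.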