Architecture

SumoSpace is to LLMs what an operating system is to programs.

The kernel schedules tasks. Tools are system calls. The committee is the permission system — no action is taken without deliberation.

Module Map

flowchart TB
    CLI["CLI / typer"] --> Kernel

    subgraph Core ["Core"]
        Kernel["Kernel"]
        SumoSettings["SumoSettings"]
        Classifier["Classifier"]
    end

    subgraph CommitteeGroup ["Committee"]
        Planner["Planner"]
        Critic["Critic"]
        Resolver["Resolver"]
    end

    subgraph MemoryGroup ["Memory"]
        WorkingMemory["Working Memory"]
        EpisodicMemory["Episodic Memory"]
    end

    subgraph VectorGroup ["Vector Store"]
        VectorStore["chroma / faiss / qdrant"]
        Embedder["Embedder"]
        Reranker["Reranker"]
    end

    subgraph ToolGroup ["Tools"]
        BuiltIn["Built-in Tools"]
        Plugin["Plugin Tools\n(entry_points)"]
        ToolRegistry["Tool Registry"]
    end

    subgraph ObsGroup ["Observability"]
        AuditLogger["Audit Logger"]
        SnapshotManager["Snapshot Manager"]
        Telemetry["Telemetry"]
    end

    Kernel --> SumoSettings
    Kernel --> Classifier
    Classifier --> CommitteeGroup
    CommitteeGroup --> ToolRegistry
    ToolRegistry --> BuiltIn
    ToolRegistry --> Plugin
    Kernel --> MemoryGroup
    MemoryGroup --> VectorGroup
    Kernel --> ObsGroup
    RAGEngine["RAG Engine"] --> VectorGroup
    Kernel --> RAGEngine

The Committee In Depth

The Committee is a three-agent deliberation pipeline that sits between the user's task and tool execution. No filesystem write, no shell command, no network call happens unless the Critic approves.

Planner

The Planner receives:

- The user's task string
- The classified Intent
- The list of available tools (names + descriptions only)
- Working memory context (recent tool outputs)

It produces a structured JSON plan — an ordered list of steps:

{
  "reasoning": "The user wants docstrings. I need to read the file first, then write updated content.",
  "steps": [
    {"tool": "read_file",  "args": {"path": "./src/utils.py"}},
    {"tool": "write_file", "args": {"path": "./src/utils.py", "content": "..."}}
  ]
}
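
A plan in this shape is easy to validate before it reaches the Critic. The following is a minimal sketch, not SumoSpace's own parser: `PlanStep`, `Plan`, and `parse_plan` are hypothetical names introduced for illustration, and the only assumption about the format is the `reasoning`/`steps`/`tool`/`args` layout shown above.

```python
import json
from dataclasses import dataclass, field
from typing import Any


@dataclass
class PlanStep:
    """One tool invocation (hypothetical type, mirrors the JSON above)."""
    tool: str
    args: dict[str, Any] = field(default_factory=dict)


@dataclass
class Plan:
    """Ordered list of steps plus the Planner's stated reasoning."""
    reasoning: str
    steps: list[PlanStep]


def parse_plan(raw: str, known_tools: set[str]) -> Plan:
    """Parse Planner output; reject unknown tools, bad args, and empty plans."""
    data = json.loads(raw)
    steps = []
    for number, item in enumerate(data.get("steps", []), start=1):
        tool = item.get("tool")
        if tool not in known_tools:
            raise ValueError(f"step {number}: unknown tool {tool!r}")
        args = item.get("args", {})
        if not isinstance(args, dict):
            raise ValueError(f"step {number}: args must be a JSON object")
        steps.append(PlanStep(tool=tool, args=args))
    if not steps:
        raise ValueError("plan contains no steps")
    return Plan(reasoning=data.get("reasoning", ""), steps=steps)


raw_plan = """
{
  "reasoning": "Read the file first, then write updated content.",
  "steps": [
    {"tool": "read_file",  "args": {"path": "./src/utils.py"}},
    {"tool": "write_file", "args": {"path": "./src/utils.py", "content": "..."}}
  ]
}
"""
plan = parse_plan(raw_plan, known_tools={"read_file", "write_file"})
print([step.tool for step in plan.steps])
```

Rejecting unknown tool names at parse time keeps a hallucinated tool from ever reaching review or execution.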

Critic

The Critic receives the full plan and evaluates it against:

- Scope check: does the plan touch files outside the working directory?
- Destructive check: does any step delete or overwrite without a prior read?
- Shell blocklist: does any tool argument contain a blocked pattern (e.g. rm -rf, curl, wget)?
- Logical consistency: does the plan make sense for the stated intent?

Example rejection:

{
  "approved": false,
  "reason": "Step 2 writes to /etc/passwd — path is outside the project root. Rejected."
}

When rejected, the Resolver is invoked.
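
The scope and destructive checks are mechanical enough to illustrate without an LLM. The sketch below is an assumption about how such checks could look, not the Critic's actual code: `review_paths`, `READ_TOOLS`, and `WRITE_TOOLS` are hypothetical, `delete_file` is an invented tool name, and every write is treated as an overwrite for simplicity.

```python
from pathlib import Path

# Hypothetical tool names; only read_file and write_file appear in this document.
READ_TOOLS = {"read_file"}
WRITE_TOOLS = {"write_file", "delete_file"}


def review_paths(steps: list, root: Path) -> dict:
    """Apply a scope check and a read-before-write check to each step."""
    root = root.resolve()
    already_read = set()
    for number, step in enumerate(steps, start=1):
        path = step.get("args", {}).get("path")
        if path is None:
            continue
        # Joining with an absolute path discards `root`, so "/etc/passwd"
        # stays absolute; resolve() also collapses any ".." segments.
        target = (root / path).resolve()
        if not target.is_relative_to(root):  # Python 3.9+
            return {
                "approved": False,
                "reason": f"Step {number} touches {path}: path is outside the project root.",
            }
        if step["tool"] in READ_TOOLS:
            already_read.add(target)
        elif step["tool"] in WRITE_TOOLS and target not in already_read:
            return {
                "approved": False,
                "reason": f"Step {number} modifies {path} without a prior read.",
            }
    return {"approved": True, "reason": None}


project_root = Path("/tmp/project")
bad_plan = [
    {"tool": "read_file", "args": {"path": "./src/utils.py"}},
    {"tool": "write_file", "args": {"path": "/etc/passwd", "content": "..."}},
]
verdict = review_paths(bad_plan, project_root)
print(verdict["approved"], verdict["reason"])
```

Checks like these are deterministic and cheap, so they could run before the LLM-based review; the logical-consistency check has no comparable mechanical form.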

Resolver

The Resolver receives:

- The original plan
- The Critic's rejection reason

It produces a revised plan that addresses the Critic's concerns. This feeds back to the Critic for a second review. After 3 failed cycles, the task fails safely — no tools are executed.
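
The review loop can be sketched as follows. This is an illustration under one stated assumption: a "cycle" is taken to mean one Critic review that ends in rejection. The function `deliberate` and the two toy callables are hypothetical stand-ins for the real agents.

```python
MAX_CYCLES = 3  # from the text: the task fails after 3 failed cycles


def deliberate(plan, review, resolve, max_cycles=MAX_CYCLES):
    """Return an approved plan, or None if every review cycle rejects it."""
    for cycle in range(1, max_cycles + 1):
        verdict = review(plan)
        if verdict["approved"]:
            return plan
        if cycle < max_cycles:
            # Only ask for a revision if another review will follow.
            plan = resolve(plan, verdict["reason"])
    return None  # fail safely: the caller executes no tools


def toy_review(plan):
    """Stand-in Critic: rejects any step whose path starts with /etc/."""
    for number, step in enumerate(plan["steps"], start=1):
        if step["args"]["path"].startswith("/etc/"):
            return {"approved": False,
                    "reason": f"Step {number} is outside the project root."}
    return {"approved": True, "reason": None}


def toy_resolve(plan, reason):
    """Stand-in Resolver: rewrites every path to be project-relative."""
    steps = [
        {"tool": s["tool"],
         "args": {**s["args"], "path": "./" + s["args"]["path"].lstrip("/")}}
        for s in plan["steps"]
    ]
    return {"steps": steps}


initial = {"steps": [{"tool": "write_file",
                      "args": {"path": "/etc/passwd", "content": "..."}}]}
approved = deliberate(initial, toy_review, toy_resolve)
print(approved["steps"][0]["args"]["path"])
```

Returning `None` instead of raising keeps the failure path explicit: the caller has no plan object to execute by accident.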

Safety Model

SumoSpace implements three independent safety layers:

Layer 1 — Shell Blocklist (instant, regex-based)
Layer 2 — Committee Review (semantic, LLM-based)
Layer 3 — Snapshot + Rollback (recovery after execution)

Layer 1 catches obvious attacks: rm -rf, curl | bash, subprocess.call, etc.

Layer 2 catches logic-level risks: out-of-scope writes, data destruction, infinite loops.

Layer 3 catches failures that slip through layers 1 and 2: if a tool produces a bad result, you can undo it with sumo rollback.
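
To make Layer 1 concrete, here is a minimal regex blocklist sketch. The patterns are illustrative guesses based on the examples named above, not SumoSpace's actual list, and `first_blocked_pattern` is a hypothetical helper.

```python
import re

# Illustrative patterns only; the real blocklist is not shown in this document.
BLOCKED_PATTERNS = [
    re.compile(r"\brm\s+-rf\b"),
    re.compile(r"\b(?:curl|wget)\b"),
    re.compile(r"\bsubprocess\.call\b"),
]


def iter_strings(value):
    """Yield every string nested anywhere inside a tool's arguments."""
    if isinstance(value, str):
        yield value
    elif isinstance(value, dict):
        for item in value.values():
            yield from iter_strings(item)
    elif isinstance(value, (list, tuple)):
        for item in value:
            yield from iter_strings(item)


def first_blocked_pattern(args):
    """Return the first matching pattern's source, or None if args are clean."""
    for text in iter_strings(args):
        for pattern in BLOCKED_PATTERNS:
            if pattern.search(text):
                return pattern.pattern
    return None


print(first_blocked_pattern({"command": "rm -rf /"}))
print(first_blocked_pattern({"command": "ls -la"}))
```

A regex layer is fast but easy to evade (for example with extra flags or string concatenation), which is consistent with treating it as one of three independent layers.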

Data Flow

Here is the exact call sequence from kernel.run("task") to trace.final_answer:

1. SumoKernel.run(task) (entry point)
2. Classifier.classify(task) → Intent
3. RAGEngine.retrieve(task) → relevant code context
4. WorkingMemory.load(session_id) → recent history
5. PlannerAgent.plan(task, intent, context) → Plan
6. CriticAgent.review(plan) → Approval or Rejection
7. If rejected → ResolverAgent.resolve(plan, rejection) → revised Plan → back to step 6
8. If approved → ToolExecutor.run(plan.steps) sequentially:
   - For each step: SnapshotManager.snapshot() → tool.execute() → append to step_traces
9. WorkingMemory.save(session_id, step_traces)
10. AuditLogger.log(session_id, trace)
11. Return AgentTrace
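
The execution step (snapshot, then execute, then record) can be sketched as below. This is not the real `ToolExecutor`: `StepTrace`, `run_steps`, and the in-memory `files` dict are hypothetical, and stopping at the first failing step is an assumption the document does not confirm.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class StepTrace:
    """Record of one executed step (hypothetical shape)."""
    tool: str
    args: dict
    output: Any = None
    error: Optional[str] = None


def run_steps(steps, tools, snapshot):
    """Run steps in order, taking a snapshot before each tool call."""
    traces = []
    for step in steps:
        snapshot()  # taken first so this step's effects can be undone
        trace = StepTrace(tool=step["tool"], args=step["args"])
        try:
            trace.output = tools[step["tool"]](**step["args"])
        except Exception as exc:
            trace.error = str(exc)
            traces.append(trace)
            break  # assumption: later steps are skipped after a failure
        traces.append(trace)
    return traces


# In-memory stand-ins for the filesystem and the snapshot store.
files = {"utils.py": "old"}
snapshots = []


def write_file(path, content):
    files[path] = content
    return "ok"


tools = {"read_file": lambda path: files[path], "write_file": write_file}
steps = [
    {"tool": "read_file", "args": {"path": "utils.py"}},
    {"tool": "write_file", "args": {"path": "utils.py", "content": "new"}},
]
step_traces = run_steps(steps, tools, snapshot=lambda: snapshots.append(dict(files)))
print([t.tool for t in step_traces], files["utils.py"])
```

Because the snapshot precedes each call, the last snapshot always holds the state from before the most recent write, which is what a rollback would need.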

Scope Isolation (Multi-Tenancy)

Each session_id gets an isolated ChromaDB collection. Queries in session A cannot retrieve documents ingested in session B.

flowchart LR
    S1[Session A] --> C1["Collection: session_a_*"]
    S2[Session B] --> C2["Collection: session_b_*"]
    C1 --> VS[(Vector Store)]
    C2 --> VS
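
The isolation property can be illustrated with a toy in-memory store. This sketch does not use ChromaDB: `ScopedStore` and `collection_name` are hypothetical, the `docs` suffix is invented, and substring matching stands in for vector similarity search.

```python
from collections import defaultdict


def collection_name(session_id: str, suffix: str = "docs") -> str:
    """Derive a per-session collection name, e.g. session_a_docs."""
    return f"session_{session_id}_{suffix}"


class ScopedStore:
    """Toy store in which every query is confined to one session's collection."""

    def __init__(self):
        self._collections = defaultdict(list)

    def ingest(self, session_id: str, document: str) -> None:
        self._collections[collection_name(session_id)].append(document)

    def query(self, session_id: str, term: str) -> list:
        # Only the caller's own collection is ever searched.
        documents = self._collections.get(collection_name(session_id), [])
        return [doc for doc in documents if term.lower() in doc.lower()]


store = ScopedStore()
store.ingest("a", "def parse_config(path): ...")
store.ingest("b", "def render_page(template): ...")
print(store.query("a", "parse"))
print(store.query("b", "parse"))
```

Deriving the collection name from `session_id` inside the store, rather than accepting a collection name from the caller, is what keeps a session from addressing another session's data.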

Extension Points

| Extension | Interface | Registration |
| --- | --- | --- |
| Provider | `BaseAdapter` | `SumoSettings(provider=...)` |
| Tool | `BaseTool` | `ToolRegistry.register()` or `entry_points` |
| Hook | Async callable | `SumoSettings(hooks={event: fn})` |
| Vector Store | `BaseVectorStore` | `SumoSettings(vector_store=...)` |
| Embedder | `BaseEmbedder` | `SumoSettings(embedder=...)` |

See Tools and Hooks for complete implementation guides.
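
As a rough picture of the Tool and Hook rows in the table above, the sketch below defines local stand-ins rather than importing from SumoSpace. The `BaseTool` and `ToolRegistry` classes here are simplified assumptions about the interface (only `register()` and `execute()` are named in this document), and `WordCountTool` and `on_tool_finished` are invented examples.

```python
import asyncio
from abc import ABC, abstractmethod


class BaseTool(ABC):
    """Local stand-in for the real BaseTool; attribute names are assumptions."""
    name: str
    description: str

    @abstractmethod
    def execute(self, **kwargs):
        ...


class ToolRegistry:
    """Local stand-in registry keyed by tool name."""

    def __init__(self):
        self._tools = {}

    def register(self, tool: BaseTool) -> None:
        if tool.name in self._tools:
            raise ValueError(f"tool {tool.name!r} is already registered")
        self._tools[tool.name] = tool

    def get(self, name: str) -> BaseTool:
        return self._tools[name]

    def describe(self) -> list:
        # Names and descriptions only, matching what the Planner receives.
        return [(t.name, t.description) for t in self._tools.values()]


class WordCountTool(BaseTool):
    name = "word_count"
    description = "Count whitespace-separated words in a string."

    def execute(self, text: str) -> int:
        return len(text.split())


events = []


async def on_tool_finished(payload: dict) -> None:
    """Example hook: an async callable that records the event payload."""
    events.append(payload)


registry = ToolRegistry()
registry.register(WordCountTool())
result = registry.get("word_count").execute(text="three short words")
asyncio.run(on_tool_finished({"tool": "word_count", "result": result}))
print(registry.describe(), result)
```

The real base classes and hook event names are defined in the Tools and Hooks guides; treat the signatures here as placeholders.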