This document provides a comprehensive overview of the MindX system's architecture, detailing the interaction between its core components and the flow of data and control during its primary operational workflows.
MindX operates on a hierarchical model of specialized agents, each with a distinct role. This separation of concerns ensures that strategic, tactical, and operational tasks are handled by the appropriate component.
```mermaid
graph TD
    subgraph "Human / High-Level Control"
        A[User/Operator]
    end
    subgraph "Strategic Layer"
        B(MastermindAgent) -- "issues strategic campaigns" --> C(StrategicEvolutionAgent);
    end
    subgraph "Orchestration & Tactical Layer"
        C -- "delegates improvement tasks" --> D(CoordinatorAgent);
        D -- "invokes via CLI" --> E(SelfImprovementAgent);
    end
    subgraph "Core Services & Data"
        F(BeliefSystem)
        G(ModelRegistry)
        H(IDManagerAgent)
        I(MemoryAgent)
        J(Monitoring Agents)
    end
    A -- "issues directives" --> B;
    A -- "can interact directly" --> D;
    B -- "uses" --> F;
    B -- "uses" --> G;
    C -- "uses" --> F;
    C -- "uses" --> G;
    C -- "uses" --> J;
    D -- "uses" --> I;
    D -- "uses" --> H;
    E -- "uses" --> G;
    style A fill:#f9f,stroke:#333,stroke-width:2px
    style B fill:#cff,stroke:#333,stroke-width:2px
    style C fill:#cfc,stroke:#333,stroke-width:2px
    style D fill:#fcf,stroke:#333,stroke-width:2px
    style E fill:#ffc,stroke:#333,stroke-width:2px
```
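As a rough illustration, the delegation chain in the diagram above can be sketched in plain Python. Class and method names here are assumptions for clarity; in the real system the CoordinatorAgent invokes the SIA as a CLI subprocess, not as an in-process call.

```python
# Illustrative sketch of the four-layer delegation chain (names are assumed).

class SelfImprovementAgent:
    """Operational layer: the 'code surgeon' that modifies and evaluates code."""
    def improve(self, target: str, goal: str) -> dict:
        # Stand-in for a full analyze/implement/evaluate cycle.
        return {"target": target, "goal": goal, "status": "evaluated"}

class CoordinatorAgent:
    """Tactical layer: manages infrastructure and dispatches tasks to the SIA."""
    def __init__(self) -> None:
        self.sia = SelfImprovementAgent()
    def execute(self, target: str, goal: str) -> dict:
        return self.sia.improve(target, goal)

class StrategicEvolutionAgent:
    """Campaign manager: turns a broad goal into concrete improvement tasks."""
    def __init__(self, coordinator: CoordinatorAgent) -> None:
        self.coordinator = coordinator
    def run_campaign(self, goal: str) -> list:
        # A real SEA derives these tasks from system analysis; one is hard-coded here.
        tasks = [("logging_config.py", goal)]
        return [self.coordinator.execute(t, g) for t, g in tasks]

class MastermindAgent:
    """Apex strategist: issues campaigns and receives summaries."""
    def __init__(self, sea: StrategicEvolutionAgent) -> None:
        self.sea = sea
    def direct(self, goal: str) -> list:
        return self.sea.run_campaign(goal)

results = MastermindAgent(StrategicEvolutionAgent(CoordinatorAgent())).direct(
    "Improve system logging resilience")
```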
- MastermindAgent: The apex strategist. It formulates long-term goals and initiates broad "improvement campaigns."
- StrategicEvolutionAgent (SEA): The campaign manager. It analyzes the system to break a broad campaign goal down into a concrete, safe, multi-step plan.
- CoordinatorAgent: The system conductor and service bus. It manages infrastructure, handles the agent lifecycle, and provides the operational foundation for the MastermindAgent's orchestration.
- SelfImprovementAgent (SIA): The "code surgeon." A CLI tool that performs the dangerous work of modifying and evaluating code.

A core feature of MindX is its ability to dynamically select the best Large Language Model (LLM) for a given task. This workflow balances performance, cost, and capability.
```mermaid
graph TD
    subgraph "Offline/Setup Phase"
        A[Operator] -- "runs" --> B(scripts/audit_gemini.py);
        B -- "discovers & tests" --> C[Gemini API];
        B -- "writes config" --> D(models/gemini.yaml);
    end
    subgraph "Runtime Phase"
        subgraph "Initialization"
            E(ModelRegistry) -- "uses" --> F(LLMFactory);
            F -- "creates" --> G(GeminiHandler);
            G -- "reads config from" --> D;
            E -- "builds capability map from" --> D;
        end
        subgraph "Selection & Execution"
            H[Agent] -- "1. requests handler for a 'TaskType' (e.g., code_generation)" --> E;
            E -- "2. passes all model capabilities to" --> I(ModelSelector);
            I -- "3. calculates weighted score for each model" --> I;
            I -- "4. returns ranked list of model IDs" --> E;
            E -- "5. selects top model & returns its handler" --> H;
            H -- "6. calls generate_text(model_id)" --> G;
            G -- "7. looks up api_name & calls API" --> C;
        end
    end
```
- An operator runs scripts/audit_gemini.py. This tool queries the Gemini API, discovers all available models, tests their functions (text, vision, etc.), and writes a detailed configuration file, models/gemini.yaml. This file includes each model's API name, its tested capabilities, and default scores for various tasks (e.g., reasoning: 0.85, code_generation: 0.88).
- At startup, the ModelRegistry is initialized. It uses the LLMFactory to create a GeminiHandler (and handlers for any other configured providers). The GeminiHandler reads models/gemini.yaml to know which models exist. The ModelRegistry also reads this file to build a map of ModelCapability objects, which store the task scores, cost, and other metadata for each model.
- At runtime, an agent needs to perform a task such as code_generation. It asks the ModelRegistry for a suitable handler.
- The ModelRegistry passes its entire list of model capabilities to the ModelSelector.
- The ModelSelector calculates a weighted score for every model based on its suitability for code_generation, its historical success rate, cost, and latency.
- It returns a ranked list of the best models.
- The ModelRegistry takes the top-ranked model, identifies its provider (gemini), and returns the cached GeminiHandler.
- The agent uses this handler to call generate_text, passing the specific ID of the top-ranked model.
- The GeminiHandler uses its internal catalog (from gemini.yaml) to find the correct API name for the model and makes the final call to the Google Gemini API.
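The weighted scoring step in this walkthrough could be sketched as follows. The dataclass fields, weights, and example models are illustrative assumptions, not MindX's actual schema; the point is only that task suitability, historical success rate, cost, and latency all feed one ranking.

```python
# Hypothetical sketch of the weighted scoring a ModelSelector might perform.
from dataclasses import dataclass, field

@dataclass
class ModelCapability:
    model_id: str
    task_scores: dict          # e.g. {"code_generation": 0.88, "reasoning": 0.85}
    success_rate: float        # historical success rate, 0..1
    cost_per_1k_tokens: float  # lower is better
    avg_latency_s: float       # lower is better

def rank_models(capabilities, task_type, weights=(0.5, 0.2, 0.15, 0.15)):
    """Return model IDs ranked best-first by a weighted score for task_type."""
    w_task, w_success, w_cost, w_latency = weights
    scored = []
    for cap in capabilities:
        score = (
            w_task * cap.task_scores.get(task_type, 0.0)
            + w_success * cap.success_rate
            # Cost and latency act as penalties: cheaper/faster scores higher.
            + w_cost * (1.0 / (1.0 + cap.cost_per_1k_tokens))
            + w_latency * (1.0 / (1.0 + cap.avg_latency_s))
        )
        scored.append((score, cap.model_id))
    return [model_id for _, model_id in sorted(scored, reverse=True)]

models = [
    ModelCapability("gemini-1.5-pro", {"code_generation": 0.88, "reasoning": 0.85},
                    success_rate=0.95, cost_per_1k_tokens=0.007, avg_latency_s=2.0),
    ModelCapability("gemini-1.5-flash", {"code_generation": 0.80, "reasoning": 0.75},
                    success_rate=0.97, cost_per_1k_tokens=0.0007, avg_latency_s=0.6),
]
ranking = rank_models(models, "code_generation")
```

Note that with these (made-up) weights the cheaper, faster model can outrank the one with the higher raw task score, which is exactly the performance/cost trade-off the selector is meant to arbitrate.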
This is the primary end-to-end workflow for making significant, planned improvements to the system. It involves the entire agent hierarchy.
```mermaid
sequenceDiagram
    participant M as MastermindAgent
    participant S as StrategicEvolutionAgent (SEA)
    participant C as CoordinatorAgent
    participant SIA as SelfImprovementAgent (CLI)
    participant Monitor as Monitoring Agents
    M->>S: run_evolution_campaign("Improve system logging resilience")
    activate S
    S->>Monitor: Query performance & resource data
    S->>S: Analyze system & formulate strategic plan
    S->>C: request_coordinator_for_sia_execution(task_details)
    activate C
    C->>SIA: asyncio.create_subprocess_exec('python self_improve_agent.py ...')
    activate SIA
    SIA->>SIA: Perform safe improvement cycle (analyze, implement, evaluate)
    SIA-->>C: return JSON result via stdout
    deactivate SIA
    C-->>S: Return interaction result
    deactivate C
    S->>S: Evaluate outcome & continue/conclude plan
    S-->>M: Return final campaign summary
    deactivate S
```
- The MastermindAgent initiates the process by giving a high-level goal to the StrategicEvolutionAgent (SEA), such as "Improve system logging resilience."
- The SEA uses its SystemAnalyzerTool to gather data. This tool queries the PerformanceMonitor for LLM error rates and the ResourceMonitor for system load.
- Based on this analysis, the SEA identifies a concrete target for improvement (e.g., "The logging_config.py file lacks robust error handling for file permissions").
- It then uses an LLM to generate a safe, multi-step plan. A critical rule is enforced: any plan to modify code must include steps for creating a rollback plan before the modification and running validation tests after.
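That safety rule is mechanically checkable. A minimal validator might look like the following; the step-kind names ("modify_code", "create_rollback", "run_validation") are illustrative assumptions, not MindX's actual plan schema.

```python
# Hypothetical check that a plan creates a rollback plan before any code
# modification and schedules validation tests after it.

def plan_is_safe(steps):
    """steps: ordered list of step-kind strings for one strategic plan."""
    for i, step in enumerate(steps):
        if step == "modify_code":
            if "create_rollback" not in steps[:i]:
                return False  # no rollback plan prepared before the change
            if "run_validation" not in steps[i + 1:]:
                return False  # no validation scheduled after the change
    return True
```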
- The SEA delegates each step of its plan to the CoordinatorAgent. For a code change, it sends an interaction of type COMPONENT_IMPROVEMENT with the target file path and the specific improvement goal.
- The CoordinatorAgent receives the request. It uses its heavy_task_semaphore to ensure it doesn't overload the system with too many concurrent modifications.
- It invokes the SelfImprovementAgent (SIA) as a separate process via its command-line interface, passing the target file and goal as arguments. This decoupling is a key safety feature.
- The SIA performs its safe improvement cycle and prints a JSON result to stdout; the CoordinatorAgent captures this result and marks its interaction as complete.
- The SEA receives the result from the Coordinator, evaluates the outcome, and proceeds with the next step in its strategic plan (e.g., running validation or triggering a rollback).
- Once its entire campaign plan is complete, the SEA reports the final summary back to the MastermindAgent.
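The decoupled subprocess invocation at the heart of this workflow can be sketched as below. Only the script name, the asyncio subprocess mechanism, the semaphore, and the JSON-over-stdout contract come from the document; the CLI flags (`--target`, `--goal`) and error shape are assumptions.

```python
# Sketch of a coordinator invoking the SIA CLI as an isolated subprocess
# and parsing its JSON result from stdout.

import asyncio
import json
import sys

async def run_sia(target_file: str, goal: str, semaphore: asyncio.Semaphore):
    # The semaphore mirrors the Coordinator's heavy_task_semaphore: it caps
    # how many self-modification subprocesses run concurrently.
    async with semaphore:
        proc = await asyncio.create_subprocess_exec(
            sys.executable, "self_improve_agent.py",
            "--target", target_file, "--goal", goal,
            stdout=asyncio.subprocess.PIPE,
            stderr=asyncio.subprocess.PIPE,
        )
        stdout, stderr = await proc.communicate()
        if proc.returncode != 0:
            # A failed cycle is reported rather than raised, so the SEA can
            # decide whether to retry, roll back, or abandon the step.
            return {"status": "error", "detail": stderr.decode()}
        return json.loads(stdout.decode())  # SIA prints a JSON result to stdout
```

Because the SIA runs in its own process, a crash or runaway modification attempt cannot corrupt the Coordinator's state, which is the safety property the document highlights.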