platform-tab.md · 14.5 KB

Platform Tab: Enterprise SRE Dashboard

Overview

The Platform Tab provides a comprehensive enterprise-grade dashboard for monitoring and managing the mindX autonomous intelligence platform, featuring advanced SRE metrics, DevOps excellence tracking, and real-time system observability.

Status: ✅ DEPLOYED & OPERATIONAL Metrics: 50+ real-time KPIs across 6 dashboard sections Performance: Sub-second refresh rates with grid-optimized layout Compliance: Enterprise SRE standards with DORA metrics tracking


🔍 mindX Accuracy Audit (Display vs Reality)

The Platform tab must reflect what mindX actually is and what the backend exposes. The following is the source of truth for implementation.

What mindX Actually Uses (Real Data Sources)

AreaReal BackendEndpointsNotes
HealthFastAPI backendGET /health, GET /system/statusstatus, components (llm_provider, mistral_api, agint, coordinator)
AgentsCommand handler / registryGET /agents, GET /agents/, GET /registry/agentsRegistered + file-based agents
Inbound APIInboundMetrics middlewareGET /api/monitoring/inboundtotal_requests, requests_per_minute, average_latency_ms, latency_p50/p90/p99_ms, rate_limit_rejects
System resourcespsutilGET /system/resources, GET /system/metricsCPU, memory, disk; optional mindterm
Rate limitsRate limit dashboardGET /monitoring/rate-limitsRate limit and circuit breaker status
Toolstools/ folderGET /toolstools_count, tools list
GitHubGitHub agentGET /github/status, GET /github/scheduleBackup status, schedule
Memorydata/memory/ STM/LTMNo vector-count API in main serviceMemory vectors: show "—" or add endpoint later
Ollama/LLMmindXagent / startupGET /mindxagent/ollama/statusOllama connection, models

What mindX Does NOT Use (Do Not Display as Current)

Display Rules

  1. Platform Header Metrics: Populate from /health, /agents, /api/monitoring/inbound. Memory Vectors = "—" until an endpoint exists.
  2. Topology: Use /agents or /agents/; map agents to orchestration/core/specialized per AGENTS.md.
  3. Backend & LLM Status: Replace generic "Observability & Service Mesh" with Backend health, System components, Inbound metrics, Rate limits, Ollama status.
  4. Request flow: Show mindX flow: Client → FastAPI → Coordinator → Agents → LLM (Ollama/API).
  5. SRE/DevOps cards: Show "—" or "N/A" for metrics not provided by API; fill from /system/metrics, /api/monitoring/inbound where applicable.
  6. Metadata: Avoid "Multi-Cloud Global" unless multi-region is true; use "Single instance" or "Local" for default deployment.

🎯 Dashboard Sections

1. Platform Header Metrics

Location: Top section with KPI cards Refresh Rate: Real-time (1-second intervals) Metrics Displayed:

2. Topology Visualization

Location: Left column, center section Technology: Interactive SVG-based network graph Features:

3. SRE Metrics Dashboard

Location: Right column, top section Standards: Google SRE Handbook compliance Key Metrics:

Service Level Objectives (SLOs)

Service Level Indicators (SLIs)

Error Budget Management

4. Performance Engineering

Location: Right column, center section Focus: System performance optimization

Latency Analysis

Throughput Metrics

Scalability Indicators

5. DevOps Excellence

Location: Bottom left section Framework: DORA (DevOps Research and Assessment) metrics

Deployment Frequency

Change Failure Rate

Lead Time for Changes

6. Infrastructure & Operations

Location: Bottom right section Focus: Infrastructure as Code and operational excellence

Infrastructure as Code (IaC)

GitOps Metrics

Chaos Engineering


🔧 Technical Implementation

Frontend Architecture

Component Structure

class PlatformTab extends TabComponent {
    constructor(config) {
        super({
            id: 'platform',
            label: 'Platform',
            refreshInterval: 5000, // 5-second updates
            autoRefresh: true
        });
    }
}

Data Integration

// Data expressions for real-time metrics
window.dataExpressions.registerExpression('platform_topology', {
    endpoints: [
        { url: '/monitoring/topology', key: 'topology' },
        { url: '/monitoring/health', key: 'health' }
    ],
    transform: (data) => this.transformTopologyData(data),
    onUpdate: (data) => this.updateTopologyVisualization(data)
});

Backend Endpoints

Health Monitoring

GET /monitoring/health
Response: {
    "status": "healthy",
    "uptime": "99.95%",
    "services": {...},
    "agents": {...}
}

Performance Metrics

GET /monitoring/performance
Response: {
    "sre_metrics": {...},
    "latency": {...},
    "throughput": {...}
}

SRE Compliance

GET /monitoring/sre/compliance
Response: {
    "slos": [...],
    "slis": [...],
    "error_budget": {...}
}

📊 Real-Time Updates

Refresh Intervals

Data Flow Architecture

API Endpoints → Data Expressions → Transform Functions → UI Components
      ↓              ↓              ↓              ↓
Real-time Data → Caching Layer → State Management → Visual Updates

Performance Optimization


🎨 User Experience

Visual Design

- 🟢 Green: Healthy/Optimal - 🟡 Yellow: Warning/Degraded - 🔴 Red: Critical/Error - 🔵 Blue: Information/Neutral

Interaction Features

Accessibility


🔒 Security & Compliance

Data Protection

Compliance Features


📈 Performance Benchmarks

Load Testing Results

Scalability Metrics


🚨 Alerting & Monitoring

Built-in Alerts

Integration Capabilities


🔧 Configuration

Dashboard Customization

{
    "platform": {
        "refresh_intervals": {
            "health": 1000,
            "performance": 5000,
            "topology": 10000,
            "sre": 30000
        },
        "alert_thresholds": {
            "error_rate": 0.05,
            "latency_p95": 2000,
            "uptime": 99.9
        },
        "display_options": {
            "theme": "cyberpunk",
            "grid_layout": true,
            "compact_mode": true
        }
    }
}

Environment Variables

# Dashboard configuration
export MINDX_PLATFORM_REFRESH_INTERVAL=5000
export MINDX_PLATFORM_MAX_METRICS=10000
export MINDX_PLATFORM_CACHE_TIMEOUT=300000

Alert configuration

export MINDX_PLATFORM_ALERT_EMAIL="admin@mindx.ai" export MINDX_PLATFORM_SLACK_WEBHOOK="https://hooks.slack.com/..."

🐛 Troubleshooting

Common Issues

Slow Dashboard Loading

# Check backend performance
curl http://localhost:8000/monitoring/health

Verify database connectivity

python -c "import psycopg2; psycopg2.connect('...')"

Check Ollama server status

curl http://10.0.0.155:18080/api/tags

Missing Metrics

# Verify monitoring agents are running
ps aux | grep resource_monitor

Check metric collection logs

tail -f logs/monitoring.log

Restart monitoring services

systemctl restart mindx-monitoring

UI Rendering Issues

# Clear browser cache

Check browser console for JavaScript errors

Verify API endpoints are accessible

curl http://localhost:8000/api/rage/stats

📚 Related Documentation


🎯 Future Enhancements

Planned Features

Research Areas


The Platform Tab represents enterprise-grade observability for autonomous AI systems, providing the monitoring and insights necessary for reliable, scalable, and self-improving intelligence platforms.


Referenced in this document
AGENTSperformance_monitorpgvectorscale_memory_integrationresource_monitor

All DocumentsDocument IndexThe Book of mindXImprovement JournalAPI Reference