AI-Powered Monitoring & Incident Response Platform
Paladin AI is an intelligent monitoring and incident response platform that integrates with existing monitoring infrastructure (Prometheus, Grafana, Loki, Alertmanager) to provide automated incident analysis, intelligent querying, and proactive system insights.
- π€ AI-Powered Analysis: Advanced workflow orchestration using LangGraph
- π Monitoring Integration: Native support for Prometheus, Loki, Grafana, and Alertmanager
- π¬ Multi-Interface Access: Web UI, CLI, and API interfaces
- π§ Intelligent Memory: Long-term context and learning capabilities
- π Document Processing: RAG-based document ingestion and retrieval
- π Session Persistence: Robust checkpoint and recovery system
- β‘ Real-time Processing: Live monitoring and instant incident response
graph TB
subgraph "User Interfaces"
WEB[Web Frontend<br/>Next.js]
CLI[CLI Client<br/>Python]
API[REST API<br/>FastAPI]
end
subgraph "Core Platform"
SERVER[Paladin Server<br/>LangGraph + FastAPI]
MEMORY[Memory System<br/>Mem0AI + Neo4j]
RAG[RAG Engine<br/>Qdrant + OpenAI]
end
subgraph "Data Layer"
MONGO[(MongoDB<br/>Checkpoints)]
VECTOR[(Qdrant<br/>Embeddings)]
GRAPH[(Neo4j<br/>Memory Graph)]
end
subgraph "Monitoring Stack"
PROM[Prometheus]
LOKI[Loki]
GRAFANA[Grafana]
ALERT[Alertmanager]
end
subgraph "External Services"
DISCORD[Discord Bot<br/>MCP Integration]
WEBHOOK[Webhook Server<br/>External Alerts]
end
WEB --> SERVER
CLI --> SERVER
API --> SERVER
SERVER --> MEMORY
SERVER --> RAG
SERVER --> MONGO
MEMORY --> GRAPH
RAG --> VECTOR
SERVER <--> PROM
SERVER <--> LOKI
SERVER <--> GRAFANA
SERVER <--> ALERT
DISCORD --> SERVER
WEBHOOK --> SERVER
- Python 3.13+ with UV package manager
- Node.js 18+ with npm/yarn
- Docker & Docker Compose (for infrastructure)
- MongoDB (for checkpoints)
- OpenAI API Key (for AI features)
# Clone the repository
git clone https://github.com/your-org/paladin-ai.git
cd paladin-ai
# Install all dependencies
make install-dev# Copy environment template
cp .env.example .env
# Configure required variables
export OPENAI_API_KEY="your-openai-api-key"
export MONGODB_URL="mongodb://localhost:27017"# Start the infrastructure stack
make infra-up
# Start the main server
make run-server
# In another terminal, start the frontend
make run-frontend- Web Interface: http://localhost:3000
- API Documentation: http://localhost:8000/docs
- CLI:
paladin-cli chat
| Documentation | Description |
|---|---|
| Installation Guide | Complete setup and configuration |
| Architecture | System design and components |
| API Reference | REST API documentation |
| CLI Guide | Command-line interface usage |
| Frontend Guide | Web interface documentation |
| Monitoring Integration | Prometheus, Loki, Grafana setup |
| Memory System | AI memory and learning capabilities |
| RAG System | Document processing and retrieval |
| Workflows | LangGraph workflow system |
| Discord Integration | Discord bot and MCP setup |
| Deployment | Production deployment guide |
| Troubleshooting | Common issues and solutions |
paladin-ai/
βββ π server/ # Python FastAPI backend
βββ π frontend/ # Next.js React frontend
βββ π cli/ # Python CLI client
βββ π webhook/ # Webhook server
βββ π mcp/ # Discord MCP integration
βββ π infra/ # Infrastructure configs
βββ π tests/ # Integration tests
βββ π docs/ # Documentation
# Development setup
make install-dev # Install all dependencies
make dev # Start full development environment
# Individual services
make run-server # Start FastAPI server
make run-frontend # Start Next.js frontend
make run-cli # Interactive CLI session
# Discord integration
make mcp-server # Start Discord MCP server
make mcp-worker # Start message worker
# Testing and quality
make test # Run all tests
make format # Format code
make lint # Run linting
make type-check # Type checking
# Infrastructure
make infra-up # Start infrastructure stack
make infra-down # Stop infrastructure stack
make infra-logs # View infrastructure logs- LangGraph Workflows: Complex AI reasoning chains
- Tool Integration: Monitoring system connectors
- Memory System: Long-term context and learning
- RAG Engine: Document processing and retrieval
- Session Management: MongoDB checkpointing
- Real-time Chat: WebSocket-based communication
- Session Management: Persistent conversation history
- Document Upload: Drag-and-drop file processing
- Dark/Light Themes: Modern responsive design
- Command Interface: CLI-style commands in web UI
- Interactive Chat: Rich terminal interface
- Command History: Persistent session management
- File Upload: Document processing from CLI
- Monitoring Queries: Direct monitoring system access
- Mock Environment: Complete testing stack
- Monitoring Stack: Prometheus, Grafana, Loki, Alertmanager
- Database Systems: MongoDB, PostgreSQL, Neo4j
- Caching Layer: Valkey (Redis-compatible)
# Web UI or CLI
"Investigate high CPU usage in production cluster"
# AI analyzes metrics, logs, and provides insights
# Suggests remediation actions
# Creates incident timeline# Set up intelligent alerts
"Monitor memory usage patterns and predict capacity issues"
# AI learns normal patterns
# Detects anomalies early
# Provides recommendations# Upload runbooks and documentation
"Search for kubernetes troubleshooting steps"
# RAG system finds relevant docs
# Contextual answers from knowledge base
# Learning from previous incidents# Discord channel monitoring
@paladin "Check status of payment service"
# Bot responds with real-time metrics
# Provides analysis and recommendations
# Escalates critical issues- API Key Management: Secure credential handling
- CORS Configuration: Controlled cross-origin requests
- Request Timeouts: Configurable timeout middleware
- Error Handling: Comprehensive error boundaries
- Session Security: Encrypted session management
- Fork the repository
- Create feature branch:
git checkout -b feature/amazing-feature - Install dev dependencies:
make install-dev - Make changes and test:
make test - Format code:
make format - Commit changes:
git commit -m 'Add amazing feature' - Push to branch:
git push origin feature/amazing-feature - Open Pull Request
- Code Quality: Use type hints and follow PEP 8
- Testing: Write tests for new features
- Documentation: Update docs for new functionality
- Commits: Use conventional commit messages
This project is licensed under Apache 2.0 with Commons Clause License Condition v1.0 - see the LICENSE file for details.
- Documentation: docs/
- Issues: GitHub Issues
- Discussions: GitHub Discussions
- Discord: Community Server
- LangChain & LangGraph: Workflow orchestration
- OpenAI: AI capabilities
- Mem0AI: Memory management
- FastAPI: High-performance API framework
- Next.js: Modern React framework
- Prometheus Ecosystem: Monitoring infrastructure
Built with β€οΈ for DevOps and SRE teams
Website β’ Documentation β’ Community