Boost productivity by using AI in cloud operational health management - Link to blog post
- Autonomous Event Processing: AI-powered virtual operator automatically acknowledges, triages, and creates tickets for AWS Health and Security Hub events following customizable policies
- Headless Operation: AI agent can run independently without user interfaces, processing events and taking actions automatically
- Intelligent Noise Filtering: Filters operational events based on contextualized organizational policies to reduce alert fatigue
- Multi-Source Integration: Processes operational events from AWS Health, Security Hub, and user-reported incidents through a unified workflow that can be extended with more event sources
- Auditable Actions: All AI decisions and actions are logged to S3 with full traceability and compliance reporting
- Modern AI Stack: Powered by Amazon Nova and Claude 3.7 Sonnet with prompt caching for optimized performance
- Event-Driven Architecture: Highly decoupled microservices that can be configured independently based on your needs
- Refactored agents to use Strands Agents for cleaner code and better robustness
- Flexible Model Selection: Multi-LLM support with automatic fallback for enhanced reliability
- Added standalone web chat option for easy trial and demo use without Slack requirements
- Replaced the RAG-based research agent implementation with an MCP-based one, reducing the dependency on, and cost of, a self-maintained knowledge base
- Auditable agent action reports stored in an S3 bucket
- Optimized agent long-term memory and knowledge retrieval
- Support for user-reported free-form operational events as an event source
- Support for prompt caching to improve token usage efficiency
- Old version archived to 'legacy' branch
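
The automatic model fallback mentioned in the list above can be pictured roughly as follows. This is a minimal sketch, not OHERO's actual implementation: `invoke_with_fallback`, `ModelFn`, and the two stand-in models are hypothetical names, and no real LLM is called.

```python
from typing import Callable, Sequence

# Hypothetical type: a "model" is any callable that takes a prompt and returns text.
ModelFn = Callable[[str], str]


def invoke_with_fallback(
    prompt: str, models: Sequence[tuple[str, ModelFn]]
) -> tuple[str, str]:
    """Try each (name, model) pair in order; return (name, reply) from the first success."""
    errors: list[str] = []
    for name, model in models:
        try:
            return name, model(prompt)
        except Exception as exc:  # a real system would catch narrower error types
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all models failed: " + "; ".join(errors))


# Stand-in models for illustration only (assumptions, not real model clients).
def primary(prompt: str) -> str:
    raise TimeoutError("throttled")


def secondary(prompt: str) -> str:
    return f"summary of: {prompt}"


used, reply = invoke_with_fallback(
    "EC2 retirement notice",
    [("primary", primary), ("secondary", secondary)],
)
print(used, "->", reply)
```

Keeping the ordered model list as plain data means the preferred model and its fallbacks could be changed through configuration without touching the calling code.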
OHERO uses a highly decoupled, event-driven microservices architecture that enables incremental deployment and selective feature activation.
The OheroACT Framework defines how OHERO's AI agent processes operational events through three customizable stages: Acknowledge, Consult, and Triage. Each stage has rules, actions, and response formats that you can configure to your organization's needs.
flowchart TD
Start([User Query]) --> CheckState{Check: Query is asking to handle or report an operational event?}
CheckState -->|No| Consult[Action: execute Consult]
CheckState -->|Yes| Acknowledge[Action: execute Acknowledge]
Consult --> End([End])
Acknowledge --> CheckPhase{Check: proceed to Triage?}
CheckPhase -->|Yes| Triage[Action: execute Triage]
CheckPhase -->|No| FinalResponse[Action: Stop and Respond to user]
Triage --> SynthesizeFinal[Action: synthesize final response]
FinalResponse --> SynthesizeFinal
SynthesizeFinal --> End
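
The control flow in the flowchart above can be sketched in a few lines of Python. This is an illustrative sketch only: `run_oheroact`, `Outcome`, and the two boolean inputs are hypothetical. In OHERO the two checks are presumably made by the agent itself rather than passed in as flags.

```python
from dataclasses import dataclass, field


@dataclass
class Outcome:
    stages: list[str] = field(default_factory=list)  # stages executed, in order
    response: str = ""


def run_oheroact(query: str, is_event: bool, needs_triage: bool) -> Outcome:
    """Route a query through Consult, or through Acknowledge and optionally Triage.

    is_event     stands in for "Query is asking to handle or report an operational event?"
    needs_triage stands in for "proceed to Triage?"
    """
    out = Outcome()
    if not is_event:
        # Non-event queries go to Consult and then straight to End.
        out.stages.append("Consult")
        out.response = f"Consulted on: {query}"
        return out

    out.stages.append("Acknowledge")
    if needs_triage:
        out.stages.append("Triage")

    # Both the Triage branch and the stop-and-respond branch end in one synthesized response.
    out.response = f"Handled event via {' -> '.join(out.stages)}: {query}"
    return out


print(run_oheroact("What is AWS Health?", is_event=False, needs_triage=False).stages)
print(run_oheroact("RDS maintenance scheduled", is_event=True, needs_triage=True).stages)
```

Recording the executed stages in order mirrors the auditable-actions idea: the path taken for each query can be logged alongside the final response.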
- Check the Deployment Guide
For questions and support:
- Check the FAQ
- Review Event-Driven Architecture documentation
- Open an issue in this repository

