CTL AI Watchdog Agent
Kubernetes-style cluster control agent with AI-driven service restart, log inspection, volume management, and real-time topology visualization.
Role
Architect & Lead Engineer
Timeline
2025
Type
DevOps
Summary
A CTL watchdog agent that monitors a cluster of services in real time, visualizes the full topology as a live graph, and embeds an AI agent that can interpret logs, diagnose degraded services, trigger restarts, and manage volumes — all from a single control surface.
The Hard Part
Designing a reliable AI agent loop that can safely act on live infrastructure — deciding when to restart a service, when to escalate, and when to wait — without creating cascading failures or acting on stale state.
Visual Proof
Protected enterprise system. Screenshots and architecture are sanitized. Sensitive implementation details, credentials, and internal endpoints are omitted for security.

Topology Overview
Full cluster topology visualized as a live graph — services, dependencies, and health status at a glance.

Circle Layout — AI Mode
Circular topology layout with AI-highlighted degraded services and suggested remediation paths.

Force Graph — AI Mode
Force-directed graph layout showing service cluster groupings and inter-service dependencies.

Tree View — AI Mode
Hierarchical tree view of services with AI annotations on degraded nodes and restart recommendations.
Full case study in progress.
Interested?
Let's talk about what I can build for you.
Open to senior full-stack roles, founding engineer positions, and contract engagements. Remote only.