AI workflows that survive contact with production
Battle-tested prompts, free in-browser tools for incident triage and alert rules, and deep guides for the stack you actually run — Linux, OpenStack, Kubernetes, Terraform, Prometheus. Safety and back-out steps baked into every prompt.
Free, no signup to start · For Linux admins, SREs & platform teams
- Linux
- OpenStack
- Kubernetes
- GitLab CI
- Prometheus
- Terraform
- Grafana
Download the Free 500-Prompt DevOps AI Toolkit
500 battle-tested, copy-paste AI prompts engineered by a senior systems engineer — every one with fill-in placeholders and safety/back-out notes. Drop your email and it's yours.
- 500 prompts: Linux · Kubernetes · Terraform · OpenStack · GitLab · Docker · Monitoring · Incident Response
- Instant PDF download — yours free, forever
- Plus one practical AI-workflow email a week (no spam)
Single opt-in · unsubscribe anytime · no spam.
From "what's broken?" to a safe next step
Pick your task
Incident triage, a Prometheus alert rule, a Terraform plan review, a stuck OpenStack VM — find the prompt or free tool that matches.
Fill in your specifics
Every prompt ships with fill-in placeholders, a real worked example, and safety + back-out notes. Paste into Claude, ChatGPT, or Cursor.
Save, export & run it
Upgrade to Pro for unlimited dashboard runs, saved work across devices, and export to YAML / JSON / PDF, or use the free library forever.
Featured categories
Pick a stack. Get prompts, guides, and reviews tuned for it.
- AI for Linux Admins
Diagnose, automate, and harden Linux servers using AI assistants. Ubuntu, RHEL, Debian, Rocky.
- AI for OpenStack
Troubleshoot Nova, Neutron, Cinder, RabbitMQ, and Keystone with AI-assisted workflows.
- AI for Prometheus & Monitoring
Write better alert rules, PromQL queries, and Grafana dashboards with AI.
- AI for GitLab CI/CD
Debug pipelines, generate jobs, and review .gitlab-ci.yml with AI.
- AI for Bash & Python Automation
Generate, review, and harden automation scripts. Idempotent, safe, production-ready.
- AI for Incident Response
Faster RCAs, postmortems, runbooks, and on-call workflows powered by AI.
- AI for Kubernetes & Helm
Troubleshoot clusters, review manifests, generate Helm charts, debug pods, and harden Kubernetes workloads with AI-assisted workflows.
- AI for Infrastructure as Code
Generate, review, refactor, and secure Ansible, Helm, and cloud infrastructure code with AI.
- AI for Terraform
Design state, modules, providers, and workflows. Plan reviews, drift detection, large-state refactors, and policy-as-code with AI.
- AI for DevOps Security & Hardening
Use AI to review infrastructure security, harden Linux servers, detect risky commands, audit CI/CD pipelines, and improve production safety.
- AI for Slack
Build smarter Slack workflows: ChatOps bots, alert routing, incident channels, on-call handoffs, message summarization, and webhook security.
- AI for Microsoft Teams
AI-powered Teams workflows: adaptive cards, webhook routing, Bot Framework ChatOps, Power Automate flows, meeting transcripts to postmortems, Graph API automation.
- AI for Automation
Automate runbooks, toil, and event-driven workflows with AI: intelligent runbook selection, self-healing, ChatOps automation, and orchestration across your stack.
- AI for Ansible
Write, refactor, and debug Ansible playbooks, roles, and inventories with AI: idempotent tasks, Jinja2 templates, Vault secrets, and safe rolling changes.
- AI for NGINX
Configure, debug, and harden NGINX with AI: reverse proxy, TLS, rate limiting, caching, location-block precedence, and performance tuning.
- AI for Postgres
Tune, debug, and design PostgreSQL with AI: slow queries and EXPLAIN plans, indexing, vacuum/bloat, replication, and safe schema migrations.
- AI for MySQL
Optimize and troubleshoot MySQL and MariaDB with AI: query tuning, InnoDB internals, indexing, replication, deadlocks, and zero-downtime migrations.
- AI for RabbitMQ
Design and debug RabbitMQ with AI: exchanges and routing, queue backpressure, dead-lettering, clustering and quorum queues, and consumer reliability.
- Reduce MTTR with AI
Cut mean time to resolution with AI: faster detection and triage, alert correlation, instant runbooks, quicker root-cause analysis, and tighter postmortem-to-fix loops.
- Post Mortems with AI
Write better postmortems with AI: draft timelines from chat and alerts, keep the language blameless, surface contributing factors, and turn findings into action items that ship.
- AWS with AI
Build, debug, and secure AWS with AI: IAM and least privilege, VPC and networking, EC2/ECS/EKS, Lambda, S3, CloudFormation and CDK, and cost control.
- Azure with AI
Design, troubleshoot, and harden Azure with AI: RBAC and Entra ID, VNets and NSGs, AKS, App Service, Functions, Bicep and ARM, and cost management.
- GCP with AI
Operate and secure Google Cloud with AI: IAM, VPC and firewall rules, GKE, Cloud Run, Cloud Functions, Terraform, and billing and cost optimization.
- Docker with AI
Build, debug, and harden Docker with AI: Dockerfiles, image builds, registries, networking, volumes, the daemon, and container runtime errors.
- AI for Kafka
Operate and debug Apache Kafka with AI: brokers and controllers, partitions and ISR, producers and consumers, KRaft and ZooKeeper, rebalances, retention, and throughput tuning.
Start here — the most useful reads
- 1. How AI Reduces DevOps Incident Response Time (MTTR Guide) · Reduce MTTR with AI · 16 min read
- 2. The Most Common Linux Server Problems (and How to Fix Them) · AI for Linux Admins · 18 min read
- 3. How to Use AI to Troubleshoot Kubernetes Clusters Faster · AI for Kubernetes & Helm · 16 min read
- 4. The Best Way to Learn Terraform for Real Infrastructure · AI for Terraform · 18 min read
- 5. How AI Helps DevOps Engineers Write Better Terraform Code · AI for Terraform · 15 min read
- 6. Top 25 GitLab CI/CD Pipeline Mistakes (and How to Avoid Them) · AI for GitLab CI/CD · 20 min read
- 7. How to Build a Production-Ready OpenStack Cloud (2026 Guide) · AI for OpenStack · 20 min read
- 8. The Best AI Prompts for Linux System Administrators · AI for Linux Admins · 16 min read
- 9. How DevOps Teams Use AI to Reduce Cloud Costs (FinOps) · AI for Automation · 16 min read
- 10. What Does a Senior DevOps Engineer Do Every Day? · AI for Automation · 15 min read
The prompts engineers reach for most
- 1. Prometheus Alert Rule Generator · Prometheus & Monitoring · Intermediate
- 2. Kubernetes Node NotReady Diagnosis · Kubernetes & Helm · Advanced
- 3. Linux Host Network Connectivity Debug · Linux Admins · Intermediate
- 4. Dockerfile Security Review · DevOps Security & Hardening · Beginner
- 5. Terraform Remote Backend Migration · Terraform · Advanced
- 6. Ansible Playbook Generator · Bash & Python Automation · Intermediate
- 7. GitLab CI/CD `rules:` Debugging · GitLab CI/CD · Intermediate
- 8. NGINX 502/504 Bad Gateway Triage · NGINX · Intermediate
- 9. Alert-Storm Correlation and Triage · Incident Response · Beginner
- 10. Nova Instance Stuck-State Recovery · OpenStack · Intermediate
- 11. Postgres Slow Query EXPLAIN Triage · Postgres · Intermediate
- 12. MySQL Slow Query Log + EXPLAIN Tuning · MySQL · Intermediate
- 13. RabbitMQ Queue Investigation · RabbitMQ · Advanced
Browse all prompts
Featured AI prompts for cloud engineers
- AI for Infrastructure as Code · Intermediate
Ansible Vault Secrets Management Prompt
Use Ansible Vault — encrypt secrets, vault IDs, multi-vault setups, integration with external secret managers.
- Claude
- ChatGPT
Open prompt
- AI for GitLab CI/CD · Intermediate
GitLab CI/CD → Kubernetes Deploy Patterns Prompt
Design GitLab CI/CD pipelines that deploy to Kubernetes — kubectl vs Helm vs Kustomize, secrets handling, multi-environment promotion, GitOps comparison.
- Claude
- ChatGPT
Open prompt
- AI for GitLab CI/CD · Intermediate
GitLab CI/CD Pipeline Optimization Prompt
Speed up slow GitLab pipelines — DAG with `needs:`, cache vs artifacts, parallel jobs, image pre-builds, dependency proxy, and shallow clones.
- Claude
- ChatGPT
Open prompt
- AI for Prometheus & Monitoring · Advanced
Grafana Loki + Prometheus Correlation Prompt
Correlate metrics and logs in Grafana — exemplars from Prometheus to traces, derived fields from Loki, jump from spike to log line.
- Claude
- ChatGPT
Open prompt
- AI for Kubernetes & Helm · Intermediate
Helm Chart Review Prompt
Get a senior-engineer review of a Helm chart — values hygiene, template correctness, security defaults, upgrade safety.
- Claude
- ChatGPT
- Cursor
Open prompt
- AI for Infrastructure as Code · Intermediate
Infrastructure as Code Security Review Prompt
AI security review of Terraform, CloudFormation, or Helm charts: surface dangerous defaults, missing encryption, overly permissive IAM, and exposed services.
- Claude
- ChatGPT
Open prompt
DevOps & AI guides
- AI for Automation · 11 min read
Rollback Strategy in DevOps: A 2026 Practical Guide
Discover the role of a rollback strategy in DevOps. Enhance your deployment process, reduce recovery time, and improve project stability.
Read guide
- AI for Automation · 10 min read
The Role of Scheduler Kubernetes: 2026 Deep Dive
Explore the role of the Kubernetes scheduler in optimizing pod assignments, improving resource management, and resolving `Pending` states.
Read guide
- AI for Kafka · 10 min read
AI-Assisted Kafka Troubleshooting Explained
How AI-assisted Kafka troubleshooting works — diagnosing broker faults, consumer lag, rebalance storms, and ISR shrink faster, with the governance to run it safely.
Read guide
- AI for Automation · 11 min read
ChatGPT DevOps Workflow Integration: A Practical Guide
Discover how ChatGPT DevOps workflow integration can cut repetitive tasks by 70% and speed up production. Start automating today!
Read guide
- AI for Kafka · 11 min read
Debugging Kafka Consumer Lag with AI
Measure Kafka consumer lag correctly, find the real root cause with AI-assisted analysis, and apply durable fixes — from poison messages to under-provisioned groups.
Read guide
AI tools we actually use
- ChatGPT
by OpenAI
4.6 · The broadest AI ecosystem with deep plugin support and the largest user community.
Best for: Ansible/Terraform generation, fast scaffolding, plugin-heavy workflows
Pricing: Free tier; Plus $20/mo; Team & Enterprise tiers
Read review
- Claude
by Anthropic
4.8 · The most cautious and context-aware AI assistant for infrastructure work.
Best for: Production troubleshooting, postmortems, IaC review
Pricing: Free tier; Pro $20/mo; Team & Enterprise tiers
Read review
- Cursor
by Anysphere
4.7 · The AI-first code editor that understands your whole repo.
Best for: Editing real IaC repos: Helm charts, Terraform modules, K8s operators
Pricing: Free tier (limited); Pro $20/mo; Business $40/seat/mo
Read review
- Amazon Q Developer
by Amazon Web Services
4.3 · AWS's AI assistant for building and operating on AWS: IaC, CLI, and resource Q&A grounded in your account.
Best for: Building & operating on AWS: CloudFormation/CDK/Terraform, CLI, and AWS resource troubleshooting
Pricing: Free tier (generous); Pro $19/user/mo
Read review
Senior infrastructure audits, fixed price
Get a senior set of eyes on your stack — fixed-price OpenStack/Kolla-Ansible, Kubernetes, Terraform, and observability audits from $250. Rare private-cloud expertise most consultants simply don't have.
James Joyner IV
Sr. Systems Software Engineer · San Jose, CA
I build and run large-scale, widely distributed Linux systems — AWS, CentOS/Ubuntu, and private cloud on OpenStack with Kolla-Ansible — and I live in the observability and on-call that comes with them (Prometheus, VictoriaMetrics, Grafana). I started DevOps AI ToolKit to share the AI workflows, prompts, and runbooks that actually survive contact with production.