139+ production issues solved
Debug faster.
Ship confidently.
Your production incident handbook. Search symptoms, get root causes, and apply fixes across Docker, Kubernetes, AWS, and more.
139
Issues
48
Tools
7
Categories
100%
Tested
Trending
Popular Issues
Container Orchestration
Pods stuck in CrashLoopBackOff after deployment
Pod status shows CrashLoopBackOff in kubectl get pods
CriticalKubernetes
Containers
Docker builds failing with 'no space left on device'
docker build fails with 'write /var/lib/docker/tmp: no space left on device'
CriticalDocker
Cloud
Application hit RDS max_connections during traffic spike
Application logs show 'FATAL: too many connections for role app'
CriticalAWS
Cloud
EKS worker nodes stuck in NotReady state after cluster upgrade
kubectl get nodes shows nodes in NotReady status
CriticalAWS
Container Orchestration
ImagePullBackOff when rolling out a new container image
Pods stay in ImagePullBackOff or ErrImagePull
WarningKubernetes
Cloud
CloudFront returning 504 Gateway Timeout from origin
Users intermittently receive 504 errors from CloudFront URLs
CriticalAWS
Explore
Browse by Tool
Kubernetes25 issuesAWS22 issuesDocker18 issuesTerraform17 issuesNetworking4 issuesNginx3 issuesElasticsearch3 issuesPrometheus3 issuesJenkins2 issuesGitHub Actions2 issuesHelm2 issuesGitLab CI2 issuesLoad Balancer1 issuesSSL/TLS1 issuesDNS1 issuesGrafana1 issuesKibana1 issuesArgoCD1 issuesNetwork1 issuesFirewall1 issuesJFrog Artifactory1 issuesHAProxy1 issuesLogstash1 issuesDocker Compose1 issuesVPN1 issuesCircleCI1 issuesCoreDNS1 issuesAlertManager1 issuesProxy1 issuesFilebeat1 issuesGrafana Loki1 issuesTraefik1 issuesAzure DevOps1 issuesFluentd1 issuesOpenTelemetry1 issuesThanos1 issuesAnsible1 issuesJaeger1 issuesInfluxDB1 issuesBitbucket Pipelines1 issuesTCP1 issuesVector1 issuesSonarQube1 issuesGrafana Tempo1 issuesTeamCity1 issuesTelegraf1 issuesTravis CI1 issuesZipkin1 issues
How it works
From symptom to fix in seconds
01
Spot the symptom
Recognize the error from your logs, metrics, or alerts
02
Diagnose the cause
Get the root cause analysis and diagnosis commands
03
Apply the fix
Copy-paste the exact commands to resolve the issue
04
Prevent recurrence
Implement safeguards and monitoring to stay safe