Neeraja K.
  • Thinking
  • Workflows
  • Insights
  • Picks
  • Resources
  • SRE Intel ✦
Resume ↗

Resources

SRE / DevOps / Platform engineering reading list. Curated from production experience — not a scrape of Awesome lists.

Updated 2026-04-26This week's fresh picks
- Modernizing KYC with AWS serverless solutions and agentic AI for financial services - PACIFIC enables multi-tenant, sovereign product carbon footprint exchange on the Catena-X data space using AWS - Real-time analytics: Oldcastle integrates Infor with Amazon Aurora and Amazon Quick Sight - Build a multi-tenant configuration system with tagged storage patterns - Unlock efficient model deployment: Simplified Inference Operator setup on Amazon SageMaker HyperPod - Automate safety monitoring with computer vision and generative AI - From Ingress NGINX to Higress: migrating 60+ resources in 30 minutes with AI - Auto-diagnosing Kubernetes alerts with HolmesGPT and CNCF tools

Kubernetes & Platform

Kubernetes Production Best Practices
learnk8s.io ↗
EKS Best Practices Guide
AWS ↗
CNCF Landscape
CNCF ↗
Kubernetes Failure Stories
k8s.af ↗

Observability

Prometheus Operator Docs
prometheus-operator.dev ↗
Grafana Best Practices
Grafana ↗
OpenTelemetry Getting Started
OTel ↗
Google SRE Workbook, SLOs
Google ↗

Infrastructure as Code

Terraform Best Practices
terraform-best-practices.com ↗
HashiCorp Learn
HashiCorp ↗
Atlantis, Terraform Pull Request Automation
runatlantis.io ↗

Streaming & Kafka

Confluent Kafka Tutorials
Confluent ↗
Kafka: The Definitive Guide (free)
Confluent ↗
Strimzi, Kafka on Kubernetes
strimzi.io ↗

SRE & Reliability

Google SRE Book (free)
Google ↗
Incident.io, Incident Management Guide
incident.io ↗
The On-Call Handbook
GitHub ↗
DORA Metrics, DevOps Research
dora.dev ↗

Weekly commentary: Engagement Picks → · OSS tracking: Open Source Picks →

© 2025 Neeraja KhanapureMade with ☀️ · SRE · Platform · DevOps