The Platform Engineering Playbook Podcast is where AI meets open-source infrastructure knowledge—and you're part of the editorial process. Every episode is researched, scripted, and produced with AI, then reviewed by the community and published on GitHub for anyone to improve. Facing tool sprawl across 130+ platforms? Justifying PaaS costs to your CFO? Navigating the Shadow AI crisis hitting 85% of organizations? We tackle the messy realities of platform engineering that most content avoids, delivering data-backed insights and decision frameworks you can use Monday morning. Built for senior engineers, SREs, and DevOps practitioners with 5+ years in production, we dissect cloud economics, AI governance, infrastructure trade-offs, and career strategy—with the receipts to back it up. Think we got something wrong? Have better data? Open a pull request at platformengineeringplaybook.com. This is infrastructure podcasting as a living document, where the community keeps us honest and the content gets better with every contribution.
Read the playbook at https://platformengineeringplaybook.com
| Episode | Date |
|---|---|
| The Hidden Kubernetes Tax Costing Teams $43,800 a Year | Mar 30, 2026 |
| Your Kubernetes Stack Is Why AI Isn’t Shipping | Mar 27, 2026 |
| AI Agents in Kubernetes Need Standards — Before Everything Breaks | Mar 26, 2026 |
| AI Agents Are About to Break Kubernetes — Unless We Standardize Now | Mar 25, 2026 |
| How to Monitor LLMs in Production Before They Drain Your Budget | Mar 24, 2026 |
| Helm Security Is Broken. WebAssembly Fixes It. | Mar 23, 2026 |
| The Kubernetes AI Pattern That Cuts GPU Costs | Mar 20, 2026 |
| You’re Monitoring the Wrong Kubernetes Metrics | Mar 19, 2026 |
| The AI Security Hole Your Red Team Is Missing | Mar 18, 2026 |
| Your Kubernetes Monitoring Is Blind to AI Attacks | Mar 17, 2026 |
| The 6 Types of AI Cloud Infrastructure | Mar 16, 2026 |
| Why AI Code Is Killing Your Monitoring Budget | Mar 13, 2026 |
| How Karpenter Fixes Kubernetes Autoscaling | Mar 12, 2026 |
| AI Is Not the Problem — Your Infrastructure Is | Mar 11, 2026 |
| Why Kubernetes Doesn’t Scale Without an IDP | Mar 10, 2026 |
| The AWS Cost That Doesn’t Show Up in Cost Explorer | Mar 09, 2026 |
| 87% of Ansible Playbooks Are Broken (AI Just Proved It) | Mar 06, 2026 |
| GrafanaCON 2026: The Agenda That Signals the Future of Observability | Mar 05, 2026 |
| Can AI Run Your Production Systems? | Mar 04, 2026 |
| Claude Went Down. The API Didn’t. Here’s Why. | Mar 03, 2026 |
| Backstage Is Becoming the Control Plane for Engineering | Mar 02, 2026 |
| The End of ingress-nginx: Kubernetes Migration Guide Before 2026 | Feb 27, 2026 |
| Claude Code Remote Control Changes Developer Workflows | Feb 26, 2026 |
| Databricks Lakebase vs Postgres: The AI Database Shift | Feb 25, 2026 |
| How to Secure AI Agents with MCP, OPA & Ephemeral Runners | Feb 24, 2026 |
| Cloudflare Takes Down the Internet Again — With a Config Change | Feb 23, 2026 |
| The Next Platform Engineer: AI + Observability + FinOps | Feb 20, 2026 |
| Ray + Kubernetes: The Production AI Stack Explained | Feb 19, 2026 |
| Replace 5 Databases with 1? SurrealDB for AI Agents Explained | Feb 18, 2026 |
| Agoda’s API Agent Turns Any API into MCP — No Code, No Deployments | Feb 17, 2026 |
| LocalStack Kills Community Edition: What Breaks in March | Feb 16, 2026 |
| OpenTofu vs Terraform: What Enterprise Teams Are Actually Doing (2026) | Feb 13, 2026 |
| Why Databases Inside Kubernetes Are Becoming Technical Debt | Feb 12, 2026 |
| 47% of CNCF Projects Slowed Down in 2025 — Why That’s Actually Good News | Feb 11, 2026 |
| The Claude Skills That Stop AI From Writing Dangerous Infrastructure as Code | Feb 10, 2026 |
| Docker vs Nix: Why Your Builds Aren’t Actually Reproducible | Feb 09, 2026 |
| The Data Canary Pattern: How Netflix Prevents Bad Metadata Deploys | Feb 07, 2026 |
| Claude Opus 4.6: The First AI That Feels Like a Teammate | Feb 06, 2026 |
| Autonomous AI in DevOps Is Here — And Most Teams Are Doing It Wrong | Feb 05, 2026 |
| Kubernetes Is Retiring Ingress NGINX (And 50% of Clusters Aren’t Ready) | Feb 04, 2026 |
| OpenAI’s New macOS App: Is Agentic Coding Finally Here? | Feb 03, 2026 |
| 98% of Container CVEs Are Hiding Where You’re Not Scanning | Feb 02, 2026 |
| Why Forward-Deployed Engineers Are Making $300K+ (And Why Companies Are Desperate for Them) | Jan 31, 2026 |
| AWS DevOps Agent in Production: What Most Teams Get Wrong | Jan 30, 2026 |
| AI Agents Are Rewriting the SRE Playbook (For Better or Worse) | Jan 29, 2026 |
| DevOps Is Dead — Platform Engineering Replaced It | Jan 28, 2026 |
| 47 Countries Went Offline — What Platform Engineers Must Learn From It | Jan 27, 2026 |
| Two Missing Characters Nearly Compromised AWS’s Supply Chain | Jan 26, 2026 |
| Kubernetes Just Became Essential for AI Growth (CNCF Report) | Jan 25, 2026 |
| ChatGPT Scales PostgreSQL to power 800 million users | Jan 24, 2026 |
| 3 Skills You Need to Transition to Platform Engineer | Jan 23, 2026 |
| The Infrastructure Monitoring Tools Teams Regret Choosing | Jan 22, 2026 |
| Your CI/CD Pipeline is a Debt Trap | Jan 21, 2026 |
| Kubernetes Just Revolutionized Learning — Get Ahead Now! | Jan 20, 2026 |
| How AWS's New Euro Cloud Changes Data Control Forever | Jan 19, 2026 |
| Why Pulumi's New Move Could Change Terraform Forever | Jan 18, 2026 |
| Astro Joins Cloudflare: What It Means for Platform Engineers | Jan 17, 2026 |
| ScyllaDB X Cloud Challenges DynamoDB Cost and Performance | Jan 16, 2026 |
| Invisible Linux Malware: The Undetectable Threat to Your Cloud Infrastructure | Jan 15, 2026 |
| The AI-Cloud Native Symbiosis - How Intelligent Infrastructure is Transforming Platform Engineering | Jan 14, 2026 |
| MIT 10 Breakthrough Technologies 2026 - The Platform Engineering Perspective | Jan 13, 2026 |
| AWS Route 53 Global Resolver - Enterprise DNS Security at the Edge | Jan 12, 2026 |
| Kubernetes Upcoming Features Deep Dive - Extended Toleration Operators and Mutable PV Node Affinity | Jan 11, 2026 |
| Why Is a 2016 AWS Instance Still the Best Value? (Cloudspecs Research) | Jan 10, 2026 |
| Iran IPv6 Blackout - When Governments Weaponize Protocol Transitions | Jan 09, 2026 |
| Venezuela BGP Anomaly - Deep Technical Analysis | Jan 08, 2026 |
| HolmesGPT: AI Root Cause Analysis for Kubernetes | Jan 08, 2026 |
| Docker Kanvas: Infrastructure as Design | Jan 07, 2026 |
| Remote MCP Architecture - Running AI Tool Servers on Kubernetes | Jan 06, 2026 |
| AWS DevOps Agent - Promises vs Reality | Jan 05, 2026 |
| AWS Graviton5: 192 Cores, 5x Cache - ARM Takes Over the Data Center | Jan 04, 2026 |
| Can OpenTelemetry Save Observability in 2026? | Jan 03, 2026 |
| When Serverless Fails: Unkey's 6x Performance Migration to Containers | Jan 02, 2026 |
| From Alert Fatigue to Signal-Driven Ops: The Observability Shift | Jan 01, 2026 |
| Security Ops Specialty: The Underrated Skill Every Platform Engineer Needs in 2026 | Dec 31, 2025 |
| Agentic AI Foundation - MCP and the Future of AI-Native Platform Engineering | Dec 30, 2025 |
| FinOps 2026 for Platform Engineers: The Complete Skills Guide | Dec 29, 2025 |
| Platform Engineering Salary Report 2026: Skills That Pay | Dec 28, 2025 |
| Platform Engineering 2026 Predictions Roundup (Platform Engineering 2026 Look Forward Series - Part 5/5) | Dec 27, 2025 |
| Kubernetes Enters the Boring Era (Platform Engineering 2026 Look Forward Series - Part 4/5) | Dec 26, 2025 |
| Developer Experience Metrics Beyond DORA (Platform Engineering 2026 Look Forward Series - Part 3/5) | Dec 24, 2025 |
| Platform Engineering Goes Mainstream in 2026 (Platform Engineering 2026 Look Forward Series - Part 2/5) | Dec 23, 2025 |
| Agentic AI Transforms Platform Operations in 2026 (Platform Engineering 2026 Look Forward Series - Part 1/5) | Dec 22, 2025 |
| CNPE (Certified Cloud Native Platform Engineer) Certification Study Guide | Dec 21, 2025 |
| Kubernetes 1.35 Timbernetes Deep Dive: Breaking Changes, In-Place Resize GA, Gang Scheduling | Dec 20, 2025 |
| Terraform Stacks + Native Monorepo Support: HashiCorp's Answer to IaC Complexity | Dec 20, 2025 |
| 95% Fewer CVEs, $0 Cost: Docker Just Open-Sourced Enterprise Security | Dec 19, 2025 |
| Kubernetes 1.35 "Timbernetes" - The End of the Pod Restart Era | Dec 18, 2025 |
| 40,000x Fewer Deployment Failures: How Netflix Adopted Temporal | Dec 17, 2025 |
| Kubernetes: Helm vs Crossplane vs kro (Honest Comparison) | Dec 16, 2025 |
| Platform Engineering 2025 Year in Review | Dec 15, 2025 |
| Okta's GitOps Journey - Scaling ArgoCD from 12 to 1,000 Clusters | Dec 14, 2025 |
| Platform Engineering Team Structures That Work | Dec 13, 2025 |
| CDKTF Deprecated - The End of HashiCorp's Programmatic IaC Experiment | Dec 12, 2025 |
| stern v1.33.1 - Listen to the Docs with AudioDocs | Dec 11, 2025 |
| CoreDNS v1.13.1 - Listen to the Docs with AudioDocs | Dec 11, 2025 |
| kubectx & kubens v0.9.5 - Listen to the Docs with AudioDocs | Dec 11, 2025 |
| AWS re:Invent 2025 Recap 4/4 - Data & AI Wrap-Up | Dec 11, 2025 |
| AWS re:Invent 2025 Recap Part 3/4 - EKS & Cloud Operations | Dec 10, 2025 |
| AWS re:Invent 2025 Part 2/4 - Infrastructure & Developer Experience | Dec 09, 2025 |