Latest Blog Posts

Latest Company News Cost Optimization Cost Optimization DevOps EKS Goldilocks Kubernetes Network Platform Engineering Uncategorized VPA

Featured

Reducing GPU Cold Start Times in Kubernetes: Patterns and Solutions

Nic Vermandé Mar 9, 2026

Three years ago, GPU infrastructure conversations centered on training. Organizations debated cluster sizes for model development, negotiated cloud quotas for r...

HPA’s Three Architectural Flaws (And Why Your Autoscaling Keeps Failing)

Nic Vermandé Mar 3, 2026

The Promise vs. Reality of HPA HPA is the most deployed autoscaler in Kubernetes...

Why Spark on Kubernetes Breaks in Production

Konstantin Zelmanovich Feb 5, 2026

If you’re running Spark on Kubernetes, the production symptoms are familiar: exe...

Why Pod Rightsizing Fails in Production: A Deep Dive into VPA and What Actually Works

Nic Vermandé Jan 28, 2026

The Cost of Stagnation Kubernetes has evolved through three eras: survival (get ...

GKE Cost Optimization: How to Cut Kubernetes Spend at Scale in 2026

Rob Croteau Jan 8, 2026

Google Kubernetes Engine (GKE) is the default Kubernetes platform for many produ...

AKS Pricing Explained: 10 Best Practices to Cut Kubernetes Costs on Azure

Konstantin Zelmanovich Jan 1, 2026

Azure Kubernetes Service (AKS) removes much of the operational heavy lifting of ...

AI Infra for Production: Why GPU Resource Management in Kubernetes Demands a New Approach

Nic Vermandé Dec 30, 2025

Kubernetes was never designed for the realities of real-time, production inferen...

What Is Amazon EKS Cost Optimization? (And How to Actually Do It)

Daniel Kleinstein Dec 25, 2025

Amazon Elastic Kubernetes Service (EKS) cost optimization is the process of mini...

GKE Workload Optimization: 9 Best Practices for Performance, Reliability, and Cost

Rob Croteau Dec 21, 2025

GKE Workload Optimization: 9 Best Practices for Performance and Cost Google Kube...

Kubernetes v1.35 Deep Dive: In-Place Resize GA, Gang Scheduling & the Cgroup v2 Cliff

Nic Vermandé Dec 17, 2025

Kubernetes 1.34 gave us the building blocks: DRA went GA, PSI metrics landed in ...

1 2 3 4 5 6 7 8