<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Kubernetes on Andrea Cervesato</title><link>https://cervesato.it/tags/kubernetes/</link><description>Recent content in Kubernetes on Andrea Cervesato</description><generator>Hugo</generator><language>en</language><copyright>Andrea Cervesato</copyright><lastBuildDate>Tue, 07 Apr 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://cervesato.it/tags/kubernetes/index.xml" rel="self" type="application/rss+xml"/><item><title>From Alerting to Inference: Metrics Never Stopped Mattering</title><link>https://cervesato.it/posts/metrics-never-stopped-mattering/</link><pubDate>Tue, 07 Apr 2026 00:00:00 +0000</pubDate><guid>https://cervesato.it/posts/metrics-never-stopped-mattering/</guid><description>&lt;p&gt;Your LLM is slow. Users are complaining. Queues are growing. Someone on the team is already profiling the model, looking at batch sizes, considering a bigger GPU.&lt;/p&gt;
&lt;p&gt;Nine times out of ten, the answer is already in the metrics.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;ve spent most of my career staring at metrics. Bare-metal servers, Kubernetes clusters, managed services on public cloud. And if there&amp;rsquo;s one thing I keep re-learning, it&amp;rsquo;s that the infrastructure is lying to you, and you&amp;rsquo;re not asking the right questions.&lt;/p&gt;
&lt;p&gt;This isn&amp;rsquo;t a new lesson. Same lesson, different domain.&lt;/p&gt;</description></item></channel></rss>