Update blog

cmdr2 · cmdr2 · commit 7007a49fac6c · 2025-11-03T19:13:20.000+05:30
diff --git a/content/blog/2025-10-27-1761560082.md b/content/blog/2025-10-27-1761560082.md
@@ -23,7 +23,7 @@ Therefore the job of running a computation graph (like ONNX) efficiently on GPU(
 - every machine in each factory is being utilized optimally
 - account for the time it takes to move things between cities/factories/machines
 
-And most importantly, you need to focus on your overall goal, i.e. either the time it takes to produce the finished product (i.e. latency) or maximum utilisation of all your machines (i.e. throughput).
+And most importantly, you need to focus on your overall goal, i.e. either the time it takes to produce the finished product (i.e. latency), or maximum utilisation of all your machines (i.e. throughput), or maybe power efficiency.
 
 If you're supporting multiple models, then you're dealing with multiple computation graphs. And if you're supporting multiple GPU vendors (NVIDIA, AMD etc), and multiple architectures of each vendor (e.g. 3060, 4080, 5080 etc), then you're dealing with multiple factory configurations.