Running Kubernetes with GPU inside WSL2 on my RTX Laptop
A step-by-step guide to setting up Kubernetes with NVIDIA GPU access inside WSL2 on a Windows laptop.
Configure modern Python tooling in Helix: LSP, type checker (ty), code formatter (ruff) and more.
A hands-on guide to vLLM monitoring: the key Prometheus metrics (TTFT, TPOT, queueing, KV cache, swapping), Grafana panels, and alert rules that help you debug latency and plan capacity.
Why traditional LLM serving wastes GPU memory – and how vLLM’s PagedAttention model enables larger batches, higher throughput, and more predictable latency.
A concise cheat sheet for uv: managing Python versions, dependencies, virtual environments, scripts, and tools in one fast, cross-platform tool.
How to scan NuGet packages for security vulnerabilities using GitLab CI.
How to use FFmpeg to convert FLAC files to Apple Lossless without losing the original quality and upload them to Apple Music.
Using GitHub Actions and pip-tools to compile a requirements.txt file from your dependencies.
Keycloak allows configuring a custom LDAP user filter for User Federation to select a subset of user entries in Active Directory.
Over the past few years, I've interviewed dozens of software engineers who didn't know how the services they developed run in production.