Chris Yang

GCP / TPU / Inference / Infra notes

Notes from Cloud AI Infra work — TPU v7 Ironwood, GPU inference (vLLM / SGLang), large-scale training pipelines, and the cloud infrastructure underneath. Written for engineers who want concrete details over abstract overviews.

Latest posts