description [ICLR 2026][LLM Reasoning][large language reasoning models] This paper treats reflection tokens (e.g., "wait", "but") in the reasoning process as schedulable "resources" and, inspired by ...