The Thinking That Happens Between Tokens: How Latent Reasoning Is Rewriting What Models Can Do
There’s a moment in human problem-solving that’s easy to overlook: the pause before the answer. Not silence, exactly — something is happening in there, some rearrangement of partial ideas that never surfaces as words. For decades, language models had no equivalent. They generated token after token in a single forward pass, their “thinking” invisible even…