Why is Gemini 3.1 Pro’s Output Token Count So Large? 3 Steps to Understand the Hidden Thinking Tokens in Reasoning Models
Author's Note: A detailed explanation of why Gemini 3.1 Pro Preview's output tokens far exceed the visible text: The Thinking Tokens reasoning chain mechanism, billing rules, and cost-saving tips for tuning thinking_level. "I just sent one sentence, and the model only replied with a dozen words. Why does it show nearly 900 output tokens? Where…
