Published onJune 2, 2026How the community trained Gemma to "Think" with Tunix and TPUsreasoningmodeltunixrewardmodelsgrpotrainingstructuredLarge Language Models (LLMs) often benefit from "thinking" before they speak for complex tasks. Frontier LLMs like Gemini 3 and leading open weight models like Gemma 4 can produce explicit reasoning...