How much do LLMs memorize?

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Prolonged Reinforcement Learning

The Illusion of Thinking

Gemini 2.5 technical report