SearchAbout
Source

UK AI Safety Institute

Visit site ↗
More compute, more capability: Why AI agent evaluations need to account for test-time compute
Safety & Policy

More compute, more capability: Why AI agent evaluations need to account for test-time compute

AISI's research highlights the importance of accounting for test-time compute in AI agent evaluations, as fixed budgets can underestimate capabilities, especially for newer models. Increasing compute can improve performance, and the benefits are more significant for more advanced models.

UK AI Safety Institute13 min read2d agoSign in to upvoteSign in to save
You’re all caught up.