Sarvam 30B supports native tool calling and performs consistently on benchmarks that evaluate agentic workflows involving planning, retrieval, and multi-step task execution. On BrowseComp it scores 35.5, outperforming several comparable models on web-search-driven tasks. On Tau2 (avg.) it scores 45.7, indicating reliable performance across extended interactions. SWE-Bench Verified remains challenging across models; Sarvam 30B is competitive within its class. Taken together, these results suggest the model is well suited to real-world agentic deployments that require efficient tool use and structured task execution, particularly production environments where inference efficiency is critical.
It's also helpful to discover along which dimensions