Casper Hansen (@casper_hansen_) - o3 competitor: GLM 4.5 by Zhipu AI - hybrid reasoning model (on by defa...

2025年7月28日 13:07

推文概览

查看 @casper_hansen_ 在 2025年7月28日 13:07 发布的这条 X/Twitter 推文。这条内容包含 4 张图片。

o3 competitor: GLM 4.5 by Zhipu AI
- hybrid reasoning model (on by default)
- trained on 15T tokens
- 128k context, 96k output tokens
- $0.11 / 1M tokens
- MoE: 355B A32B and 106B A12B

Benchmark details:
- tool calling: 90.6% success rate vs Sonnet’s 89.5% vs Kimi K2 86.2%
- coding: 40.4% win rate vs Sonnet, 53.9% vs Kimi K2, 80.8% vs Qwen3 Coder

Models: https://huggingface.co/collections/zai-org/glm-45-687c621d34bda8c9e4bf503b

Zhipu AI has also released their entire post-training infrastructure: https://github.com/THUDM/slime

Slight correction to the post:
- it was not just 15T tokens
- it was 23.1T tokens total!

For those wondering how to get GLM 4.5 cheaply: you need to use their Mainland China API.
https://x.com/casper_hansen_/status/1949828862096949314

Update: Zhipu AI says the initial benchmark I posted are not up-to-date, so here is the updated version.

Note that I found the original benchmark comparison on their bigmodel documentation.

o3 competitor: GLM 4.5 by Zhipu AI - hybrid reasoning model (on by default) - trained on 15T tokens - 128k co...

推文概览

相关创作者

Free Twitter video downloader. Top Twitter trends and hashtags list, Monitor, track hottest trending topics, hashtags.

其他链接

下载器

相关产品

© 2024 TwitFast 保留所有权利。