Augment Code · @augmentcode

๐†๐๐“-๐Ÿ’.๐Ÿ ๐š๐ฅ๐ฆ๐จ๐ฌ๐ญ ๐ญ๐จ๐ฉ๐ฌ ๐‚๐ฅ๐š๐ฎ๐๐ž ๐Ÿ‘.๐Ÿ• ๐จ๐ง ๐œ๐จ๐๐ข๐ง๐ ?! New eval dropping using our #1 SWE-...

View this X/Twitter post from @augmentcode, published on April 15, 2025, at 12:02 AM. This post contains 1 image.

Published
April 15, 2025, 12:02 AM
Thread Items
2
Media Items
1

Tweet Overview

๐†๐๐“-๐Ÿ’.๐Ÿ ๐š๐ฅ๐ฆ๐จ๐ฌ๐ญ ๐ญ๐จ๐ฉ๐ฌ ๐‚๐ฅ๐š๐ฎ๐๐ž ๐Ÿ‘.๐Ÿ• ๐จ๐ง ๐œ๐จ๐๐ข๐ง๐ ?!

New eval dropping using our #1 SWE-bench coding agent!

- GPT-4.1 beats Gemini 2.5 Pro and almost tops Claude 3.7 Sonnet!
- Even GPT-4.1 mini matches the performance of Claude 3.5 Sonnet V2, which was the top model just two months ago!
The evaluation uses our proprietary codebase-understanding benchmark, AugmentQA. You can learn more at: https://www.augmentcode.com/blog/you-make-your-evals-then-your-evals-make-you-introducing-augmentqa

Try our agent yourself at: http://www.augmentcode.com.
