Augment Code ยท @augmentcode

๐†๐๐“-๐Ÿ’.๐Ÿ ๐š๐ฅ๐ฆ๐จ๐ฌ๐ญ ๐ญ๐จ๐ฉ๐ฌ ๐‚๐ฅ๐š๐ฎ๐๐ž ๐Ÿ‘.๐Ÿ• ๐จ๐ง ๐œ๐จ๐๐ข๐ง๐ ?! New eval dropping using our #1 SWE-...

View this X/Twitter post from @augmentcode published on April 15, 2025 at 12:02 AM. This post contains 1 images.

Published
April 15, 2025 at 12:02 AM
Thread Items
2
Media Items
1
Augment Code avatar
Augment Code
@augmentcode
April 15, 2025 at 12:02 AM

Tweet Overview

View this X/Twitter post from @augmentcode published on April 15, 2025 at 12:02 AM. This post contains 1 images.

๐†๐๐“-๐Ÿ’.๐Ÿ ๐š๐ฅ๐ฆ๐จ๐ฌ๐ญ ๐ญ๐จ๐ฉ๐ฌ ๐‚๐ฅ๐š๐ฎ๐๐ž ๐Ÿ‘.๐Ÿ• ๐จ๐ง ๐œ๐จ๐๐ข๐ง๐ ?!

New eval dropping using our #1 SWE-bench coding agent!

- GPT-4.1 beats Gemini 2.5 Pro and almost tops Claude 
   3.7 Sonnet!
- Even GPT-4.1 mini matches Claude 3.5 Sonnet V2 
   performance. It was the top model just 2mo ago!
Augment Code media
The evaluation is done through our proprietary codebase understanding benchmark AugmentQA. You can learn more at: https://www.augmentcode.com/blog/you-make-your-evals-then-your-evals-make-you-introducing-augmentqa

Try our agent yourself at: http://www.augmentcode.com.

Related Creators

TwitFast

v1.4.88

Free Twitter video downloader. Top Twitter trends and hashtags list, Monitor, track hottest trending topics, hashtags.

ยฉ 2024 TwitFast All rights reserved.