SWE-agent

持续追踪核心框架、最新模型规格与重要系统接口的更新迭代轨迹。

v1.1.0

v1.1.0: 10s of thousands of training trajectories

v1.1.0: 10s of thousands of training trajectories We're very excited to announce our new project SWE-smith , generating 10s of thousands of training trajectories for SWE agents. Using this training data, our LM SWE-agent-LM-32b achieves open-weights SotA on SWE-bench verified with SWE-agent! Apart from that, v1.1.0 is mostly a fix release with minor improvements, in particular adding compatibility with SWE-bench multilingual/multimodal, and SWE-smith. However, please pay attention to the breaking changes below. Breaking changes Changes to trajectory data format. The messages field is replaced ...

v1.0.1

v1.0.1: SOTA on SWE-Bench Full

SWE-agent 1.0.1 News: After our announcements for SOTA on SWE-Bench Lite and Verified, we now can claim SOTA on the full set of 2k GitHub issues of SWE-Bench full: Interestingly, the improvement on the issues that are not also in the Lite/Verified subsets is much higher than the gain for the Lite/Verified subsets. Evaluating only on Lite/Verified doesn’t tell the whole story! - What's Changed This fixup release brings fixes mostly to the compatibility with local models. We have also significantly expanded the documentation in that aspect ( models & keys documentation ). Changed Change: Mak...

v1.0.0

v1.0.0

SWE-agent 1.0 News So much new stuff! Here's a quick rundown of the cool new things you can do: ✨ Fast, massively parallel code execution with SWE-ReX . ✨ Run SWE-agent locally but execute code in the cloud (using modal, AWS, or anything else that runs SWE-ReX ). ✨ Configurable retry mechanisms: Try multiple agent configurations, models, parameters, etc., then choose the best one. ✨ Flexible tool definitions with tool bundles . ✨ All language models supported using litellm (see models ). ✨ Override any configuration option from the command line (see command line basics ). ✨ New command line tr...

0.7.0

SWE-agent EnIGMA (0.7.0)

SWE-agent is SOTA on offensive cybersecurity SWE-agent EnIGMA (Enhanced Interactive Generative Model Agent) is SOTA on offensive cybersecurity challenges, with a 3.3x improvement over previous agents on the NYU CTF challenge dataset. The EnIGMA project introduces multiple novelties that are available to all use cases of SWE-agent, such as Interactive Agent Tools and a Summarizer to handle long outputs. Major additions Capability to run over CTF challenges Interactive Agent Tools , including gdb Summarizers to handle long outputs Smaller additions Add filemap command in the spirit of repomap by...

v0.6.1

v0.6.1

This is (mostly) a patch release, in particular fixing several issues that had been introduced by the speed improvements of v0.7.0. We also solve a bug where existing linter errors in a file left SWE-agent unable to edit (because of our lint-retry-loop). Breaking changes Change: sparse clone method is now correctly called "shallow" by @klieret in #591 Improved Enh: Show commands when encountering timeout error by @klieret in #582 Enh: Configuration option to show time in log by @klieret in #583 Enh: Allow to configure LONG_TIMEOUT for SWEEnv by @klieret in #584 Enh: Always write log to traj di...

v0.6.0

v0.6.0

What's Changed We sped up SWE-agent by 2x (timed with GPT4o). This is mostly due to faster communication with the running processes inside of the Docker container and other container setup &amp; installation related improvements. Here are a few relevant PRs: Switch to fast communicate and shallow clone by default by @klieret in #530 Change: Only wait 1s for docker to start by @klieret in #541 Feat: experimental shallow cloning by @klieret in #498 Enh: Start from clone of python conda environment for speedup by @klieret in #548 Enh: Use uv for editable install by default by @klieret in <a

v0.5.0

v0.5.0

What's Changed ✨ The big news is our brand new documentation ✨ Secondly, @ollmer added a new flag --cache_task_images that will significantly speed up SWE-agent when running on the same environment/repository multiple times (no more waiting for cloning and installation!) Breaking changes We have reformatted our codebase. If you create a PR based on a previous commit, make sure you install our pre-commit hook to avoid merge-conflicts because of formatting. See our docs for more information. Remove direct imports in __init__.py (you can no longer from sweagent import Agent by @klieret in #436 Ad...

0.4.0

0.4.0 Web UI

What's Changed We’re excited to launch the SWE-agent web UI! Specify a bug, press start and watch SWE-agent do the magic ✨ New Contributors @tam-ng0905 made their first contribution in #321 @nonparibus made their first contribution in #310 @RainRat made their first contribution in #320 Full Changelog : v0.3.0...v0.4.0

0.3.0

0.3.0

What's Changed ✨ Features Run SWE-agent in the cloud using GitHub Codespaces Add GPT4-turbo model by @zgrannan in #252 feat: Amazon Bedrock support (Claude models) by @JGalego in #207 🐛 Fixes Better error handling for --open_pr by @klieret in #239 Fixed a potential error by @DanjieTang in #242 fix: TARGETARCH not set on some OS/docker setups by @mspronesti in #249 Pass Python version to get_environment_yml by @waterson in #271 Fix Together model validation error by @mikanfactory in #236 Doc: Avoid invalid github token by @klieret in <a class="issue-link js-issue-link" data-error-text="Failed

v0.2.0

v0.2.0

What's Changed Added Allow to run on local repos (new flag: --repo_path ) by @klieret in #193 Patch files are now saved separately to a patch directory by @klieret in #126 Allow to supply custom installation commands when running on gh issues or locally ( --environment_setup ) by @klieret in #153 Allow to specify openapi base url in keys.cfg by @bvandorf in #118 Improved Improve error handling of docker issues by @klieret in #165 Make github token fully optional by @klieret in #189 Fixed Fix opening PR from fork by @klieret in #229 Fix: Choosing TogetherAI models by @klieret