8000 Releases ยท SWE-agent/SWE-agent ยท GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Releases: SWE-agent/SWE-agent

v1.1.0: 10s of thousands of training trajectories

22 May 16:11
0f3acaf
Compare
Choose a tag to compare

v1.1.0: 10s of thousands of training trajectories

We're very excited to announce our new project SWE-smith, generating 10s of thousands of training trajectories for SWE agents.
Using this training data, our LM SWE-agent-LM-32b achieves open-weights SotA on SWE-bench verified with SWE-agent!

image

Apart from that, v1.1.0 is mostly a fix release with minor improvements, in particular adding compatibility with SWE-bench multilingual/multimodal, and SWE-smith. However, please pay attention to the breaking changes below.

Breaking changes

  • Changes to trajectory data format. The messages field is replaced by query by @klieret in #1107
  • Renamed many tool bundles that used "windowed" file viewer (defaults and more) by @klieret in #1147
  • Removed review_on_submit tool bundle (replaced by review_on_submit_m) by @klieret in #1148
  • Change in windowed tools (formerly default): Don't append \n to new file by @klieret in #1114

Added

New dataset support:

New utilities:

Enhanced

  • Feat: Config/override max_output_tokens by @klieret in #1036
  • Enh: [#1042] fix(run_batch): handle JSON parsing errors in trajectory check by @FRAOTIAC in #1043
  • Enh: Allow to override tools dirs etc. by @klieret in #1046
  • Enh: Allow to override path to swe-bench dataset by @klieret in #1093
  • Enh: Allow to disable python-standalone for batch by @klieret in #1115
  • Enh: More information on skipped exit status by @klieret in #1117

Fixed

New Contributors

Full Changelog: v1.0.1...v1.1.0

v1.0.1: SOTA on SWE-Bench Full

28 Feb 16:46
6aff215
Compare
Choose a tag to compare

SWE-agent 1.0.1

News: After our announcements for SOTA on SWE-Bench Lite and Verified, we now can claim SOTA on the full set of 2k GitHub issues of SWE-Bench full:

swe_v1_sota_sbv_announcement_v4

Interestingly, the improvement on the issues that are not also in the Lite/Verified subsets is much higher than the gain for the Lite/Verified subsets. Evaluating only on Lite/Verified doesnโ€™t tell the whole story! -

What's Changed

This fixup release brings fixes mostly to the compatibility with local models. We have also significantly expanded the documentation in that aspect (models & keys documentation).

Changed

  • Change: Make anthropic_filemap the new default config by @klieret in #927

Added

  • Enh: Set timeout for post_startup_commands by @klieret in #973
  • Enh: Allow to override max_input_tokens for local models by @klieret in #992

Fixes

  • Fix: Handling local models cost lookup issues by @klieret in #937
  • Fix: Requires-python >= 3.11 by @klieret in #940
  • traj inspector viewport reset by @klieret in #946
  • Fix: Reset viewport when next/prev step/traj by @klieret in #948
  • Fix: Disable highlighting of model outputs by @klieret in #949
  • Fix: Create PRs by @klieret in #954
  • Fix: Add init,py to agent/hooks by @RNabel in #961
  • Fix: Pin textual to version 1.0.0 by @RNabel in #960
  • Fix: OpenAI API: Don't pass None tool_calls to the OpenAI API by @RNabel in #967
  • Fix: Forces platform to be linux/amd64 for swe-bench batch runs by @carlosejimenez in #942
  • Fix "TypeError: Cannot read properties of null (reading 'replace')" in Trajectory viewer by @0xba1a in #989
  • Fix: No retries if costs cannot be calculated by @klieret in #990
  • Fix: Race condition/size change during iteration by @klieret in #993
  • Fix: Handle total cost limit exceeded by @klieret in #994

New Contributors

Full Changelog: v1.0.0...v1.0.1

v1.0.0

13 Feb 16:21
8ed382c
Compare
Choose a tag to compare

SWE-agent 1.0

swe_a_v1_announcement_v6

News

So much new stuff! Here's a quick rundown of the cool new things you can do:

โœจ Fast, massively parallel code execution with SWE-ReX.
โœจ Run SWE-agent locally but execute code in the cloud (using modal, AWS, or anything else that runs SWE-ReX).
โœจ Configurable retry mechanisms: Try multiple agent configurations, models, parameters, etc., then choose the best one.
โœจ Flexible tool definitions with tool bundles.
โœจ All language models supported using litellm (see models).
โœจ Override any configuration option from the command line (see command line basics).
โœจ New command line trajectory inspector to scroll few hundreds of trajectories with ease.
โœจ New command line interface with subcommands for running over single issues, batches, and various utility commands.
โœจ Greatly simplified and cleaned up codebase. In particular, the Agent class is now much easier to modify.

Read more about this in our 1.0 features & migration guide.

New Contributors

Full Changelog: v0.7.0...v1.0.0

SWE-agent EnIGMA (0.7.0)

25 Sep 14:45
dc18a74
Compare
Choose a tag to compare

SWE-agent is SOTA on offensive cybersecurity

SWE-agent EnIGMA (Enhanced Interactive Generative Model Agent) is SOTA on offensive cybersecurity challenges, with a 3.3x improvement over previous agents on the NYU CTF challenge dataset. The EnIGMA project introduces multiple novelties that are available to all use cases of SWE-agent, such as Interactive Agent Tools and a Summarizer to handle long outputs.

Major additions

Smaller additions

Fixes

  • Compatibility with SWE-bench 2.0 by @klieret in #671
  • ensure variables work in special command docstring by @forresty in #628
  • Important fix: Catch CostLimitExceeded in retry because of format/block by @klieret in #682
  • Fix: Handle empty traj in should_skip by @klieret in #616
  • Fix for end-marker communicate: Exit status always 0/invalid by @klieret in #644
  • Fix: Insufficient quoting of git commit message by @klieret in #646
  • Fix nonsensical trajectory formatting for PRs by @klieret in #647
  • Fix: sweunexpected keyword 'python_version' by @klieret in #692
  • Fix: Use LONG_TIMEOUT for pre_install commands by @klieret in #695
  • Fix: UnboundLocalError when catching decoding issue by @klieret in #709
  • Also create empty patch files for completeness by @klieret in #725
  • Fix: Raise ContextWindowExceeded instead of exit_cost by @klieret in #727
  • Fix: Deal with non-utf8 encoded bytes in comm by @klieret in #731
  • Fix: Handle spaces in repo names by @klieret in #734
  • Fix: Ensure utils is part of package by @klieret in #742
  • Fix: Submitting ' ' in human mode crashes container by @klieret in #749
  • Fix: Block su as command by @klieret in #752
  • Fix: SWE_AGENT_MODEL_MAX_RETRIES needs casting by @klieret in #757

New Contributors

๐ŸŽ‰ @talorabr, @udiboy1209, @haoranxi, @NickNameInvalid, @rollingcoconut joined the team to build EnIGMA ๐ŸŽ‰

v0.6.1

20 Jun 15:21
Compare
Choose a tag to compare

This is (mostly) a patch release, in particular fixing several issues that had been introduced by the speed improvements of v0.7.0.
We also solve a bug where existing linter errors in a file left SWE-agent unable to edit (because of our lint-retry-loop).

Breaking changes

  • Change: sparse clone method is now correctly called "shallow" by @klieret in #591

Improved

  • Enh: Show commands when encountering timeout error by @klieret in #582
  • Enh: Configuration option to show time in log by @klieret in #583
  • Enh: Allow to configure LONG_TIMEOUT for SWEEnv by @klieret in #584
  • Enh: Always write log to traj directory by @klieret in #588

Fixed

  • fix docker.errors.NotFound by @klieret in #587
  • Fix: Revert to full clone method when needed by @klieret in #589
  • Fix: Refresh container_obj before querying status by @klieret in #590
  • Fixed #571 - show message that model arg is ignored in case of using Azure OpenAI by @jank in #592
  • Fix: Linting blocks for existing lint errors by @klieret in #593
  • Fix: Process done marker not found in read with timeout by @klieret in #596

v0.6.0

05 Jun 13:16
14a5189
Compare
Choose a tag to compare
< 8000 div data-view-component="true" class="Box-body">

What's Changed

image

We sped up SWE-agent by 2x (timed with GPT4o). This is mostly due to faster communication with the running processes inside of the Docker container and other container setup & installation related improvements. Here are a few relevant PRs:

  • Switch to fast communicate and shallow clone by default by @klieret in #530
  • Change: Only wait 1s for docker to start by @klieret in #541
  • Feat: experimental shallow cloning by @klieret in #498
  • Enh: Start from clone of python conda environment for speedup by @klieret in #548
  • Enh: Use uv for editable install by default by @klieret in #547

Fixed

  • Web UI: Remove -n option to wait by @klieret in #487
  • Web UI: Kill the Flask server on exit. by @kwight in #479
  • Web UI: Avoid proxy errors on MacOS by @klieret in #506
  • Ensure container_name is reset for non-persistent containers by @klieret in #463
  • Fix: Do not allow persistent container with cache task imgs by @klieret in #551

Improved

  • Improve scrolling behavior in web UI by @anishfish2 in #420
  • Web UI: Render Markdown in agent feed messages. by @kwight in #486
  • Enh: Remove redundant 'saved traj to X' messages by @klieret in #528
  • Allow to disable config dump to log by @klieret in #537
  • Resolve relative paths to demonstrations and commands by @klieret in #444

New Contributors

Full Changelog: v0.5.0...v0.6.0

v0.5.0

28 May 17:14
c8e8ba6
Compare
Choose a tag to compare

What's Changed

โœจ The big news is our brand new documentation โœจ

image

Secondly, @ollmer added a new flag --cache_task_images that will significantly speed up SWE-agent when running on the same environment/repository multiple times (no more waiting for cloning and installation!)

Breaking changes

  • We have reformatted our codebase. If you create a PR based on a previous commit, make sure you install our pre-commit hook to avoid merge-conflicts because of formatting. See our docs for more information.
  • Remove direct imports in __init__.py (you can no longer from sweagent import Agent by @klieret in #436

Added

  • Running the web UI is now supported when running swe-agent completely in docker
  • Speed up evaluation by caching task environments as docker images by @ollmer in #317

Improved

  • Add gpt-4o model by @raymyers in #344
  • Web: Allow to specify commit hash by @klieret in #358
  • Add default environment_setup config by @klieret in #351
  • Enh: Suppress openai logging; improve formatting of stats by @klieret in #416
  • Remove signal dependency by @klieret in #428
  • Do not use select if running on Windows by @klieret in #429
  • Use custom Config class to support env and keys.cfg (this allows passing keys as environment variables) by @klieret in #430

Fixes

  • Web: Fix script_path input by @klieret in #334
  • Fix: Don't print patch msg for exit_cost patch by @klieret in #343
  • Fix: Do not request job control in bash by @klieret in #345
  • Fix: --base_commit not used for gh urls by @klieret in #346
  • Fix: Separate data path/traj dir cause exception by @klieret in #348
  • Add docker-py lower bound by @klieret in #406
  • Fix: IndexError when replaying incomplete trajectories by @klieret in #410

New Contributors

Full Changelog: v0.4.0...v0.5.0

0.4.0 Web UI

09 May 14:58
1e065f8
Compare
Choose a tag to compare

What's Changed

Weโ€™re excited to launch the SWE-agent web UI! Specify a bug, press start and watch SWE-agent do the magic โœจ

quick_ui

New Contributors

Full Changelog: v0.3.0...v0.4.0

0.3.0

02 May 15:47
43b8de5
Compare
Choose a tag to compare

What's Changed

โœจ Features

  • Run SWE-agent in the cloud using GitHub Codespaces
  • Add GPT4-turbo model by @zgrannan in #252
  • feat: Amazon Bedrock support (Claude models) by @JGalego in #207

๐Ÿ› Fixes

โค๏ธ New Contributors

Full Changelog: v0.2.0...v0.3.0

v0.2.0

15 Apr 19:01
58aa046
Compare
Choose a tag to compare

What's Changed

Added

  • Allow to run on local repos (new flag: --repo_path) by @klieret in #193
  • Patch files are now saved separately to a patch directory by @klieret in #126
  • Allow to supply custom installation commands when running on gh issues or locally (--environment_setup) by @klieret in #153
  • Allow to specify openapi base url in keys.cfgby @bvandorf in #118

Improved

Fixed

New Contributors

Full Changelog: v0.1.2...v0.2.0

0