v1.1.0: 10s of thousands of training trajectories
We're very excited to announce our new project SWE-smith, generating 10s of thousands of training trajectories for SWE agents.
Using this training data, our LM SWE-agent-LM-32b achieves open-weights SotA on SWE-bench verified with SWE-agent!
Apart from that, v1.1.0 is mostly a fix release with minor improvements, in particular adding compatibility with SWE-bench multilingual/multimodal, and SWE-smith. However, please pay attention to the breaking changes below.
Breaking changes
- Changes to trajectory data format. The
messages
field is replaced byquery
by @klieret in #1107 - Renamed many tool bundles that used "windowed" file viewer (
defaults
and more) by @klieret in #1147 - Removed
review_on_submit
tool bundle (replaced byreview_on_submit_m
) by @klieret in #1148 - Change in
windowed
tools (formerlydefault
): Don't append \n to new file by @klieret in #1114
Added
New dataset support:
- Feat: Support multilingual evaluation by @kabirgh in #1090
- Feat: SWE-smith & multimodal base support by @klieret in #1092
New utilities:
Enhanced
- Feat: Config/override max_output_tokens by @klieret in #1036
- Enh: [#1042] fix(run_batch): handle JSON parsing errors in trajectory check by @FRAOTIAC in #1043
- Enh: Allow to override tools dirs etc. by @klieret in #1046
- Enh: Allow to override path to swe-bench dataset by @klieret in #1093
- Enh: Allow to disable python-standalone for batch by @klieret in #1115
- Enh: More information on skipped exit status by @klieret in #1117
Fixed
- Fix: Setting max_input_tokens to 0 by @klieret in #999
- Fix: Explicitly set log file encoding by @klieret in #1013
- Fix: Ensure pydantic-settings env prefix set by @klieret in #1018
- Fix: run batch processing with modal by @vsee in #1023
- Fix: Catch exit forfeit by @klieret in #1024
- Fix: Use 'latest' image tag for SWE-Bench images by @klieret in #1029
- Fix: Show tenacity retry reasons by @klieret in #1032
- Fix: Compatibility with textual 2.0 by @klieret in #1033
- Fix: Use default trajectories dir according to ENV by @vsee in #1054
- Fix: fix Windows path error, replace Path with PurePosixPath or string by @alwaysgoodtime in #1052
- Fix: Ensure tools PATH takes precedence by @klieret in #1058
- Fix: Ensure state exists by @klieret in #1065
- Fix spelling of 'agent' in hello world by @edspencer in #1077
- Fix: Inspector needs to handle new message format by @klieret in #1094
- Fix: SWEBenchInstances with path and no subset initiated as other instance type by @klieret in #1096
- Fix: Token limit exceeded for PR body issue by @klieret in #1098
- Fix: Work around litellm claude 3.7 tokens to 128k by @klieret in #1106
- Fix(repo): Ensure absolute path for copy repo by @klieret in #1116
- Fix execution time timeouts by @klieret in #1118
- Fix: Hierarchical merge of multiple configs by @klieret in #1123
- fix message type missing by @klieret in #1127
- Fix: Conditional for warning about empty template by @klieret in #1137
New Contributors
- @vsee made their first contribution in #1023
- @FRAOTIAC made their first contribution in #1043
- @jpaodev made their first contribution in #1050
- @alwaysgoodtime made their first contribution in #1052
- @alexgshaw made their first contribution in #1056
- @talorabr made their first contribution in #1026
- @katia-sentry made their first contribution in #1070
- @edspencer made their first contribution in #1077
- @kabirgh made their first contribution in #1090
Full Changelog: v1.0.1...v1.1.0