v0.0.3
What's Changed
- Minor fix to test file name by @yf225 in #1
- Add CI workflow by @yf225 in #2
- Allow direct running of add.py example by @yf225 in #6
- [CI] Use A10G (g5.4xlarge) machine type by @yf225 in #4
- Use site-package for torch in pyre_configuration by @jansel in #8
- Add use_default_config setting by @jansel in #9
- Add LICENSE/CONTRIBUTING.md/CODE_OF_CONDUCT.md by @jansel in #11
- Support persistent reductions by @jansel in #10
- Fix handling of block_ptr + reductions by @jansel in #12
- Support inductor lowerings that require multiple buffers by @jansel in #13
- Adjust rtol/atol for test_sum_keepdims by @yf225 in #14
- Support Python 3.10; Run lint in CI by @yf225 in #7
- Support looped reductions by @jansel in #15
- Compile in a subprocess to kill hangs by @jansel in #16
- Refactor autotuning logging by @jansel in #17
- Support view ops by @jansel in #18
- Support indirect loads by @jansel in #19
- Improve README.md by @jansel in #20
- Support if/else control flow by @jansel in #21
- Add hl.constexpr specialization by @jansel in #22
- Fix license file for PEP 621 by @oulgen in #23
- Use search-strategy: all for all site packages in pyre config by @stroxler in #25
- Add decorator check by @oulgen in #24
- Trigger CI on pull requests made by ghstack by @oulgen in #27
- Add hl.register_block_size and explicit tile sizes by @jansel in #30
- Update lint github workflow by @jansel in #31
- Add ../pytorch-nightly to Pyre optional_search_path by @yf225 in #36
- Fix TensorDescriptor handling in _find_device by @yf225 in #35
- Add HELION_USE_DEFAULT_CONFIG env var to force use default config by @yf225 in #37
- Add more pytorchbot utils by @oulgen in #43
- Add the core properties to Config object by @drisspg in #49
- Switch build system to Hatchling which has much better Language Server support by @drisspg in #55
- Add attention example and fix some bugs by @jansel in #56
- Fix bug where non-tensor variables are not exposed to inner loops by @jansel in #58
- Add hl.grid(...) support by @yf225 in #59
- Fix more unit tests by @oulgen in #64
- Fix test_matmul_tensor_descriptor unit test by @yf225 in #65
- Prototyping an hl.atomic op by @drisspg in #63
- Add hl.specialize and improve reduction handling by @jansel in #72
- [test] Touch test/__init__.py to support more testing workflows by @danzimm in #73
- [reland without ghstack] handle PTXASError by @jansel in #79
- Support data-dependent loop bounds by @jansel in #81
- Add support for hl.tile(begin, end) and hl.tile(begin, end, block_size) by @jansel in #82
- Support user-defined minimum in hl.register_block_size by @jansel in #83
- Don't re-wrap exceptions in exc.TorchOpTracingError by @jansel in #84
- Add hl.tile_index() by @jansel in #89
- Add filecheck dependency by @jansel in #95
- Add env HELION_PRINT_OUTPUT_CODE=1 by @jansel in #93
- Add extra_mask arg to hl.load and hl.store by @jansel in #94
- Bump project version by @oulgen in #101
- Swap to using hatch vcs by @oulgen in #103
- Add publish to pypi workflow by @oulgen in #104
New Contributors
- @stroxler made their first contribution in #25
- @drisspg made their first contribution in #49
- @danzimm made their first contribution in #73
Full Changelog: https://github.com/pytorch-labs/helion/commits/v0.0.3