Tags: NVIDIA/numba-cuda
Tags
v0.16.0 - Updates for recent API changes (#313) - Fix lineinfo generation when compile_internal used (#271) (#287) - Build docs with NVIDIA Sphinx theme (#312) - Don't skip debug tests when LTO enabled by default (#311) - Use `cuda.bindings` and `cuda.core` for `Linker` (#133) - Enable LTO by default when pynvjitlink is available (#310)
v0.14.0 - Add support for launch bounds (#272) - Make 7.5 the default compute capability (#275) - Add optional dependencies (#278) - Add debug support for dynamic types (#270) - Add CUDA 12.9 and new architectures (#273) - Always enable pynvjitlink if available (#263) - Support passing pointers to kernels, and document supported kernel arguments (#269)
v0.13.0 - Skip `test_extension_adds_linkable_code` when test binaries unavailable (#267) - Optionally test exp2 for lower python versions (#266) - Add CUDA simulator test workflow (#178) - CUDASIM: Update to include recently-added APIs (#178) - CUDASIM: Support detection of device function calls (#178) - Remove some invalid array reshape tests (#178) - Rename `numba.cuda.runtime` to `memory_management` (#178) - Fix `test_external_cooperative_func()` (#261)
v0.12.0 - Allow External Code to Use Cooperative Group (#240) - Improve debug info for kernel arguments (#242) - Allow Numba NVRTC Binding Search Additional Paths (#254) - Add Bfloat16 High Level API, Documentation (#245) - add a test to use bf16 bindings inside device functions (#244) - Change CI to only be manually triggered to save on CI runs (#252) - Simplify the CI build and test matrix (#249)
PreviousNext