(CUDA) Templating Rewrite, main branch (2025.06.20.) #1029

krasznaa · 2025-06-20T10:11:02Z

Similar to #1026, in preparation for introducing inhomogeneous magnetic fields as discussed in #1017, this PR attempts to harmonize the setup of the CUDA CFK and KF algorithms with how all the other algorithms are implemented by now.

This is of course a big change, but we do need it relatively urgently.

A couple of things to point out:

The template specializations for the KF were in a relatively easily extensible state already, there I just needed to separate out the building of the fill_sort_keys kernel. This was not an issue previously, where only a single template specialization existed for the KF algorithm.
- Note though that the magnetic field type is a bit more deeply woven into the helper typedefs in traccc::core, but I decided to leave that alone in this PR.
The template specializations for the CKF were however pretty rigidly written. Not allowing for the introduction of the telescope geometry in an easy way.
- It was the specializations for propagate_to_next_surface that I modified a bit more deeply. In such a way that (hopefully) introducing non-constant magnetic fields should now be a bit easier in a follow-up PR.

The clients (in this repository) were all easy to update, but it's worth pointing out to people like @paradajzblond (I couldn't easily find Fabrice's GitHub handle...) that client code will need some changes after this PR.

So that it would follow the same UI that all the other CKF algorithms provide. Had to modify the kernel specialization code a little, as it was itself relying on definitions from the old algorithm class. In sort of a circular way.

So that it would follow the same UI that all the other KF algorithms provide.

stephenswat

Looks mostly harmless assuming you didn't change any of the actual algorithm code, but let's get all the B-field specialisation done in one go. Plus could do with some refactoring to reduce code volume and improve maintainability.

stephenswat · 2025-06-20T11:00:30Z

device/cuda/CMakeLists.txt

+  "src/finding/combinatorial_kalman_filter_algorithm_constant_field_default_detector.cu"
+  "src/finding/combinatorial_kalman_filter_algorithm_constant_field_telescope_detector.cu"


This should go into some kind of specialization directory.

I guess it could. Though we're not doing that with any of the other libraries either. And it's not really "specialization" that we do here. But rather we split the implementation of the traccc::cuda::combinatorial_kalman_filter_algorithm into multiple source files. As combinatorial_kalman_filter_algorithm.cpp just implements the constructor, while the other two implement the two current "execute functions" of the class.

stephenswat · 2025-06-20T11:07:26Z

device/cuda/CMakeLists.txt

+  "src/fitting/kalman_fitting_algorithm_constant_field_default_detector.cu"
+  "src/fitting/kalman_fitting_algorithm_constant_field_telescope_detector.cu"


stephenswat · 2025-06-20T11:10:47Z

device/cuda/src/finding/kernels/specializations/propagate_to_next_surface_default_detector.cu

+    detray::actor_chain<detray::pathlimit_aborter<scalar>,
+                        detray::parameter_transporter<default_algebra>,
+                        interaction_register<interactor_t>, interactor_t,
+                        detray::momentum_aborter<scalar>, ckf_aborter>>;


This actor chain can be deduplicated between the different CKF TUs.

Let me see how to best do this. 🤔

stephenswat · 2025-06-20T11:11:21Z

device/cuda/src/fitting/kalman_fitting_algorithm_constant_field_default_detector.cu

+        detray::rk_stepper<bfield_type::view_t,
+                           default_detector::device::algebra_type,
+                           detray::constrained_step<scalar_type>>;


This RK stepper can also be deduplicated.

stephenswat · 2025-06-20T11:11:38Z

device/cuda/src/fitting/kalman_fitting_algorithm_constant_field_telescope_detector.cu

+        detray::rk_stepper<bfield_type::view_t,
+                           telescope_detector::device::algebra_type,
+                           detray::constrained_step<scalar_type>>;


Same as above, some refactoring will make this a lot more maintainable.

stephenswat · 2025-06-20T11:12:33Z

device/cuda/src/fitting/kernels/specializations/fit_backward_telescope_detector.cu

+#include "traccc/utils/detector_type_utils.hpp"
+
+namespace traccc::cuda {
+using fitter = fitter_for_t<traccc::telescope_detector::device>;


While you're at it, could you specialise these on the B-field?

stephenswat · 2025-06-20T11:13:17Z

device/cuda/src/fitting/kernels/specializations/fit_forward_telescope_detector.cu

+#include "traccc/utils/detector_type_utils.hpp"
+
+namespace traccc::cuda {
+using fitter = fitter_for_t<traccc::telescope_detector::device>;


This will also need to specialized on the B-field.

krasznaa

I'd prefer to attack traccc/utils/detector_type_utils.hpp in a separate PR. One that mainly deals with the magnetic field. I was on purpose trying to limit the number of changes in this PR. 🤔

krasznaa · 2025-06-20T11:31:58Z

device/cuda/CMakeLists.txt

+  "src/finding/combinatorial_kalman_filter_algorithm_constant_field_default_detector.cu"
+  "src/finding/combinatorial_kalman_filter_algorithm_constant_field_telescope_detector.cu"


I guess it could. Though we're not doing that with any of the other libraries either. And it's not really "specialization" that we do here. But rather we split the implementation of the traccc::cuda::combinatorial_kalman_filter_algorithm into multiple source files. As combinatorial_kalman_filter_algorithm.cpp just implements the constructor, while the other two implement the two current "execute functions" of the class.

krasznaa · 2025-06-20T11:32:38Z

device/cuda/src/finding/kernels/specializations/propagate_to_next_surface_default_detector.cu

+    detray::actor_chain<detray::pathlimit_aborter<scalar>,
+                        detray::parameter_transporter<default_algebra>,
+                        interaction_register<interactor_t>, interactor_t,
+                        detray::momentum_aborter<scalar>, ckf_aborter>>;


Let me see how to best do this. 🤔

krasznaa · 2025-06-20T11:32:57Z

device/cuda/src/fitting/kalman_fitting_algorithm_constant_field_default_detector.cu

+        detray::rk_stepper<bfield_type::view_t,
+                           default_detector::device::algebra_type,
+                           detray::constrained_step<scalar_type>>;


While also re-designing the templating in the functions that implement the CKF algorithms for the different backends.

krasznaa · 2025-06-20T14:09:10Z

I think I managed to find a way to make the templating int he CKF functions a bit more user friendly. Making the "main functions" specifically templated on the detector and magnetic field types. And then using some newly introduced typedefs to figure out the correct Detray types based on just the detector and magnetic field types.

Now on to do the same for the KF algorithms...

While also slightly re-designing the templating in the functions that implement the KF algorithms for the different backends.

krasznaa · 2025-06-20T15:57:20Z

This PR ballooned into a much bigger thing by now than I originally intended. But the good news is that after this, adding non-const magnetic field versions of the finding and fitting functions will take a very finite amount of new lines of code. 🤔

sonarqubecloud · 2025-06-20T15:57:45Z

Quality Gate passed

Issues
4 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
15.9% Duplication on New Code

See analysis details on SonarQube Cloud

krasznaa · 2025-06-20T16:06:43Z

core/src/fitting/kalman_fitting_algorithm_constant_field_default_detector.cpp

+    traccc::details::kalman_fitter_t<
+        default_detector::host,
+        covfie::field<traccc::const_bfield_backend_t<
+            default_detector::host::scalar_type>>::view_t>
        fitter{det, field, m_config};


Technically... this could also be expressed like:

auto fitter = details::make_kalman_fitter(det, field, m_config);

(After writing an appropriate helper function of course.)

But let's leave that to a future improvement.

krasznaa · 2025-06-20T16:09:48Z

...ce/cuda/src/finding/combinatorial_kalman_filter_algorithm_constant_field_default_detector.cu

+    const measurement_collection_types::const_view& measurements,
+    const bound_track_parameters_collection_types::const_view& seeds) const {
+
+    using scalar_type = default_detector::device::scalar_type;


Shoot. Forgot to remove this typedef...

krasznaa requested review from stephenswat and beomki-yeo June 20, 2025 10:11

krasznaa added cuda Changes related to CUDA high priority labels Jun 20, 2025

krasznaa added 3 commits June 20, 2025 12:37

Re-wrote the CUDA CKF algorithm.

7135107

So that it would follow the same UI that all the other CKF algorithms provide. Had to modify the kernel specialization code a little, as it was itself relying on definitions from the old algorithm class. In sort of a circular way.

Adapting the client code to the CUDA CKF rewrite.

cee15e0

Re-wrote the CUDA KF algorithm.

14ff2ae

So that it would follow the same UI that all the other KF algorithms provide.

krasznaa force-pushed the CUDATemplatingRewrite-main-20250620 branch from 780e44c to 314edcc Compare June 20, 2025 10:37

stephenswat requested changes Jun 20, 2025

View reviewed changes

krasznaa commented Jun 20, 2025

View reviewed changes

krasznaa added 2 commits June 20, 2025 16:00

Adapting the client code to the CUDA KF rewrite.

896536e

Introduced a common set of typedefs for the CKF algorithms.

a336d1f

While also re-designing the templating in the functions that implement the CKF algorithms for the different backends.

krasznaa force-pushed the CUDATemplatingRewrite-main-20250620 branch from 314edcc to a336d1f Compare June 20, 2025 14:07

Introduced a common typedef for the KF algorithms.

974c489

While also slightly re-designing the templating in the functions that implement the KF algorithms for the different backends.

krasznaa changed the title ~~CUDA Templating Rewrite, main branch (2025.06.20.)~~ (CUDA) Templating Rewrite, main branch (2025.06.20.) Jun 20, 2025

krasznaa commented Jun 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

(CUDA) Templating Rewrite, main branch (2025.06.20.) #1029

(CUDA) Templating Rewrite, main branch (2025.06.20.) #1029

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

		"src/finding/combinatorial_kalman_filter_algorithm_constant_field_default_detector.cu"
		"src/finding/combinatorial_kalman_filter_algorithm_constant_field_telescope_detector.cu"

		"src/fitting/kalman_fitting_algorithm_constant_field_default_detector.cu"
		"src/fitting/kalman_fitting_algorithm_constant_field_telescope_detector.cu"

(CUDA) Templating Rewrite, main branch (2025.06.20.) #1029

Are you sure you want to change the base?

(CUDA) Templating Rewrite, main branch (2025.06.20.) #1029

Conversation

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Quality Gate passed

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!