Add support for CUDA sparse BA solver #2717
Conversation
@S-o-T FYI, since I believe you added this new feature to Ceres.

```cpp
#if (CERES_VERSION_MAJOR >= 3 || \
     (CERES_VERSION_MAJOR == 2 && CERES_VERSION_MINOR >= 2)) && \
    !defined(CERES_NO_CUDSS) && defined(CUDA_ENABLED)
  if (options_.use_gpu) {
```
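For reference, a minimal sketch of how a guard like this could be wired into a backend-selection helper; the function name and the CPU fallback are assumptions for illustration, not the actual COLMAP code:

```cpp
#include <ceres/types.h>
#include <ceres/version.h>

// Illustrative helper mirroring the guarded check above: prefer the CUDA
// sparse backend when Ceres >= 2.2 was built with cuDSS and CUDA is enabled,
// otherwise fall back to a CPU sparse backend. Everything except the guard
// and the use_gpu flag is a hypothetical sketch.
ceres::SparseLinearAlgebraLibraryType ChooseSparseBackend(bool use_gpu) {
#if (CERES_VERSION_MAJOR >= 3 || \
     (CERES_VERSION_MAJOR == 2 && CERES_VERSION_MINOR >= 2)) && \
    !defined(CERES_NO_CUDSS) && defined(CUDA_ENABLED)
  if (use_gpu) {
    return ceres::CUDA_SPARSE;
  }
#endif
  return ceres::SUITE_SPARSE;
}
```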
I think it might be worth some experimentation/tuning to find a problem-size threshold for switching from the CPU to the GPU sparse backend, although it might be too sensitive to the choice between SuiteSparse and Eigen (at least until the MKL sparse backend is merged into ceres-solver).
Yes, the current parameters are not tuned. It will be difficult to find universal thresholds here that generalize across different CPU/GPU models; it was a little easier before, as we switched between algorithms that all run on the CPU. As such, I decided to explicitly expose the parameters through the option manager for the bundle adjuster, and we might need to do the same for the bundle adjustment that runs as part of the mapper. For now, this feature is disabled by default, as I want to gain some more experience with it over the next weeks.
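To make the discussion concrete, a minimal sketch of what such a problem-size heuristic could look like; the parameter name and threshold value below are hypothetical and untuned, not the options exposed by this PR:

```cpp
#include <ceres/types.h>

// Purely illustrative heuristic: only switch to the GPU sparse backend above
// a certain problem size. The member name and default value are assumptions;
// a real threshold would need tuning per CPU/GPU combination.
struct SparseBackendHeuristic {
  int min_num_images_for_gpu = 500;  // hypothetical, untuned

  ceres::SparseLinearAlgebraLibraryType Select(int num_images,
                                               bool gpu_available) const {
    if (gpu_available && num_images >= min_num_images_for_gpu) {
      return ceres::CUDA_SPARSE;
    }
    return ceres::SUITE_SPARSE;
  }
};
```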
I'm getting the following when trying to use the ba_use_gpu flag:
The only instance of ceres I know of on my machine is the ceres.dll inside the colmap/bin folder. I am using CUDA 11.8; FWIW, nvidia-smi tells me the CUDA version is 12.3. My CUDA_HOME and CUDA_PATH both point to … Am I missing something or using this incorrectly? I am only trying this because I'm getting sudden crashes during bundle adjustment: I have tried numerous known-working datasets that never crashed before, now everything I process crashes during BA, and I am trying to pinpoint which dependencies on my system might have changed.
You need to custom-compile the latest Ceres development version with CUDA support. This is not part of the current set of pre-compiled colmap binaries.
Okay, I can certainly do that. I updated my original message, but I'll include it in this reply as well. Sorry to hijack this topic; I'm happy to open a new issue, but was trying to avoid that. "I am only trying this because I'm getting sudden crashes during bundle adjustment and have tried numerous, known working datasets that never crashed before and now everything I process crashes during BA and I am trying to pinpoint what dependencies on my system might have changed." Perhaps something in vcpkg from the VCPKG_ROOT env variable?
Looks like ceres is causing the problem for me. I am doing another run now with a ceres dll (with CUDA support) that I built using vcpkg.
You'll have to recompile colmap from scratch; you cannot just recompile ceres.dll and replace it. First, the colmap build system detects CUDA support in Ceres at compile time, not at runtime. Second, C++ does not have a stable ABI, so you cannot mix and match different compiler or standard library versions, and it will be difficult to match the exact combination of OS/compiler/stdlib/etc. on your system.
This PR implements enhancement #2643. It builds upon Ceres' recent CUDA_SPARSE solver type.
Initial experiments show significant runtime improvements. On my machine with an Intel Core i9 10920X and an NVIDIA RTX 2070, I see a consistent 3x speedup for reconstructions with ~500-5000 images. For smaller problems with ~100 images, the runtime is roughly equivalent. These experiments were done using CUDA 12.5 and cuDSS 0.3.0.
For now, the feature is disabled by default and must be enabled explicitly via the option. This is because there is no robustness yet against situations where the GPU does not have enough memory, and some of the thresholds that determine when to use sparse direct vs. indirect solvers still need to be tuned for this new scenario.
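To illustrate the direct vs. indirect trade-off mentioned above, here is a minimal sketch of how a bundle adjuster might pick between the two Ceres Schur solvers; the image-count threshold and function name are hypothetical, not the logic added in this PR:

```cpp
#include <ceres/ceres.h>

// Illustrative only: use a sparse direct Schur solver (optionally CUDA-backed)
// for small to medium problems, and an iterative Schur solver with a
// Schur-Jacobi preconditioner for very large ones. The threshold is a
// hypothetical, untuned value.
void ConfigureBundleAdjustmentSolver(int num_images, bool use_gpu,
                                     ceres::Solver::Options* options) {
  constexpr int kMaxImagesForDirectSolver = 2000;  // hypothetical threshold
  if (num_images <= kMaxImagesForDirectSolver) {
    options->linear_solver_type = ceres::SPARSE_SCHUR;
    options->sparse_linear_algebra_library_type =
        use_gpu ? ceres::CUDA_SPARSE : ceres::SUITE_SPARSE;
  } else {
    options->linear_solver_type = ceres::ITERATIVE_SCHUR;
    options->preconditioner_type = ceres::SCHUR_JACOBI;
  }
}
```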