Exploit band-diagonal structure in quasisep transitions for better scaling #240

dfm · 2025-07-02T18:21:37Z

This PR builds on the fact that most real applications of quasiseparable kernels have internal block-diagonal structure. If we ignore this fact, then the computational cost scales as the cube of the model "width" (approximately the number of kernel terms), but we can get better scaling (quadratic in width) if we exploit this structure.
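To illustrate where the savings come from, here is a minimal NumPy sketch (not the implementation in this PR). It assumes a transition matrix made of small square blocks on the diagonal and applies it to a dense `J x J` matrix. The helper names `block_diag_matmul` and `dense_block_diag` are hypothetical and exist only for this example. The dense product costs on the order of `J**3` operations, while the block-wise product costs on the order of `J**2` times the block size.

```python
import numpy as np


def block_diag_matmul(blocks, x):
    """Compute block_diag(*blocks) @ x without forming the dense matrix."""
    out = np.empty_like(x)
    start = 0
    for block in blocks:
        stop = start + block.shape[0]
        # Each block only touches its own slice of rows of x.
        out[start:stop] = block @ x[start:stop]
        start = stop
    return out


def dense_block_diag(blocks):
    """Assemble the full (mostly zero) matrix, for comparison only."""
    size = sum(b.shape[0] for b in blocks)
    dense = np.zeros((size, size))
    start = 0
    for b in blocks:
        stop = start + b.shape[0]
        dense[start:stop, start:stop] = b
        start = stop
    return dense


rng = np.random.default_rng(0)
blocks = [rng.normal(size=(2, 2)) for _ in range(8)]  # width J = 16
x = rng.normal(size=(16, 16))

fast = block_diag_matmul(blocks, x)   # ~ J**2 * block_size operations
slow = dense_block_diag(blocks) @ x   # ~ J**3 operations
```

Both paths give the same result up to floating-point error; only the operation count differs, and the gap grows with the width.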

Here is the scaling in model width before and after this change:

As expected, for wide models, this makes a dramatic difference in the runtime performance.

I'll note that we do end up paying a non-trivial compile time cost:

But I think that's probably worth it.

(Note that in all these examples I've set the environment variable: XLA_FLAGS=--xla_backend_extra_options=xla_cpu_small_while_loop_byte_threshold=1000000.)
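For anyone reproducing these timings, one way to apply that setting from Python is sketched below. It assumes the flag should be in place before JAX initializes its backend, so it is set before `import jax`; any existing `XLA_FLAGS` value is preserved. This is a sketch of a common pattern, not something taken from this PR.

```python
import os

# The flag quoted in the PR description above.
flag = "--xla_backend_extra_options=xla_cpu_small_while_loop_byte_threshold=1000000"

# Append to any flags that are already set rather than overwriting them.
existing = os.environ.get("XLA_FLAGS", "")
os.environ["XLA_FLAGS"] = f"{existing} {flag}".strip()

# Import JAX only after the environment variable is set, e.g.:
# import jax
```

Setting the variable in the shell before launching Python works just as well.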

In this PR, I'll enable the new behavior by default, but we could consider adding an option to disable it.

dfm force-pushed the band-diag branch from c9f7656 to 099219e Compare July 2, 2025 18:43

Exploit band-diagonal structure in quasisep transitions for better scaling.

6a9966b

dfm force-pushed the band-diag branch from 099219e to 6a9966b Compare July 8, 2025 16:06

[pre-commit.ci] auto fixes from pre-commit.com hooks

d587a93

for more information, see https://pre-commit.ci

dfm merged commit 190c0ec into main Jul 8, 2025
8 checks passed
