8000 Embed parallelization into the multi_voxel_fit decorator. by arokem · Pull Request #2593 · dipy/dipy · GitHub

Embed parallelization into the multi_voxel_fit decorator. #2593


Merged: 56 commits merged into dipy:master on Jun 29, 2024

Conversation

@arokem (Contributor) commented on May 8, 2022

I've started playing around with the idea that the multi_voxel_fit decorator could use paramap instead of iterating over voxels. If we can make this work generally, that would be pretty cool. So far, I've only tested this with the fwdti model, and in that case the additional changes to the code are rather minimal, which gives me hope that we might be able to use this wherever we use this decorator, so in csd, dsi, forecast, fwdti, gqi, ivim, mapmri, mcsd, qtdmri, and shore (!).
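To make the idea concrete, here is a minimal, self-contained sketch of what an engine-aware multi_voxel_fit decorator could look like. This is an illustration only, not DIPY's actual implementation: ToyModel, the "threads" engine, and the keyword names are hypothetical stand-ins for paramap and its backends.

```python
from functools import partial
from concurrent.futures import ThreadPoolExecutor

import numpy as np


def multi_voxel_fit(single_voxel_fit):
    """Turn a single-voxel fit method into a whole-volume fit, optionally parallel."""
    def new_fit(self, data, mask=None, engine="serial", n_jobs=4):
        if mask is None:
            mask = np.ones(data.shape[:-1], dtype=bool)
        voxels = data[mask]  # (n_voxels, n_measurements)
        fit_one = partial(single_voxel_fit, self)
        if engine == "serial":
            return [fit_one(vox) for vox in voxels]
        # Stand-in for paramap with a joblib/dask/ray backend:
        with ThreadPoolExecutor(max_workers=n_jobs) as executor:
            return list(executor.map(fit_one, voxels))
    return new_fit


class ToyModel:
    @multi_voxel_fit
    def fit(self, voxel_data):
        # Trivial per-voxel "model": just the mean signal.
        return float(voxel_data.mean())
```

The serial and parallel paths return identical results; the point of the PR is that the non-serial branch can be routed through paramap so every decorated model gets parallelism for free.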

@pep8speaks commented on May 8, 2022

Hello @arokem, thank you for updating!

Line 71:31: E203 whitespace before ':'

Comment last updated at 2024-06-29 20:43:35 UTC

@skoudoro (Member) commented on May 8, 2022

Thank you for starting this @arokem!

Have you looked at #1418? I think some ideas can be reused here.

@arokem (Contributor, Author) commented on May 8, 2022 via email

@codecov (bot) commented on May 9, 2022

Codecov Report

Attention: Patch coverage is 80.43478% with 9 lines in your changes missing coverage. Please review.

Project coverage is 83.64%. Comparing base (5fc3d44) to head (b2a381a).

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #2593      +/-   ##
==========================================
- Coverage   83.65%   83.64%   -0.02%     
==========================================
  Files         152      152              
  Lines       21374    21395      +21     
  Branches     3459     3465       +6     
==========================================
+ Hits        17881    17895      +14     
- Misses       2629     2636       +7     
  Partials      864      864              
Files Coverage Δ
dipy/reconst/csdeconv.py 87.42% <100.00%> (ø)
dipy/reconst/dsi.py 80.21% <100.00%> (ø)
dipy/reconst/forecast.py 92.82% <100.00%> (ø)
dipy/reconst/fwdti.py 94.28% <100.00%> (ø)
dipy/reconst/gqi.py 54.00% <100.00%> (ø)
dipy/reconst/ivim.py 96.00% <100.00%> (ø)
dipy/reconst/mapmri.py 92.09% <100.00%> (ø)
dipy/reconst/mcsd.py 88.69% <100.00%> (ø)
dipy/reconst/qtdmri.py 93.38% <100.00%> (ø)
dipy/reconst/shore.py 91.90% <100.00%> (ø)
... and 2 more

@arokem (Contributor, Author) commented on May 9, 2022

I ran a benchmark on a beefy 24-CPU compute server with the recent commit. I get a roughly 13x speedup for fitting the fwdti model with engine="joblib" relative to the default serial mode. I should mention that the server is also doing a bunch of other work, so it's not the cleanest benchmark, but it is still quite promising.

@arokem arokem changed the title WIP: Embed parallelization into the multi_voxel_fit decorator. Embed parallelization into the multi_voxel_fit decorator. May 9, 2022
@arokem (Contributor, Author) commented on May 16, 2022

Does anyone understand why half the CI actions are still pending? They have been pending since Friday!

@skoudoro (Member):

No, but I will restart them first

@skoudoro (Member):

Hi @arokem,

It seems we have a new issue with the DIPY installation. I do not know yet what changed; the CIs are failing in all PRs.
I will start digging into it.

@arokem force-pushed the para_multivoxel branch from 892d26c to 7628a78 on May 17, 2022 16:09
@arokem (Contributor, Author) commented on May 17, 2022

Just rebased on top of #2595

@arokem (Contributor, Author) commented on May 18, 2022

Does anyone understand these CI failures? I don't think they are related to the content of the PR, but I might be missing something.

@skoudoro (Member):

Does anyone understand these CI failures? I don't think they are related to the content of the PR, but I might be missing something.

Both failures are on the parallelization CIs, with a memory-leak issue. This might be caused by some of the parallel packages changing environment variable flags, which could affect this parallelized function.

This is all supposition; it is just what first comes to mind.

@skoudoro (Member):

The failing functions are using OpenMP.

@arokem force-pushed the para_multivoxel branch from 7628a78 to a9b3c2f on May 28, 2022 05:07
@arokem (Contributor, Author) commented on May 29, 2022

Hey @skoudoro, I noticed that you did not pin the ray version in #2600, instead pinning only protobuf, but I am seeing this again on the CI: https://github.com/dipy/dipy/runs/6634820045?check_suite_focus=true#step:9:119, which suggests to me that I should pin ray to 0.11 for now. Does that make sense to you? I'll give it a try here.

@arokem (Contributor, Author) commented on May 29, 2022

Or possibly 1.11.1
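If pinning turns out to be the way to go, it would be a one-line constraint in the CI requirements. The version comes from the comment above; whether this exact pin is right was still being tested at this point in the thread:

```
ray==1.11.1
```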

@arokem force-pushed the para_multivoxel branch from 48f7b3f to 8d7f71b on May 30, 2022 02:53
@arokem (Contributor, Author) commented on May 30, 2022

We're back to this failure: https://github.com/dipy/dipy/runs/6645881563?check_suite_focus=true#step:9:3751

Interestingly, I can't get this to fail locally on my machine (in an env with dask, ray, and joblib installed). I also don't exactly understand how this is related to OpenMP. Does set_number_of_points use OpenMP?

single_voxel_with_self = partial(single_voxel_fit, self)
n_jobs = kwargs.get("n_jobs", multiprocessing.cpu_count() - 1)
vox_per_chunk = np.max([data_to_fit.shape[0] // n_jobs, 1])
chunks = [data_to_fit[ii:ii + vox_per_chunk]
          for ii in range(0, data_to_fit.shape[0], vox_per_chunk)]
Review comment by @arokem (Contributor, Author):
This might duplicate memory. Need to benchmark.
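The chunking logic in the hunk above can be exercised on its own. The shapes here are toys, but the arithmetic matches the snippet: integer division of voxels by workers, with a floor of one voxel per chunk.

```python
import numpy as np

data_to_fit = np.zeros((10, 6))  # 10 voxels x 6 measurements (toy shape)
n_jobs = 3
vox_per_chunk = max(data_to_fit.shape[0] // n_jobs, 1)
chunks = [data_to_fit[ii:ii + vox_per_chunk]
          for ii in range(0, data_to_fit.shape[0], vox_per_chunk)]
# 10 voxels with chunk size 3 -> chunk lengths 3, 3, 3, 1
```

Note that each slice is a NumPy view, so chunking itself is cheap; the duplication worried about in the comment would happen when a chunk is pickled and shipped to a worker process.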

@arokem (Contributor, Author) commented on Dec 13, 2022

Plan to make progress here:

  • Set up experimental datasets: All of the models except for DSI can use multi-shell data. Only CSD (I think) can run on single-shell data. For multi-shell datasets we can use HBN and HCP. For DSI, I guess we can use the dsi dataset we have in our data fetchers. We'll need to set up fetchers for HBN data (see Replace CENIR multishell with HBN POD2 data #2695) and for HCP (see Port HCP fetcher from pyAFQ into here #2696).

  • Set up experimental scripts (separate repo, probably): these should run every one of the models that are decorated in this PR with:
    1. Serial mode.
    2. Parallelized by voxel with dask, ray, joblib.
    3. Parallelized by chunk with dask, ray, joblib.
    4. Parallelized with different backends if possible.
    5. For ray/dask, parallelize on a big distributed AWS cluster.

  • Run the experiments. We'll need to have some uniform hardware settings. We'll want to run this on different OS (Windows, Linux, Mac OS) and maybe on different kinds of computational architectures (e.g., distributed cluster vs. one big machine).

  • Separately benchmark timing (this is straightforward) and memory (using https://github.com/pythonprofilers/memory_profiler).

  • Compare and contrast 😄
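For the timing half of this plan, a minimal stdlib harness might look like the sketch below; fit_serial is a hypothetical placeholder for any of the decorated model fits, and memory would be tracked separately with memory_profiler as linked above.

```python
import timeit

import numpy as np


def fit_serial(data):
    # Placeholder for a serial per-voxel model fit.
    return [float(vox.mean()) for vox in data]


def best_time(fn, data, repeats=3):
    # Best-of-N wall-clock time; taking the min is the usual way to
    # reduce noise from other load on the machine.
    return min(timeit.repeat(lambda: fn(data), number=1, repeat=repeats))


data = np.random.rand(500, 32)
t_serial = best_time(fit_serial, data)
print(f"serial fit: {t_serial:.4f} s")
```

The same harness would then be run with each engine/backend combination and the ratios compared across OSes and hardware.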

@skoudoro (Member) left a comment:

For now, it works with `engine in ["serial", "dask"]`:

  • my laptop crashes with ray
  • see below for the joblib issue

I will share the timing when those two are fixed.

Thanks @arokem

_parallel_fit_worker,
chunks,
func_args=[single_voxel_with_self],
**kwargs)
Review comment by @skoudoro (Member):

dask did not complain, but joblib fails with this:

TypeError: __init__() got an unexpected keyword argument 'vox_per_chunk'

We need to update the paramap function.
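One sketch of the kind of fix this error calls for: separate the decorator-level options from what gets forwarded to the backend, so joblib's Parallel never sees keyword arguments it does not accept. The split_kwargs helper and the key names are hypothetical, not paramap's actual API.

```python
def split_kwargs(kwargs, local_keys=("vox_per_chunk",)):
    # Pop decorator-level options so that only backend-recognized
    # keyword arguments are forwarded to the parallel engine.
    local = {k: kwargs.pop(k) for k in list(kwargs) if k in local_keys}
    return local, kwargs


local, backend = split_kwargs({"vox_per_chunk": 10, "n_jobs": 4})
# local == {"vox_per_chunk": 10}; backend == {"n_jobs": 4}
```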

@skoudoro force-pushed the master branch 5 times, most recently from 7e158ff to dda2ffa on December 8, 2023 16:00
@arokem force-pushed the para_multivoxel branch from 0140c8c to 79b13ef on June 27, 2024 22:24
@arokem (Contributor, Author) commented on Jun 27, 2024

Rebased. Could you help me with the CI? I am not sure how to check that we're in the "optional" case.

@skoudoro (Member) left a comment:

Rebased. Could you help me with the CI? I am not sure how to check that we're in the "optional" case.

Thanks! It should be something like the suggestion below; I hope there is no typo in this syntax.

Co-authored-by: Serge Koudoro <skab12@gmail.com>
@arokem (Contributor, Author) commented on Jun 28, 2024 via email

@skoudoro (Member) left a comment:

Hi @arokem,

I was going to merge, but I saw that a previous comment had been removed but not addressed. See the suggestion below, and then I can go ahead.

Thank you

arokem and others added 2 commits June 29, 2024 13:43
Co-authored-by: Serge Koudoro <skab12@gmail.com>
Co-authored-by: Serge Koudoro <skab12@gmail.com>
@arokem (Contributor, Author) commented on Jun 29, 2024

Sorry about that and thanks for spotting this.

@skoudoro (Member) left a comment:

okay, all good, I will go ahead and merge this PR.

Thank you for this amazing work @arokem and @asagilmore!

NOTE: we need to document this in detail somewhere, and that will take time. So, for now, I recommend a follow-up PR that adds a small recipe to the DIPY documentation (see https://docs.dipy.org/stable/recipes): just a small section such as "How to accelerate the fitting in all DIPY reconstruction models?" or something similar.

We're writing up a report about this, and we'd be happy to have input on the results and ideas that we are developing there (the repo for that report is https://github.com/nrdg/2024-dipy-parallelization).

Where can we find the comparison with joblib, dask, and ray? I see a lot of work with Ray, and it is clearly the recommended backend. However, it would be great to see its advantage over the other backends. Thank you for your feedback.

@skoudoro skoudoro merged commit 6c59e39 into dipy:master Jun 29, 2024
30 of 31 checks passed
@skoudoro (Member) commented on Jul 2, 2024

Following up on my questions above, @arokem and @asagilmore.
