Improved Full Detector Response storage and conversion #364

jdbuhler · 2025-05-28T14:04:44Z

This PR overhauls the on-disk and in-memory format of the FullDetectorResponse. The principal goals are to

reduce on-disk storage size via HDF5 internal compression, without incurring an unacceptable running time penalty
reduce rsp->h5 conversion time and memory usage
allow h5->rsp conversion without data loss

The biggest change is that the raw CDS ring counts are stored as small integers on disk, separately from the effective area multipliers. This change greatly improves compressibility and reduces peak memory during rsp->h5 conversion. As a bonus, it allows more efficient weighted averaging of slices at runtime and dynamic selection of 32- vs 64-bit PSRs.

Conversion has been separated from the FullDetectorResponse class into its own RspConverter class, which handles both rsp->h5 and h5->rsp.

After consultation with Israel et al., these changes remove support for the following deprecated formats and features:

sparse responses
miniDC2 format
reading the spectrum for effective area computation from a file (was already broken in DC3 codebase)

The new on-disk response format is not backwards or forwards compatible with the previous format. For this reason, I've established a new directory COSI-SMEX/develop in the public cosipy bucket on wasabi, put the new response files there, and updated the tutorials to use them instead of the ones from DC2/DC3. It is anticipated that DC4 will use the new format, while DC3 will remain with the current one.

The patch introduces a dependency on the hdf5plugin package, a well-supported companion to h5py that implements the BitShuffle compression algorithm used by the new response format.

Supporting performance numbers

Conversion time and memory

DC2 O3 continuum response: < 4 minutes, 4 GB
DC3 O3 polarization response: < 6 minutes, 6.33 GB
O4 version of continuum response: < 90 minutes, 52.7 GB

Testing was on a 2.3 GHz Intel Xeon Gold 5118 server with 192 GB of RAM and SSD storage; only one core was used.

(The first two used to take hours, while the last took overnight and needed at least half a terabyte of RAM)

HDF5 response file sizes on disk

DC2 O3 continuum response: 592.76 MB (.rsp.gz is 841.58 MB)
DC3 O3 polarization response: 795.76 MB (.rsp.gz is 954.58 MB )
O4 version of continuum response: 4187.05 MB (.rsp.gz is 5186.87 MB)

These file sizes are < 1/10 the size of the previous uncompressed HDF5 responses for O3, and more like 1/50 for O4
response, assuming (as was the case for the file I received) the latter is stored in float32 precision.

Read times

The following are average times to read one slice from the NuLambda axis (i.e., rsp[i]). Testing was on the same machine used for the conversion tests.

DC2 O3 continuum response: 4.5 ms (old), 6.4 ms (new)
DC3 O3 polarization response: 8.8 ms (old), 11 ms (new)
O4 version of continuum response: 350 ms (old), 46 ms (new)

"Old" is the previous response format, while "new" is the format in this PR. Note that for large enough responses, such as the O4 continuum response, it's faster to read less data from disk and decompress than to read the data uncompressed.

By way of comparison, compressing the DC2 O3 continuum response with HDF5's gzip internal compression yielded a substantially larger file (889.77 MB) and read times of 21 ms, > 3x longer than with the BitShuffle compression used in this patch.

into a new RspConverter class * change on-disk response format to split out counts from effective area, and use low-overhead compression on counts * allow counts to be stored in integer types smaller than 32 bits; provide auto-detection of the best size as an option * Rework .rsp conversion code to use less memory * Remove outdated support for sparse .rsp and broken code for file-based normalization * Enable axes of response to be read and written via the Axes object instead of replicating its functionality. Keep textual descriptions of axes in a separate group for pretty-printing. * Allow each Healpix axis to have its own nside; all-sky axes use nside=1 * capture and store header fields other than the axes and the size of counts in the .h5 file for future reference

tables stored at class scope. * Filter axes to be used for output HDF5 file as they are being read from .rsp, rather than arbitrarily removing the last couple of axes afterwards.

…deconvolution * make default for response construction to infer the element size from the data, rather than guess a too-large size

the content of the response * fix __array__ method of FullDetectorResponse * use new test full detector and polarization responses, and update test suite to expect the outputs that occur when they are used (work in progress)

* remove backup file that snuck into docs

* add conversion to Histogram to FullDetectorResponse API, rather than open-coding it in image_deconvolution/dataIF

not relevant to the output being tested and occasionally causes crashes

…side of an axis

…sponse * add unit tests for response conversion between .rsp and .h5 * fix typo in FullDetectorResponse.to_histogram()

* remove old commented-out code

* fix typo in open()

* to further avoid confusion between response versions, rename CONTENTS dataset to COUNTS, since it is now raw counts * FullDetectorResponse.open() should support only HDF5, as we cannot use an .rsp file live, it is not reasonable to convert a usefully large (o3 or above) .rsp.gz file "on the fly", and we have a separate converter class now.

…nverting the same .rsp to .h5 twice yields identical byte streams. * Do order tracking of headers ourselves instead of relying on HDF5 to do it.

…er than "DC2/DC3", with new checksums. Shorten the names while we're at it. * Remove vestiges of miniDC2 from image deconvolution notebooks * make sure setup for cosipy installs hdf5plugin package

existence of COSI-SMEX/develop tree on wasabi

* update .h5s for test full detector responses to newest on-disk format

eff_area, which controls the type returned by __getitem__, etc. This lets us have a float32 response instead of float64 if desired.

Empirically, we get slightly better compression and notably faster decompression without the extra shuffle

entire response in memory in order to do F-to-C order transposition. Instead, we do the transposition one chunk at a time, which roughly halves memory usage for large responses.

not a general Histogram. Rename the relevant function from to_histogram() to to_dr() and update its return type accordingly

* Add get_pixel() method to implement __getitem__, but with the possibility of specifying a weight that we can apply to eff_area rather than to the much larger counts matrix for greater efficiency. Use this method in get_psr() and friends.

with new "quiet" option to RspConverter class - response test cases now use get_pixel() rather than __getitem__ for most tests

and use it in RspConverter when converting back to .rsp

codecov · 2025-05-28T14:10:19Z

Codecov Report

Attention: Patch coverage is 86.97789% with 53 lines in your changes missing coverage. Please review.

Project coverage is 83.67%. Comparing base (7abbeea) to head (52fdccb).

Files with missing lines	Patch %	Lines
cosipy/response/RspConverter.py	83.93%	49 Missing ⚠️
cosipy/response/FullDetectorResponse.py	96.70%	3 Missing ⚠️
cosipy/response/DetectorResponse.py	50.00%	1 Missing ⚠️

Files with missing lines	Coverage Δ
cosipy/image_deconvolution/dataIF_COSI_DC2.py	`92.10% <100.00%> (+0.04%)`	⬆️
cosipy/response/__init__.py	`100.00% <100.00%> (ø)`
cosipy/response/DetectorResponse.py	`90.74% <50.00%> (ø)`
cosipy/response/FullDetectorResponse.py	`90.43% <96.70%> (+34.02%)`	⬆️
cosipy/response/RspConverter.py	`83.93% <83.93%> (ø)`

... and 2 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

so we can exercise it with smaller sizes * fix file exists check

.h5 file was added to the develop directory on wasabi

israelmcmc · 2025-06-10T15:26:00Z

Thanks again @jdbuhler!

I'll check this PR in detail in a couple of weeks. In the meanwhile, I'd like ask the following people to take a quick look at specific parts of the code:

@GallegoSav: can you please take a look at the RspConverter? I know some features have been dropped by Jeremy, I'm OK with that.
@hirokiyoneda: can you please check the code impacting the imaging deconvoltution module and the extended source setup in FullDetectorResponse?
@eneights: can you please check the parts of the code that use the "Pol" axis? Also, can you double check that your ASAD results are still consistent. Jere 8000 my changed that expected values for the unit test, which were using a very very coarse response, but I would expect that the results from a not-so-coarse response are still consistent.

None of these individual checks need to be in-dept, just a quick look for things the pop to your eyes would be helpful.

cosipy/response/RspConverter.py

GallegoSav

Hi , I reviewed the code in the response folder. I just have minor comments about typo

its first dimension. Removing this assumption would be a large change with performance implications for no clear benefit. For now, verify the assumption when we open the response file.

israelmcmc · 2025-06-12T13:23:02Z

Thanks @GallegoSav!

Jeremy Buhler and others added 29 commits May 16, 2025 15:57

* Make more of the behavior of RspCoverter driven by

bc67449

tables stored at class scope. * Filter axes to be used for output HDF5 file as they are being read from .rsp, rather than arbitrarily removing the last couple of axes afterwards.

* replace explicit use of guts of response with an API call in image …

f40738d

…deconvolution * make default for response construction to infer the element size from the data, rather than guess a too-large size

* update some more tests with output for new response

bbeda0b

* remove backup file that snuck into docs

* don't exchange Phi, PsiChi axes -- test suite passes without it

b8cb13d

* add conversion to Histogram to FullDetectorResponse API, rather than open-coding it in image_deconvolution/dataIF

turn off resampling of covariance matrix in 3ML fitting, which is

0c90628

not relevant to the output being tested and occasionally causes crashes

simplify some tests that explicitly check the energy edge array and n…

076c32f

…side of an axis

* add functionality to write .rsp.gz file from an open FullDetectorRe…

c435983

…sponse * add unit tests for response conversion between .rsp and .h5 * fix typo in FullDetectorResponse.to_histogram()

* add method to extract effective area from response

c6bb8eb

* remove old commented-out code

* eff_area cannot be both local field and exposed property

ab3fa7a

* fix typo in open()

* Do not use HDF5's timestamps or order-tracking features, so that co…

9a52953

…nverting the same .rsp to .h5 twice yields identical byte streams. * Do order tracking of headers ourselves instead of relying on HDF5 to do it.

* Update notebooks to point to new-format responses in "develop" rath…

e22376a

…er than "DC2/DC3", with new checksums. Shorten the names while we're at it. * Remove vestiges of miniDC2 from image deconvolution notebooks * make sure setup for cosipy installs hdf5plugin package

document key features of new response format and

3f98849

existence of COSI-SMEX/develop tree on wasabi

* add source for test polarization response to test data

42d43b8

* update .h5s for test full detector responses to newest on-disk format

Add option to FullDetectorResponse.open() to set the datatype of

b814855

eff_area, which controls the type returned by __getitem__, etc. This lets us have a float32 response instead of float64 if desired.

add .shape property for FullDataResponse analogous to Histogram

b21e0ea

Don't shuffle for compression when we are already bit-shuffling.

f955df6

Empirically, we get slightly better compression and notably faster decompression without the extra shuffle

Rework .rsp to .h5 conversion so it does not keep two copies of the

dc55401

entire response in memory in order to do F-to-C order transposition. Instead, we do the transposition one chunk at a time, which roughly halves memory usage for large responses.

fix range too narrow error from line bg estimation with new response

3bee2f2

update generated test files from new responses

f1f7c08

avoid a copy when building spec/aeff responses

da3d340

Loading the full response to memory should return a DetectorResponse,

485954d

not a general Histogram. Rename the relevant function from to_histogram() to to_dr() and update its return type accordingly

* Add accessor for raw counts dataset

b358243

* Add get_pixel() method to implement __getitem__, but with the possibility of specifying a weight that we can apply to eff_area rather than to the much larger counts matrix for greater efficiency. Use this method in get_psr() and friends.

- add tqdm progress bars to rsp-to-h5 conversion; may be disabled

85877dc

with new "quiet" option to RspConverter class - response test cases now use get_pixel() rather than __getitem__ for most tests

Add "headers" property to FullDetectorResponse to access stored headers,

fc8d178

and use it in RspConverter when converting back to .rsp

Merge branch 'cositools:develop' into new_response_format

40405bf

update comment on compression with some new numbers

e5d9702

Jeremy Buhler and others added 5 commits May 28, 2025 10:47

small test tweaks to improve coverage of FullDetectorResponse.py

ce12288

* add ability to specify text buffer size for response converter,

b9a72f8

so we can exercise it with smaller sizes * fix file exists check

Merge branch 'cositools:develop' into new_response_format

fc876dd

Update newly added tutorial to use new-style response; a new

f88e261

.h5 file was added to the develop directory on wasabi

minor comment cleanup

0d5a327

GallegoSav reviewed Jun 12, 2025

View reviewed changes

cosipy/response/RspConverter.py Show resolved Hide resolved

GallegoSav reviewed Jun 12, 2025

View reviewed changes

cosipy/response/RspConverter.py Show resolved Hide resolved

GallegoSav reviewed Jun 12, 2025

View reviewed changes

cosipy/response/RspConverter.py Outdated Show resolved Hide resolved

GallegoSav reviewed Jun 12, 2025

View reviewed changes

Jeremy Buhler added 2 commits June 12, 2025 07:59

fix typo

834ebf7

FullLDetectorResponse assumes that the on-disk response has NuLambda as

52fdccb

its first dimension. Removing this assumption would be a large change with performance implications for no clear benefit. For now, verify the assumption when we open the response file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improved Full Detector Response storage and conversion #364

Improved Full Detector Response storage and conversion #364

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Improved Full Detector Response storage and conversion #364

Are you sure you want to change the base?

Improved Full Detector Response storage and conversion #364

Uh oh!

Conversation

Supporting performance numbers

Conversion time and memory

HDF5 response file sizes on disk

Read times

Uh oh!

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!