-
Notifications
You must be signed in to change notification settings - Fork 90
kerchunk reference file and Zarr groups #556
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Narrowing this down a bit further, If I downgrade Zarr and Kerchunk to the following versions, the desired behavior returns:
|
Do you mean that the existing reference file opens, or only if you remake it with the older version of kerchunk? |
@martindurant apologies for the lack of clarity -- I was intending to say that the existing reference file opens. If it helps, I essentially tack on the group information to the reference keys after they are returned from scan_grib... and as far as I can tell with the recent changes to kerchunk, it should still be using the Zarr v2 spec? I'm not well versed in what's changed with the Zarr spec for v3, but my intuition is that v2 should still work. |
Yes - it runs under zarr3, but produces v2 metadata, which will continue to be supported. |
Well in that case, this reference file should theoretically still work then. Not entirely sure where to start other than inside the xarray back end and work from there. No errors or warnings show up in the console when passing the open_group info, it just returns an empty dataset. I'll try to do some deeper digging and figure out what changed. |
Writing it out longhand and chosing one particular group: fs = fsspec.filesystem("asyncwrapper", fs=fsspec.filesystem("file"), asynchronous=True)
ds = xr.open_dataset("reference://native", engine='zarr', backend_kwargs={"storage_options": {"fo": "/Users/mdurant/Downloads/hrrr-hybrid.json", "fs": fs}, "consolidated": False})
ds gives
To actually get data out, you also need remote_options (asynchronous=True) and remote_protocol ("https"). |
Just checking with similar issue in reading json and paraquet refence files in zarr v3, the following issue with fsspec
But however running following could able to open the json
with ends up in following warnign and opening of ds
However following open without any warning
|
I think it's not yet released. It was previously called "async_wrapper", but it turns out URL protocol strings should not have a "_" in.
Yes, this is annoying. kerchunk effectively does a different version of consolidation. It could provide a .zmetadata I suppose. |
Hello,
I was previously storing the output of
scan_grib
using Zarr groups in order to have an entire grib2 file stored in single json reference file. This was so I could use xarray to read from single file, passing in theopen_dataset_options={"group": "my_group"}
to get the portion of the file on a common contiguous coordinate I desired.It appears that in the last year, through various updates of kerchunk, something got changed or broken along the way, and I can no longer read reference JSONs with a Zarr group. I don't know how much good it is, but I have an example reference file attached.
I attempted opening it with:
Previously, this would successfully open the group, but as of now, returns an empty dataset. Any suggestions about what I may be doing wrong? Is this functionality that's no longer supported, or is this a bug?
hrrr-hybrid.json
The text was updated successfully, but these errors were encountered: