Add example using a Zarr array #191

jakirkham · 2019-03-29T21:59:05Z

Description

Creates a large Zarr array that is mostly zero except for a small square of 1's in the middle. Demonstrates how one might combine Dask and Zarr to move through larger data in the viewer.

Type of change

Bug-fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

References

How has this been tested?

example: the test suite for my feature covers cases x, y, and z
example: all tests pass with my change

Final checklist:

My PR is the minimum possible work for the desired functionality
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works

sofroniewn

Looks good to me!

jakirkham · 2019-04-01T01:07:04Z

As a note, this works reasonably fast using this in-memory dataset.

@sofroniewn and I looked at a dataset stored in Zarr on disk, but found it less responsive than desired. There are any number of potential reasons for that. It’s probably worth benchmarking to identify the exact cause.

@jni mentioned one cause may be the disk’s read speed, which could be significant here. Other issues may be performance issues in Zarr; however, these presumably would impact the in-memory case as well, but don’t seem to.

Besides benchmarking some other things to try would be using Zarr’s LRUStoreCache and/or using memory-mapping ( zarr-developers/zarr-python#377 ).

If there are still problems loading data quickly, would suggest engaging with the Zarr community. There’s a meta issue ( zarr-developers/zarr-python#382 ), which discusses and links to a number of potential improvements in the data loading process. This involves a few different caches at different stages for different purposes.

jni · 2019-04-03T02:28:53Z

More side notes:

I wonder if some of the slowness comes from our out-of-order handling of dimensions
note also that Python has a built-in lru cache: https://docs.python.org/3/library/functools.html#functools.lru_cache may or may not be useful...!

Thanks for the links to the community, @jakirkham!

sofroniewn · 2019-04-06T00:05:16Z

@jni @jakirkham I'm making great progress with the on-disk zarr example. Unsurprisingly it is all about finding the right chunk size and making sure the chunking dimensions align with our slicing dimensions. Any order of the dimensions is fine, but things just have to match up.

I havn't had to make any changes to the code apart from adding an optional clim_range argument so as to avoid the min / max calls.

It's very exciting! Thanks again @jakirkham for your help - this would have taken A LOT longer for me to figure out without your help

jakirkham added 3 commits March 29, 2019 14:54

Add an example using Zarr

25eb050

Require Zarr for testing

268b1a8

Make example size smaller for CI

f71d622

sofroniewn approved these changes Mar 30, 2019

View reviewed changes

jni merged commit df1a963 into napari:master Apr 2, 2019

sofroniewn mentioned this pull request Apr 6, 2019

Support visualization of lazy ndarrays #103

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add example using a Zarr array #191

Add example using a Zarr array #191

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add example using a Zarr array #191

Add example using a Zarr array #191

Uh oh!

Conversation

Description

Type of change

References

How has this been tested?

Final checklist:

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!