Fixes to apply_parallel for functions working with multichannel data #4927

grlee77 · 2020-08-18T20:19:24Z

Description

The multichannel argument helps give sensible default chunks and expands scalar depth arguments appropriately for multichannel data.

A concrete example where the dtype argument is needed for this to pass was added as a test case. There is an explanation of the reason it is needed in #4900 (comment).

Checklist

Docstrings for all functions
Gallery example in ./doc/examples (new features only)
Benchmark in ./benchmarks, if your changes aren't covered by an
existing benchmark
Unit tests
Clean style in the spirit of PEP8

For reviewers

Check that the PR title is short, concise, and will make sense 1 year
later.
Check that new functions are imported in corresponding __init__.py.
Check that new features, API changes, and deprecations are mentioned in
doc/release/release_dev.rst.

… images

grlee77 · 2020-08-18T20:21:26Z

One subtlety: specifying dtype does not guarantee the specific output dtype that will be returned by apply_parallel. For floating point inputs, the dtype of the output seems to match precision of the input and not that of the specified dtype argument.

The following example demonstrates via direct use of map_blocks that the output dtype is determined via a combination of the input data-type and the dtype argument to map_blocks.

import numpy as np
import dask.array as da
for dtype_in, dtype_map_blocks in [
    (np.uint8, np.float32),
    (np.uint16, np.float32),
    (np.uint32, np.float32),
    (np.float32, np.float16),
    (np.float32, np.float32),
    (np.float32, np.float64),
    (np.float64, np.float16),
    (np.float64, np.float32),
    (np.float64, np.float64)
]:
    x = da.from_array(np.arange(64, dtype=dtype_in))

    out = da.map_blocks(np.sqrt, x, chunks=(8,), dtype=dtype_map_blocks).compute()
    print(f"dtype_in={np.dtype(dtype_in).name}, "
          f"dtype_map_blocks={np.dtype(dtype_map_blocks).name} -> "
          f"dtype_out={out.dtype.name}")

dtype_in=uint8, dtype_map_blocks=float32 -> dtype_out=float16
dtype_in=uint16, dtype_map_blocks=float32 -> dtype_out=float3
8000
2
dtype_in=uint32, dtype_map_blocks=float32 -> dtype_out=float64
dtype_in=float32, dtype_map_blocks=float16 -> dtype_out=float32
dtype_in=float32, dtype_map_blocks=float32 -> dtype_out=float32
dtype_in=float32, dtype_map_blocks=float64 -> dtype_out=float32
dtype_in=float64, dtype_map_blocks=float16 -> dtype_out=float64
dtype_in=float64, dtype_map_blocks=float32 -> dtype_out=float64
dtype_in=float64, dtype_map_blocks=float64 -> dtype_out=float64

emmanuelle · 2020-08-19T14:23:46Z

One subtlety: specifying dtype does not guarantee the specific output dtype that will be returned by apply_parallel. For floating point inputs, the dtype of the output seems to match precision of the input and not that of the specified dtype argument.

Should the docstring be modified to suggest that this parameter is here to help dask, but is by no means a guarantee of the output dtype?

emmanuelle

Thanks @grlee77 ! I just left a small comment.

Your PR also reminds me that we should incorporate the doc on apply_parallel of #4214 and #3386 but this is independent of this PR!

sciunto · 2020-09-05T06:38:08Z

thank you @grlee77 !

grlee77 added 5 commits August 18, 2020 15:30

allow user-specified output dtype for apply_parallel

018af92

Add multichannel argument to handle chunks and depth properly for RGB…

b2fb948

… images

add apply_parallel test cases for rgb images

fbc1acb

test apply_parallel output dtype as well

1da8af8

clarification in dtype docstring

678f921

alexdesiqueira added ⏩ type: Enhancement Improve existing features 🧐 Needs review labels Aug 18, 2020

sciunto added this to the 0.18 milestone Aug 19, 2020

emmanuelle approved these changes Aug 19, 2020

View reviewed changes

grlee77 mentioned this pull request Aug 24, 2020

wrapped function option dask/dask-image#152

Closed

alexdesiqueira mentioned this pull request Aug 24, 2020

2020's calendar of community management #4486

Closed

sciunto added the action: mrg+1 label Sep 5, 2020

sciunto merged commit 3a0ac03 into scikit-image:master Sep 5, 2020

grlee77 deleted the apply_parallel_rgb branch July 8, 2021 20:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fixes to apply_parallel for functions working with multichannel data #4927

Fixes to apply_parallel for functions working with multichannel data #4927

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Fixes to apply_parallel for functions working with multichannel data #4927

Fixes to apply_parallel for functions working with multichannel data #4927

Uh oh!

Conversation

Description

Checklist

For reviewers

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!