horizontally threaded labeled regions #4060
-
Hi @andreas-kupries,

That's a very cool idea! I suppose the difficulty (if any) would be that the memory use could be extremely high (much larger than the source image) in pathological cases. I think your worst case would be a checkerboard pattern (is this right?), where the sparse range data would be number-of-pixels * (3 * sizeof(uint32) + 1), or 9 times larger than the original image.

Perhaps this doesn't matter, but I wonder if a tile-based (rather than scanline-based) approach could get the worst-case memory use down? Cut the image into e.g. 128x128 tiles, find connected components in parallel in each tile with a conventional flood-fill approach (libvips has one of these), and write the separate tiles out.

A second phase, running in parallel as tiles are computed, could find a rename array for each tile. It could look along the tile edges where they join, and if the new tile had a different label for the same pixel value, it could add an entry to a rename array for the new tile. You'd have to wait for the whole of phase 2 to complete and write back to e.g. disc, but a final stage could then push the image through the set of rename arrays to create a single unified label image.

When you compute the rename arrays, the pathological case is a rename for every pixel along the joining edges, so 2 * 128 * sizeof(uint32), or 1024 bytes. A tile is 64 KB, so peak memory use will be 1/64th of the image size.
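A minimal sketch (not libvips code, and much simplified) of the tile-based scheme described above: label each tile independently with a conventional flood fill, then treat the rename arrays as a union-find table built by scanning the tile seams, and finally push every pixel through the renames. The binary-image assumption, the tiny tile size, and all names here are illustrative.

```python
from collections import deque

def label_tiles(img, tile=4):
    """Label 4-connected components of a binary image (list of 0/1 rows)."""
    h, w = len(img), len(img[0])
    labels = [[0] * w for _ in range(h)]
    next_label = 1

    # Phase 1: flood fill each tile on its own; labels are unique per tile.
    for ty in range(0, h, tile):
        for tx in range(0, w, tile):
            for y in range(ty, min(ty + tile, h)):
                for x in range(tx, min(tx + tile, w)):
                    if img[y][x] and not labels[y][x]:
                        labels[y][x] = next_label
                        q = deque([(y, x)])
                        while q:
                            cy, cx = q.popleft()
                            for ny, nx in ((cy-1,cx),(cy+1,cx),(cy,cx-1),(cy,cx+1)):
                                if (ty <= ny < min(ty + tile, h) and
                                        tx <= nx < min(tx + tile, w) and
                                        img[ny][nx] and not labels[ny][nx]):
                                    labels[ny][nx] = next_label
                                    q.append((ny, nx))
                        next_label += 1

    # Phase 2: "rename arrays" as union-find. Where two tiles join and both
    # sides of the seam are foreground, the two labels name the same region.
    parent = list(range(next_label))
    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a
    for y in range(h):
        for x in range(w):
            if not img[y][x]:
                continue
            if y % tile == 0 and y > 0 and img[y-1][x]:   # horizontal seam
                parent[find(labels[y][x])] = find(labels[y-1][x])
            if x % tile == 0 and x > 0 and img[y][x-1]:   # vertical seam
                parent[find(labels[y][x])] = find(labels[y][x-1])

    # Final pass: push every pixel through the renames, compacting labels.
    compact = {}
    for y in range(h):
        for x in range(w):
            if labels[y][x]:
                root = find(labels[y][x])
                labels[y][x] = compact.setdefault(root, len(compact) + 1)
    return labels
```

Phase 2 only ever touches the seam rows and columns, which is where the small worst-case bound on the rename data comes from.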
-
See CC dev docs for a description of my take on calculating labeled regions / connected components in a horizontally threaded manner.
Example results at:
Hopefully this inspires ideas for how VIPS can do this as well.
My primary function (See 1 above) actually does not return an image. It instead returns a data structure describing the regions: for each region its area, bounding box, and the range pieces it consists of.
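A rough sketch, assuming a binary input, of such a run (range) based connected-components pass: the result is not a label image but a list of region records, each with its area, bounding box, and the horizontal ranges it consists of. The record layout and names are my own illustration, not the aktive implementation.

```python
def regions_from_runs(img):
    """Group horizontal foreground runs of a binary image into regions."""
    runs = []                                 # (y, x_start, x_end), row order
    for y, row in enumerate(img):
        x, w = 0, len(row)
        while x < w:
            if row[x]:
                start = x
                while x < w and row[x]:
                    x += 1
                runs.append((y, start, x))    # half-open [start, x)
            else:
                x += 1

    # Union-find over run ids: runs in adjacent rows whose x-ranges
    # overlap belong to the same region (4-connectivity).
    parent = list(range(len(runs)))
    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a
    for i, (y1, s1, e1) in enumerate(runs):
        for j in range(i + 1, len(runs)):
            y2, s2, e2 = runs[j]
            if y2 > y1 + 1:
                break                         # runs are sorted by row
            if y2 == y1 + 1 and s2 < e1 and s1 < e2:
                parent[find(i)] = find(j)

    # Aggregate each root's runs into one region record.
    # bbox is [x_min, y_min, x_max, y_max], half-open.
    regions = {}
    for i, (y, s, e) in enumerate(runs):
        r = regions.setdefault(find(i), {
            "area": 0, "bbox": [s, y, e, y + 1], "ranges": []})
        r["area"] += e - s
        b = r["bbox"]
        b[0], b[1] = min(b[0], s), min(b[1], y)
        b[2], b[3] = max(b[2], e), max(b[3], y + 1)
        r["ranges"].append((y, s, e))
    return list(regions.values())
```

Note the memory trade-off discussed above: in the checkerboard worst case every run is a single pixel, so the run list grows to several words per pixel.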
The secondary function (See 2 above) uses (1), transforms the data structure it gets a bit, then feeds the result into from sparse ranges, a virtual image that takes range information to describe its content. The other, non-range information is placed into the image metadata.
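A toy version of that second step, to make the idea concrete: given per-region range lists, materialize an image in which every pixel of region k reads back as k + 1. The function name and region format are illustrative only, not aktive's actual from sparse ranges operator (which is virtual and never materializes the whole image at once).

```python
def image_from_ranges(width, height, regions):
    """Render region range lists back into a label image (lists of ints)."""
    img = [[0] * width for _ in range(height)]
    for k, region in enumerate(regions, start=1):
        for y, x_start, x_end in region["ranges"]:   # half-open x ranges
            for x in range(x_start, x_end):
                img[y][x] = k
    return img
```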
See https://core.tcl-lang.org/akupries/aktive/file?ci=7215f3713794bae4&name=etc/transformer/cc.tcl&ln=34-45 for the implementation.