Population Count Op · Issue #36380 · pytorch/pytorch · GitHub

Open
tom-bird opened this issue Apr 10, 2020 · 5 comments
Labels
actionable · feature (A request for a proper, new feature.) · module: python frontend (For issues relating to PyTorch's Python frontend) · triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments

@tom-bird commented Apr 10, 2020

🚀 Feature

It would be great to have the bitwise operation 'population count', which counts the number of 1 bits in each element, exposed.

Not sure whether this already exists in the torch backend? It is exposed in TensorFlow.

Motivation

This is a key operation for implementing binarised neural nets, for example XNOR-Net. Binarised linear and convolutional layers can be computed quickly using XNOR and population counts. XNOR is already expressible via the PyTorch API (e.g. bitwise_xor followed by bitwise_not).
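To illustrate the motivation, here is a minimal pure-Python sketch (not part of the proposal; the function name `bin_dot` and the ±1 packing convention are my own assumptions): a binarised dot product reduces to an XNOR followed by a population count.

```python
def bin_dot(a_bits: int, b_bits: int, n: int) -> int:
    """Dot product of two {-1, +1} vectors of length n, each packed
    into an int (bit = 1 means +1, bit = 0 means -1), via XNOR + popcount."""
    mask = (1 << n) - 1
    # XNOR marks positions where the two vectors agree; popcount counts them.
    matches = bin(~(a_bits ^ b_bits) & mask).count("1")
    # Each match contributes +1, each mismatch -1.
    return 2 * matches - n

# identical vectors (+1, -1, +1): dot product is n = 3
print(bin_dot(0b101, 0b101, 3))
# fully opposite vectors: dot product is -n = -3
print(bin_dot(0b101, 0b010, 3))
```

On GPU-sized tensors the same idea applies word-wise: XNOR via bitwise ops, then a popcount per 64-bit word, which is what this issue asks to expose natively.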

cc @albanD

@Adam-Vandervorst

Any update or workaround here?

@Exferro commented Mar 13, 2024

I will express interest too, since NumPy is about to add np.bitwise_count in its upcoming major release (NumPy 2.0). I benchmarked it a bit, and here's what I got:

  1. My custom implementation of popcount64c, taken from the Wikipedia page on Hamming weight (population count), takes ~800 ms to popcount an array a = np.random.randint(20000, size=(10000, 10000)).
  2. The new np.bitwise_count, which maps to the underlying CPU instruction, takes ~50 ms, a 16x speedup.
  3. For context, np.bitwise_and(b, b) takes ~75 ms, so bitwise_and gives a rough upper bound on the time a popcount should need.
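For reference, the popcount64c bit-twiddling variant mentioned above can be sketched in plain Python (a minimal sketch; the name `popcount64` is my own, and the masking to 64 bits is an assumption to mimic fixed-width integers):

```python
def popcount64(x: int) -> int:
    """SWAR population count for a 64-bit value (Wikipedia's popcount64c)."""
    x &= 0xFFFFFFFFFFFFFFFF                                          # clamp to 64 bits
    x = x - ((x >> 1) & 0x5555555555555555)                          # 2-bit field sums
    x = (x & 0x3333333333333333) + ((x >> 2) & 0x3333333333333333)   # 4-bit field sums
    x = (x + (x >> 4)) & 0x0F0F0F0F0F0F0F0F                          # 8-bit field sums
    # Multiply-and-shift gathers all byte sums into the top byte.
    return ((x * 0x0101010101010101) & 0xFFFFFFFFFFFFFFFF) >> 56

print(popcount64(0b1011))       # → 3
print(popcount64(2**64 - 1))    # → 64
```

Vectorised over a NumPy int64 array, the same five operations give the ~800 ms figure above; a single hardware POPCNT instruction replaces all of them, hence the speedup.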

Moving to GPU, I see the following:

  1. My custom implementation of popcount64c takes ~30 ms.
  2. torch.bitwise_and(b, b) takes ~2.5 ms.
  3. I would therefore expect a native PyTorch popcount, which maps to a single hardware instruction, to also take a few milliseconds, again a ~10x speedup.

The code I work on daily is quite specific, and I use popcount a lot there.
I would really appreciate a native popcount in PyTorch :) Thank you!

@albanD albanD added the module: python frontend For issues relating to PyTorch's Python frontend label Apr 15, 2024
@albanD (Collaborator) commented Apr 15, 2024

Quick update: I think the fact that NumPy added it is a good signal that we want it as well, for best compatibility.
We would be happy to accept a PR that adds this new unary op!

@Felix-Petersen

I am strongly in support of including popcount in torch. However, I want to remark that if we follow the NumPy variant, it would be important to also support uint64 etc. (#58734), since numpy.bitwise_count "computes the number of 1-bits in the absolute value of x". This matters if one wants an actual popcount over all bits of the datatype: an all-ones int64 (i.e. the value -1) yields bitwise_count = 1, while an all-ones uint64 yields bitwise_count = 64.
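The absolute-value semantics can be illustrated without NumPy (a plain-Python sketch of the behaviour described in the numpy.bitwise_count docs, using Python ints to stand in for the fixed-width types):

```python
# An all-ones int64 is the value -1; bitwise_count uses |x|, and |−1| = 1
# has exactly one bit set.
print(bin(abs(-1)).count("1"))      # → 1

# An all-ones uint64 is the value 2**64 - 1, i.e. all 64 bits set.
print(bin(2**64 - 1).count("1"))    # → 64
```

So a signed-only popcount cannot express "count every bit of the word" for negative values, which is why uint64 support matters here.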

@lapp0 commented Sep 4, 2024

Throwing this in here in case it helps anyone: Torch bit packing and packed-tensor bit counting. I went through a few iterations to arrive at a performant solution. I'm in support of this feature's inclusion natively, but here's a working solution in the meantime.

(Requires you to flatten the tensor before bit packing.)

import torch

def _pack_bit_tensor(bool_tensor):
    """Pack a 1-D boolean tensor into an int64 tensor, 64 bits per element."""
    assert bool_tensor.dim() == 1
    # Pad with zeros so the length is a multiple of 64.
    padding = (64 - bool_tensor.shape[0] % 64) % 64
    if padding > 0:
        pad = torch.zeros(padding, dtype=torch.bool, device=bool_tensor.device)
        bool_tensor = torch.cat([bool_tensor, pad])

    bit_groups = bool_tensor.view(-1, 64)
    shifts = torch.arange(64, device=bool_tensor.device, dtype=torch.int64)

    # Place each bit at its position and sum each group into one int64 word.
    packed_tensor = torch.sum(bit_groups.to(torch.int64) << shifts, dim=-1)
    return packed_tensor


def _bit_tensor_sum(packed_tensor):
    """Count the total number of 1-bits in a packed int64 tensor (SWAR Hamming weight)."""
    count = packed_tensor
    count = count - ((count >> 1) & 0x5555555555555555)                          # 2-bit field sums
    count = (count & 0x3333333333333333) + ((count >> 2) & 0x3333333333333333)   # 4-bit field sums
    count = (count + (count >> 4)) & 0x0F0F0F0F0F0F0F0F                          # 8-bit field sums
    count = (count * 0x0101010101010101) >> 56                                   # total per 64-bit word
    return torch.sum(count).item()
