First of all, thanks a lot for your useful, nice and interesting Java implementation of HyperMinHash.
My question is about this: https://github.com/LiveRamp/HyperMinHash-java/blob/master/src/main/java/com/liveramp/hyperminhash/BetaMinHashCombiner.java#L54
As I read arXiv:1710.08436, while mergeability is trivial, I think it's not trivial that Jaccard Index estimation for multiple (> 2) sets works properly.
- Does the estimation still have same accuracy as of 2-set Jaccard Index ?
- If so, is there any proof ?
Sorry for obscure question. Thanks again.