-
-
Notifications
You must be signed in to change notification settings - Fork 25.9k
[WIP] Metrics testing #4522
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] Metrics testing #4522
Conversation
Opened #4523 with the hope of tracking that issue. |
I've commented on both the issues you referenced. I think the problem is that Scipy plays fast-and-loose with the Jaccard distance definition. This came up on the scipy mailing list a couple years ago; see my explanation here: http://mail.scipy.org/pipermail/scipy-dev/2012-December/018129.html |
@TomDLT @amueller now that #6932 has been merged should we try to resuscitate this PR? I tried rebasing this PR on master but there seems to be another error that I haven't investigated yet:
|
We probably should. Related: #6915 |
Good idea, but we probably need to start fresh here. Happy to have it reopened @amueller if you fancy continuing this PR. |
There have recently popped up some issues about support for different metrics. See #4520 and #4452 for example.
I am trying to add more tests, but I am not sure I am familiar enough with the metrics / neighbors modules.
@jakevdp your help would be much appreciated.
Currently travis fails because the Jaccard distances in the trees seem to be very different from the scipy ones. I think this is because our trees cast everything to bool, while scipy uses floats.
This seems to be a pretty big issues, as the algorithm that is used might change automatically depending on the dataset!!!