-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
Add Blas3 to the sources #1311
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I've been trying to avoid this since it's a LOT of code with lots of special cases (much much more than level1-2) and because level3 is O(n^3) operations probably inlining in cython is not so important and the python functions returned by get_blas_funcs would do the job. What's the particular usercase? Is the function missing from get_blas_funcs (in that case I would gladly help you with f2py) ? |
There are two use-cases: the speed-up of euclidean distances by @vene #1006 and neural networks. in case of Vlad's PR, the compilation worked, but there was an linker error on my box. |
If you are speaking about ger that's level 2. |
Anyway sure I could help with particular problems but adding the full level3 specification is IMHO a bit overkill. |
Also, if you want to team up with the scipy guys (I talked with Ralph Gommers at EuroScipy) they where open-minded to expose a C API for BLAS functions (via Cython) , which frankly would be of great benefit for us. |
Yeah, exposing blas via Cython is really what we want. We would have to maintain a backport but that would be better than doing it ourselves. Yes, I wanted ger. Having the same for batches would be gemm, right? So this would be what I want.... |
Not sure what @vene wants, though. |
Adding gemm seems reasonable to me. I see in @vene's PR references to dgemm so I suppose the goal is to add that to the cblas directory, right ? |
yes, that would be great :) |
Done, see #1313 |
@fabianp could you please give me a hand in that?
The text was updated successfully, but these errors were encountered: