I notice that the XLA Dot operation copies "outer-product style" broadcast semantics from numpy.dot:
| Input | Output | Semantics |
| --- | --- | --- |
| array [p x q x r] `dot` array [s x r x t] | array [p x q x s x t] | array dot product (read below) |
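For concreteness, here is a minimal NumPy sketch of that behavior (the sizes are arbitrary placeholders):

```python
import numpy as np

# Hypothetical placeholder sizes for p, q, r, s, t.
p, q, r, s, t = 2, 3, 4, 5, 6

a = np.ones((p, q, r))
b = np.ones((s, r, t))

# numpy.dot contracts the last axis of `a` against the second-to-last axis of `b`
# and concatenates all remaining axes, "outer-product style".
out = np.dot(a, b)
print(out.shape)  # (2, 3, 5, 6), i.e. [p x q x s x t]
```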
In brief, I think this is a mistake. It would be better to follow the "matmul style" broadcasting semantics of Python's @ operation and NumPy's matmul.
matmul's broadcasting is much more general and, in my opinion, also easier to understand. For example, it can do batch matrix multiplication, but can also still do outer-product-style broadcasting if you insert dummy dimensions of length 1 (the axes do end up in a different order), e.g. (see the NumPy sketch after these examples):
- batch matmul: `[p x q x r] matmul [p x r x t] -> [p x q x t]`
- outer product matmul: `[p x 1 x q x r] matmul [1 x s x r x t] -> [p x s x q x t]`
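Both cases can be reproduced with NumPy's matmul today; a minimal sketch (again with arbitrary placeholder sizes):

```python
import numpy as np

# Hypothetical placeholder sizes for p, q, r, s, t.
p, q, r, s, t = 2, 3, 4, 5, 6

# Batch matmul: leading axes broadcast against each other; the last two axes
# are treated as the matrix dimensions.
a = np.ones((p, q, r))
b = np.ones((p, r, t))
print(np.matmul(a, b).shape)  # (2, 3, 6), i.e. [p x q x t]

# Outer-product style via length-1 dummy axes that broadcast against each
# other; note the batch axes come out first, as [p x s], ahead of [q x t].
a = np.ones((p, 1, q, r))
b = np.ones((1, s, r, t))
print(np.matmul(a, b).shape)  # (2, 5, 3, 6), i.e. [p x s x q x t]
```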
If we could go back in time as NumPy developers, we assuredly would change dot to work this way (now we cannot, because of backwards compatibility concerns). So it would be nice to change this for XLA before we lock in this behavior.
Based on this input and other considerations, we've decided to restrict the semantics of XLA's Dot operation to 1D and 2D arrays in the initial release. We may consider expanding it to higher dimensions in the future, and at that point we'll be considering different possible semantics. For the time being, however, this issue can be closed.