sklearn.manifold.TSNE, ValueError: Buffer dtype mismatch, expected 'float_t' but got 'float' #4124

h10r · 2015-01-19T13:25:58Z

Thank you so much for adding sklearn.manifold.TSNE, I'm very excited that it's now part of sklearn! However, when I try using it, I get the following error message (both with 0.15.2 and 0.16):

Traceback (most recent call last):
File "main.py", line 104, in <module>
vectors_tsne = do_tsne_on_vectors( vectors_in_file )
File "main.py", line 56, in do_tsne_on_vectors
return tsne.fit_transform( vectors )
File "(...)/sklearn/manifold/t_sne.py", line 519, in fit_transform
self._fit(X)
File "(...)/sklearn/manifold/t_sne.py", line 444, in _fit
P = _joint_probabilities(distances, self.perplexity, self.verbose)
File "(...)/sklearn/manifold/t_sne.py", line 51, in _joint_probabilities
distances, desired_perplexity, verbose)
File "_utils.pyx", line 14, in sklearn.manifold._utils._binary_search_perplexity (sklearn/manifold/_utils.c:2023)
ValueError: Buffer dtype mismatch, expected 'float_t' but got 'float'

To fix this, I did:

vectors = np.asfarray( vectors, dtype='float' )

which seems to work for me.

My input is a list of vectors I get from gensim.models.word2vec.

Kind regards and thanks for the great library!
Hendrik

The text was updated successfully, but these errors were encountered:

amueller · 2015-01-20T16:38:51Z

Hi. Thanks for the report.
Can you please provide the code that you are using, type and dtype of your input (vectors)?
What OS are you on (and 32bit or 64bit?)

h10r · 2015-01-20T17:04:58Z

Hi Andreas,

I'm on Mac OS X 10.9.5 (13F34) with 64 bit.

The code

def do_tsne_on_vectors( vectors ):
    tsne_array = np.array( vectors )
    tsne = TSNE(n_components=2, random_state=0)
    return tsne.fit_transform( vectors )

When I do tsne_array.dtype, I get float32 and <type 'numpy.ndarray'>.

The vector just contains numbers:

[  1.84206292e-02   1.06022261e-01  -1.34022012e-01  -6.32563373e-03
   7.17194155e-02   3.79017629e-02  -2.69094426e-02  -1.99426636e-01
  -1.18101127e-02   2.96421014e-02  -1.06844241e-02  -1.01331301e-01 ...

SKlearn version:

VERSION
    0.15.2

Kind regards,
Hendrik

amueller · 2015-01-20T18:03:51Z

Thanks, I can reproduce. That is a bug :-/

h10r · 2015-01-20T19:41:43Z

Ah, too bad. As mentioned, vectors = np.asfarray( vectors, dtype='float' ) is a possible workaround that works for me.

Again, thanks a lot for your work and this great library!
All the best from Stockholm,
Hendrik

amueller · 2015-01-20T20:13:26Z

Yeah, the fix here is pretty simple. I just have to come up with a way to test that this problem doesn't show up anywhere in the library ever again ;)

amueller · 2015-02-25T18:56:55Z

Fixed in #4136.

amueller added the Bug label Jan 20, 2015

amueller added this to the 0.16 milestone Jan 20, 2015

amueller mentioned this issue Jan 20, 2015

[MRG+1] More robust input validation, more testing. #4136

Merged

amueller closed this as completed Feb 25, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sklearn.manifold.TSNE, ValueError: Buffer dtype mismatch, expected 'float_t' but got 'float' #4124

sklearn.manifold.TSNE, ValueError: Buffer dtype mismatch, expected 'float_t' but got 'float' #4124

sklearn.manifold.TSNE, ValueError: Buffer dtype mismatch, expected 'float_t' but got 'float' #4124

sklearn.manifold.TSNE, ValueError: Buffer dtype mismatch, expected 'float_t' but got 'float' #4124

Comments