Description
While playing around, I noticed the embeddings are only 512 floats rather than the 4096 you get when using the standalone application.
So I went digging and found the culprit: a copy-paste leftover in the function `llama_n_embd`.
llama-cpp-python/llama_cpp/llama_cpp.py, lines 220 to 221 at commit 6d1bda4
It's calling `llama_n_ctx` rather than `llama_n_embd`.
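
For reference, here is a minimal sketch of what the fix looks like, assuming the two-line ctypes wrapper pattern used throughout llama_cpp.py; the `_lib` handle and the simplified `llama_context_p` type below are placeholders for illustration, not the exact code from the file:

```python
import ctypes

# Placeholder: in llama_cpp.py, _lib is the ctypes handle to the shared
# llama library, loaded elsewhere in the module.
_lib = ctypes.CDLL("libllama.so")

# Simplified stand-in for the real context pointer type.
llama_context_p = ctypes.c_void_p

# Buggy version (copy-paste leftover): the wrapper is named llama_n_embd but
# forwards to llama_n_ctx, so callers get the context size (512) instead of
# the embedding size (4096).
# def llama_n_embd(ctx: llama_context_p) -> ctypes.c_int:
#     return _lib.llama_n_ctx(ctx)

# Fixed version: forward to the matching C symbol.
def llama_n_embd(ctx: llama_context_p) -> ctypes.c_int:
    return _lib.llama_n_embd(ctx)

_lib.llama_n_embd.argtypes = [llama_context_p]
_lib.llama_n_embd.restype = ctypes.c_int
```

The only change needed is pointing the wrapper at the C function whose name it already carries.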
I don't think this warrants a pull request since it's a trivial fix, so I opened this issue instead.
Keep up the good work :)