Simple Python bindings for @ggerganov's llama.cpp
library.
This package provides:

- Low-level access to the C API via the `ctypes` interface
- High-level Python API for text completion (see the example below)
- OpenAI-like API
- LangChain compatibility
Documentation is available at https://llama-cpp-python.readthedocs.io/en/latest.
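As a quick illustration of the high-level API, here is a minimal sketch of running a text completion; the model path is a placeholder and assumes you have already downloaded a ggml-format model:

```python
from llama_cpp import Llama

# Load a ggml-format model from disk (the path is a placeholder).
llm = Llama(model_path="./models/7B/ggml-model.bin")

# Run a text completion; generation stops early at any stop sequence.
output = llm(
    "Q: Name the planets in the solar system? A: ",
    max_tokens=32,
    stop=["Q:", "\n"],
    echo=True,
)
print(output["choices"][0]["text"])
```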
Install from PyPI (requires a C compiler):

```bash
pip install llama-cpp-python
```
The above command will attempt to install the package and build llama.cpp
from source.
This is the recommended installation method as it ensures that llama.cpp
is built with the available optimizations for your system.
If you have previously installed `llama-cpp-python` through pip and want to upgrade to a newer version or rebuild the package with different compiler options, add the following flags to ensure that the package is rebuilt correctly:

```bash
pip install llama-cpp-python --force-reinstall --upgrade --no-cache-dir
```
Note: If you are using an Apple Silicon (M1) Mac, make sure you have installed a version of Python that supports the arm64 architecture. For example:

```bash
wget https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-MacOSX-arm64.sh
bash Miniforge3-MacOSX-arm64.sh
```
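To confirm that your interpreter is running natively on arm64 rather than under Rosetta, a quick generic Python check (not specific to this package) is:

```python
import platform

# Prints "arm64" for a native Apple Silicon build,
# "x86_64" if the interpreter is running under Rosetta.
print(platform.machine())
```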
Otherwise, the installation will build the x86 version of llama.cpp, which will be 10x slower on Apple Silicon (M1) Macs.
`llama.cpp` supports multiple BLAS backends for faster processing. Use the `FORCE_CMAKE=1` environment variable to force the use of cmake and install the pip package for the desired BLAS backend.
To install with OpenBLAS, pass the `-DLLAMA_OPENBLAS=on` CMake flag via `CMAKE_ARGS` before installing:

```bash
CMAKE_ARGS="-DLLAMA_OPENBLAS=on" FORCE_CMAKE=1 pip install llama-cpp-python
```
To install with cuBLAS, pass the `-DLLAMA_CUBLAS=on` CMake flag via `CMAKE_ARGS` before installing:

```bash
CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 pip install llama-cpp-python
```