Description
Proposed API
from llama_cpp import Llama

llama = Llama.from_pretrained(
    "TheBloke/dolphin-2_6-phi-2-GGUF",
    ...
    n_gpu_layers=-1
)
This will likely be implemented via the huggingface_hub package. I intend to keep that dependency optional and simply raise an error if from_pretrained is called without it installed.
Questions
- Pull full repo or just single file?
- Which quant level to use?