[go: up one dir, main page]

Skip to content

nyunAI/Faster-LLM-Survey

 
 

Repository files navigation

Load model

mkdir model && cd model git lfs install git clone https://huggingface.co/meta-llama/Llama-2-7b-hf

Engines

Llama.cpp

bash llamacpp/setup.sh bash llamacpp/generate.sh to generate ggml compiled files for fp16 and q4_0, q4_k_m quantized models bash llamacpp/bench.sh to benchmark the above the

GPTQ via AutoGPTQ

quantized with alpaca. Once can use other dataset versions and corresponding dataset loading implementation.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published