[CANN]: add the basic supports of Flash Attention kernel by shibizhao · Pull Request #13627 · ggml-org/llama.cpp · GitHub
