8000 Revert "Revert "CUDA: faster softmax via shared memory + fp16 math (#… · LostRuins/koboldcpp@941e70d · GitHub 8000
[go: up one dir, main page]

Skip to content

Commit 941e70d

Browse files
committed
Revert "Revert "CUDA: faster softmax via shared memory + fp16 math (ggml-org#4742)""
This reverts commit 0526cc5.
1 parent 8b2c774 commit 941e70d

File tree

2 files changed

+318
-26
lines changed

2 files changed

+318
-26
lines changed

0 commit comments

Comments
 (0)
0