fix penalty · InternLM/lmdeploy@c5111b6 · GitHub
Commit c5111b6

fix penalty
1 parent 62f4224 commit c5111b6

1 file changed (+2, -0 lines changed)

src/turbomind/kernels/sampling_penalty_kernels.cu

Lines changed: 2 additions & 0 deletions
@@ -418,6 +418,8 @@ __global__ void batchApplyRepetitionPenalty(T* logits,
         }
     }
 
+    __syncthreads();
+
     // Phase 2. Replace a logit value by the penalized one.
     for (int index = threadIdx.x; index < step; index += blockDim.x) {
         // skip prompt
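
The commit message does not say why the barrier is needed, so the following is only a sketch of one plausible reading of the hunk above. It assumes Phase 1 reads the original logits and Phase 2 overwrites them. Under that assumption, a token id that occurs more than once in the history could be read by one thread after another thread has already written its penalized value, and so be penalized twice unless the block synchronizes between the phases. The kernel `applyPenaltyTwoPhase`, the host helper `runPenalty`, and their parameters are hypothetical and heavily simplified (single block, `float` only, no prompt skipping); they are not the lmdeploy code.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical, simplified kernel: one block handles one sequence of `step` tokens.
__global__ void applyPenaltyTwoPhase(float* logits, const int* token_ids, int step, float penalty)
{
    extern __shared__ float penalized[];  // one slot per history position

    // Phase 1. Read the ORIGINAL logit of every history token and stage the penalized value.
    for (int index = threadIdx.x; index < step; index += blockDim.x) {
        float logit      = logits[token_ids[index]];
        penalized[index] = logit < 0.0f ? logit * penalty : logit / penalty;
    }

    // Barrier: no thread may overwrite a logit until all threads have finished reading.
    __syncthreads();

    // Phase 2. Replace a logit value by the penalized one.
    for (int index = threadIdx.x; index < step; index += blockDim.x) {
        logits[token_ids[index]] = penalized[index];
    }
}

// Hypothetical host helper: copies inputs to the device, launches one block, returns the logits.
std::vector<float> runPenalty(std::vector<float> logits, const std::vector<int>& token_ids, float penalty)
{
    const int    step        = static_cast<int>(token_ids.size());
    const size_t logit_bytes = logits.size() * sizeof(float);
    const size_t id_bytes    = token_ids.size() * sizeof(int);

    float* d_logits = nullptr;
    int*   d_ids    = nullptr;
    cudaMalloc(&d_logits, logit_bytes);
    cudaMalloc(&d_ids, id_bytes);
    cudaMemcpy(d_logits, logits.data(), logit_bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(d_ids, token_ids.data(), id_bytes, cudaMemcpyHostToDevice);

    // Dynamic shared memory holds the staged values, one float per history position.
    applyPenaltyTwoPhase<<<1, 256, step * sizeof(float)>>>(d_logits, d_ids, step, penalty);

    // cudaMemcpy on the default stream waits for the kernel to finish.
    cudaMemcpy(logits.data(), d_logits, logit_bytes, cudaMemcpyDeviceToHost);
    cudaFree(d_logits);
    cudaFree(d_ids);
    return logits;
}
```

With the barrier in place, duplicate token ids are harmless in this sketch: every thread that handles the same token computes the same penalized value from the same original logit, so the repeated writes in Phase 2 all store an identical result.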
