10BC0 Gemma 3 support (text only) by jordimas · Pull Request #1936 · OpenNMT/CTranslate2 · GitHub
[go: up one dir, main page]

Skip to content

Conversation

@jordimas
Copy link
Collaborator
@jordimas jordimas commented Nov 23, 2025

Gemma 3 text only model support

Pending:

  • More testing with some real use cases
  • Test with all model family variant

In case you want to test it, wheels are built here:

https://github.com/OpenNMT/CTranslate2/actions/runs/19616197187#artifacts

@vince62s
Copy link
Member

while you implement q/k_norm just in case you want to implement Hunyuan-MT-7B (probably the best MT model so far) they do q/k_norm AFTER rotary instead of before.
FYI, Eole implementation here: https://github.com/eole-nlp/eole/blob/main/eole/modules/multi_headed_attn.py#L259-L274

@jordimas jordimas changed the title Gemma 3 only support Gemma 3 support (text only) Nov 23, 2025
@jordimas jordimas mentioned this pull request Nov 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants

0