⚓ T331505 Self hosted machine translation service

Subject	Repo	Branch	Lines +/-
CX3 Build 0.2.0+20230529	mediawiki/extensions/ContentTranslation	master	+4 K -4 K
cxserver: Remove Flores MT service	operations/deployment-charts	master	+2 -10
Replace references to Flores by MinT and remove custom label	mediawiki/extensions/ContentTranslation	master	+5 -7
Remove Flores client as it is replaced by MinT	mediawiki/services/cxserver	master	+0 -233
cxserver: mesh configuration updated	operations/deployment-charts	master	+1 -1
cxserver: Bump chart version	operations/deployment-charts	master	+1 -1
cxserver: Enable machintranslation proxy	operations/deployment-charts	master	+7 -0
Add MinT support to cxserver	operations/deployment-charts	master	+7 -1
Add MinT service to production config	mediawiki/services/cxserver	master	+3 -0
services_proxy: Add machinetranslation	operations/puppet	production	+5 -0
machinetranslation: Switch service::catalog to production	operations/puppet	production	+1 -1
Update MinT to 2023-05-10-045734-production	operations/deployment-charts	master	+1 -1
Header and response fixes to the spec	mediawiki/services/machinetranslation	master	+8 -2
Update MinT to 2023-05-09-110213-production	operations/deployment-charts	master	+1 -1
OpenAPI spec: Fix translate API parameter name	mediawiki/services/machinetranslation	master	+4 -4
Add machinetranslation service RRs	operations/dns	master	+17 -15
service::catalog: Add machinetranslation service	operations/puppet	production	+15 -0
Update cxserver to 2023-05-03-044244-production	operations/deployment-charts	master	+1 -1
machinetranslation: Remove args, document env vars	operations/deployment-charts	master	+14 -7
Ship a prometheus-statsd-export configuration	operations/deployment-charts	master	+20 -58
Allow passing env var GUNICORN_WORKERS	mediawiki/services/machinetranslation	master	+2 -3
machinetranslation: networkpolicy for metrics-exporter	operations/deployment-charts	master	+6 -1
machinetranslation: Use 2023-05-03-104124-production	operations/deployment-charts	master	+1 -1
Install wmf-certificates in the image	mediawiki/services/machinetranslation	master	+1 -1
machinetranslation: Add people to egress	operations/deployment-charts	master	+20 -0
machinetranslation: Support configuration as env variables	operations/deployment-charts	master	+7 -1
machinetranslation: Support ingress in chart	operations/deployment-charts	master	+20 -4
machinetranslation: Deploy 2023-05-02-080334-production	operations/deployment-charts	master	+1 -1
Fix OpenAPI spec endpoints	mediawiki/services/machinetranslation	master	+11 -7
machinetranslation: Enable ingress functionality	operations/deployment-charts	master	+6 -0
machinetranslation: Enable thanos-swift service mesh	operations/deployment-charts	master	+4 -0
machinetranslation: Bump limitranges and resourcequotas	operations/deployment-charts	master	+93 -1
machinetranslation: Enable monitoring	operations/deployment-charts	master	+1 -1
machinetranslation: Switch to 2023-04-27-093807-production	operations/deployment-charts	master	+1 -1
machinetranslation: Bump to mesh.configuration 1.2.0	operations/deployment-charts	master	+53 -23
Switch gunicorn's statsd_host setting to not hardcode production	mediawiki/services/machinetranslation	master	+1 -1
Various improvements to server.sh, rename to entrypoint.sh	mediawiki/services/machinetranslation	master	+63 -40
machinetranslation: Fix requests vs limits	operations/deployment-charts	master	+2 -2
machinetranslation: deployment_server stanzas	operations/puppet	production	+10 -6
Add new self hosted machinetranslation service (MinT)	operations/deployment-charts	master	+1 K -0
admin_ng: Create machinetranslation namespace	operations/deployment-charts	master	+1 -0
Add machinetranslation tokens	labs/private	master	+4 -0
Add MinT MT Client	mediawiki/services/cxserver	master	+131 -0
Add statsd metrics reporting and ecs structured logging	mediawiki/services/machinetranslation	master	+145 -31
ci: blubber configuration	mediawiki/services/machinetranslation	master	+34 -0
Define test and publish pipeline for mediawiki/services/machinetranslation	integration/config	master	+23 -0

Status	Assigned	Task
Resolved	santhosh	T331505 Self hosted machine translation service
Resolved	Pginer-WMF	T329971 New Service Deployment Request: NNLB-200 for machine translation
Resolved	Pginer-WMF	T331256 Create Gerrit repository for /services/machinetranslation and migrate code from Gitlab
Resolved	santhosh	T331835 Create new MT client in cxserver for self hosted MT service
Resolved	santhosh	T331836 Support multiple MT models in self hosted machine translation service
Resolved	santhosh	T331837 Merge OpusMT service into the self hosted Ctranslate2 based MT service
Resolved	santhosh	T335472 machinetranslation service hardcodes statsd_host and doesn't support altering the port
Open	None	T335491 Provide better long-term storage for translation models
Resolved	Joe	T337284 Remove Flores key from production

Change 912811 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Switch to 2023-04-27-093807-production

https://gerrit.wikimedia.org/r/912811

Change 912812 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Enable monitoring

https://gerrit.wikimedia.org/r/912812

Change 912863 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[mediawiki/services/machinetranslation@master] Fix OpenAPI spec endpoints

https://gerrit.wikimedia.org/r/912863

In T331505#8810129, @Pginer-WMF wrote:

In T331505#8809552, @elukey wrote:

Thanks a lot for the info! Is it going to be the permanent solution? I am asking since the ML cluster uses Swift, that may be more resilient long term.

I don't expect to be a permanent solution. With the large size of models, this was more of a pragmatic solution to have the system up and running. The initial goal is to replace the existing service on AWS. For example, we plan to enable initially the service for the same 23 languages, although the models support over 200. Once we can replace the current service, I expect follow-up iterations for improvement to expand languages (T326578), models (T333969) and other infrastructure aspects that can make the service better (more robust, maintainable, etc.).

I created a ticket based on your proposal (T335491: Provide better long-term storage for translation models). Feel free to share more details in the ticket or propose other improvements that can help making the service better.
Thanks @elukey!

Sure sure as initial step it makes sense! My point is that Lift Wing already have this functionality (fetching models from Swift), and the more I see the requirements of this service the more I wonder why it wasn't onboarded as ML service (see https://wikitech.wikimedia.org/wiki/Machine_Learning/LiftWing). It is fine to keep going in this direction, too much work as been done, but worth to keep it in mind for the future.

Change 913108 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: Bump limitranges and resourcequotas

https://gerrit.wikimedia.org/r/913108

Change 913109 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: Enable thanos-swift service mesh

https://gerrit.wikimedia.org/r/913109

Change 913116 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: Enable ingress functionality

https://gerrit.wikimedia.org/r/913116

Change 913152 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/puppet@production] service::catalog: Add machinetranslation service

https://gerrit.wikimedia.org/r/913152

Change 913108 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Bump limitranges and resourcequotas

https://gerrit.wikimedia.org/r/913108

Change 913109 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Enable thanos-swift service mesh

https://gerrit.wikimedia.org/r/913109

Change 913116 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Enable ingress functionality

https://gerrit.wikimedia.org/r/913116

Change 912863 merged by jenkins-bot:

[mediawiki/services/machinetranslation@master] Fix OpenAPI spec endpoints

https://gerrit.wikimedia.org/r/912863

Pginer-WMF added a project: MinT.May 2 2023, 9:07 AM

Change 914322 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: Deploy 2023-05-02-080334-production

https://gerrit.wikimedia.org/r/914322

Change 914322 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Deploy 2023-05-02-080334-production

https://gerrit.wikimedia.org/r/914322

Change 914365 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: Enable ingress in chart

https://gerrit.wikimedia.org/r/914365

Change 914365 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Support ingress in chart

https://gerrit.wikimedia.org/r/914365

Change 914468 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] Update cxserver to 2023-05-03-044244-production

https://gerrit.wikimedia.org/r/914468

Change 914721 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: Support configuration as env variables

https://gerrit.wikimedia.org/r/914721

Change 914722 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: Add people to egress

https://gerrit.wikimedia.org/r/914722

Change 914721 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Support configuration as env variables

https://gerrit.wikimedia.org/r/914721

Change 914722 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Add people to egress

https://gerrit.wikimedia.org/r/914722

Change 914732 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[mediawiki/services/machinetranslation@master] Install wmf-certificates in the image

https://gerrit.wikimedia.org/r/914732

Change 914732 merged by jenkins-bot:

[mediawiki/services/machinetranslation@master] Install wmf-certificates in the image

https://gerrit.wikimedia.org/r/914732

Change 914767 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: Use 2023-05-03-104124-production

https://gerrit.wikimedia.org/r/914767

Change 914767 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Use 2023-05-03-104124-production

https://gerrit.wikimedia.org/r/914767

Change 915364 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: networkpolicy for metrics-exporter

https://gerrit.wikimedia.org/r/915364

Change 915364 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: networkpolicy for metrics-exporter

https://gerrit.wikimedia.org/r/915364

Change 915483 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[mediawiki/services/machinetranslation@master] Allow passing env var GUNICORN_WORKERS

https://gerrit.wikimedia.org/r/915483

Change 915488 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] machinetranslation: Remove args, document env vars

https://gerrit.wikimedia.org/r/915488

Change 915493 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] Ship a prometheus-statsd-export configuration

https://gerrit.wikimedia.org/r/915493

Change 915483 merged by jenkins-bot:

[mediawiki/services/machinetranslation@master] Allow passing env var GUNICORN_WORKERS

https://gerrit.wikimedia.org/r/915483

Change 915493 merged by jenkins-bot:

[operations/deployment-charts@master] Ship a prometheus-statsd-export configuration

https://gerrit.wikimedia.org/r/915493

Change 915488 merged by jenkins-bot:

[operations/deployment-charts@master] machinetranslation: Remove args, document env vars

https://gerrit.wikimedia.org/r/915488

Change 914468 merged by jenkins-bot:

[operations/deployment-charts@master] Update cxserver to 2023-05-03-044244-production

https://gerrit.wikimedia.org/r/914468

Mentioned in SAL (#wikimedia-operations) [2023-05-04T11:38:02Z] <kart_> Updated cxserver to 2023-05-03-044244-production (T333835, T335019, T331505)

Stashbot mentioned this in T333835: Disable machine translation for Cantonese.May 4 2023, 11:38 AM

Stashbot mentioned this in T335019: Post-creation work for fatwiki.

akosiaris closed subtask T335472: machinetranslation service hardcodes statsd_host and doesn't support altering the port as Resolved.May 4 2023, 12:05 PM

KartikMistry updated the task description. (Show Details)May 8 2023, 6:47 AM

Mentioned in SAL (#wikimedia-operations) [2023-05-08T06:48:10Z] <kart_> Deployed MinT to the production (T331505)

akosiaris updated the task description. (Show Details)May 9 2023, 8:12 AM

Change 914351 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/dns@master] Add machinetranslation service RRs

https://gerrit.wikimedia.org/r/914351

Change 913152 merged by Alexandros Kosiaris:

[operations/puppet@production] service::catalog: Add machinetranslation service

https://gerrit.wikimedia.org/r/913152

Change 917819 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] Update MinT to 2023-05-09-082017-production

https://gerrit.wikimedia.org/r/917819

Change 914351 merged by Alexandros Kosiaris:

[operations/dns@master] Add machinetranslation service RRs

https://gerrit.wikimedia.org/r/914351

Change 917828 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[mediawiki/services/machinetranslation@master] OpenAPI spec: Fix translate API parameter name

https://gerrit.wikimedia.org/r/917828

Change 917828 merged by jenkins-bot:

[mediawiki/services/machinetranslation@master] OpenAPI spec: Fix translate API parameter name

https://gerrit.wikimedia.org/r/917828

Change 917819 merged by jenkins-bot:

[operations/deployment-charts@master] Update MinT to 2023-05-09-110213-production

https://gerrit.wikimedia.org/r/917819

Mentioned in SAL (#wikimedia-operations) [2023-05-09T11:36:31Z] <kart_> Updated MinT to 2023-05-09-110213-production (T331505, T335725, T331505)

Change 917906 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[mediawiki/services/machinetranslation@master] Header and response fixes to the spec

https://gerrit.wikimedia.org/r/917906

Change 917906 merged by jenkins-bot:

[mediawiki/services/machinetranslation@master] Header and response fixes to the spec

https://gerrit.wikimedia.org/r/917906

Change 918002 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] Update MinT to 2023-05-10-045734-production

https://gerrit.wikimedia.org/r/918002

Change 918002 merged by jenkins-bot:

[operations/deployment-charts@master] Update MinT to 2023-05-10-045734-production

https://gerrit.wikimedia.org/r/918002

Mentioned in SAL (#wikimedia-operations) [2023-05-10T05:42:43Z] <kart_> Updated MinT to 2023-05-10-045734-production (T331505)

Change 918243 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/puppet@production] machinetranslation: Switch service::catalog to production

https://gerrit.wikimedia.org/r/918243

Change 918243 merged by Alexandros Kosiaris:

[operations/puppet@production] machinetranslation: Switch service::catalog to production

https://gerrit.wikimedia.org/r/918243

Change 911887 merged by Alexandros Kosiaris:

[operations/puppet@production] services_proxy: Add machinetranslation

https://gerrit.wikimedia.org/r/911887

Change 918343 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[mediawiki/services/cxserver@master] Add MinT service to production config

https://gerrit.wikimedia.org/r/918343

Change 918407 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] cxserver: Enable machintranslation proxy

https://gerrit.wikimedia.org/r/918407

Change 918343 merged by jenkins-bot:

[mediawiki/services/cxserver@master] Add MinT service to production config

https://gerrit.wikimedia.org/r/918343

Change 905579 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] Add MinT support to cxserver

https://gerrit.wikimedia.org/r/905579

Change 905579 merged by jenkins-bot:

[operations/deployment-charts@master] Add MinT support to cxserver

https://gerrit.wikimedia.org/r/905579

Change 918407 merged by jenkins-bot:

[operations/deployment-charts@master] cxserver: Enable machintranslation proxy

https://gerrit.wikimedia.org/r/918407

Change 918441 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] cxserver: Bump chart version

https://gerrit.wikimedia.org/r/918441

Change 918441 merged by jenkins-bot:

[operations/deployment-charts@master] cxserver: Bump chart version

https://gerrit.wikimedia.org/r/918441

Change 918509 had a related patch set uploaded (by Alexandros Kosiaris; author: Alexandros Kosiaris):

[operations/deployment-charts@master] cxserver: mesh configuration updated

https://gerrit.wikimedia.org/r/918509

Change 918509 merged by jenkins-bot:

[operations/deployment-charts@master] cxserver: mesh configuration updated

https://gerrit.wikimedia.org/r/918509

santhosh updated the task description. (Show Details)May 11 2023, 5:05 AM

santhosh closed subtask T331836: Support multiple MT models in self hosted machine translation service as Resolved.May 11 2023, 5:08 AM

santhosh closed subtask T331835: Create new MT client in cxserver for self hosted MT service as Resolved.

Pginer-WMF moved this task from Backlog to Infrastructure on the MinT board.May 11 2023, 3:18 PM

Change 922059 had a related patch set uploaded (by Santhosh; author: Santhosh):

[mediawiki/services/cxserver@master] Remove Flores client as it is replaced by MinT

https://gerrit.wikimedia.org/r/922059

Change 922061 had a related patch set uploaded (by Santhosh; author: Santhosh):

[mediawiki/extensions/ContentTranslation@master] Replace references to Flores by MinT and remove custom label

https://gerrit.wikimedia.org/r/922061

Change 922059 merged by jenkins-bot:

[mediawiki/services/cxserver@master] Remove Flores client as it is replaced by MinT

https://gerrit.wikimedia.org/r/922059

Change 922064 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] cxserver: Remove Flores MT service

https://gerrit.wikimedia.org/r/922064

Change 922061 merged by KartikMistry:

[mediawiki/extensions/ContentTranslation@master] Replace references to Flores by MinT and remove custom label

https://gerrit.wikimedia.org/r/922061

ReleaseTaggerBot added a project: MW-1.41-notes (1.41.0-wmf.10; 2023-05-23).May 22 2023, 1:00 PM

Change 922064 merged by jenkins-bot:

[operations/deployment-charts@master] cxserver: Remove Flores MT service

https://gerrit.wikimedia.org/r/922064

Mentioned in SAL (#wikimedia-operations) [2023-05-23T06:04:44Z] <kart_> cxserver: Remove Flores MT service (T331505)

Change 923921 had a related patch set uploaded (by Santhosh; author: Santhosh):

[mediawiki/extensions/ContentTranslation@master] CX3 Build 0.2.0+20230529

https://gerrit.wikimedia.org/r/923921

Change 923921 merged by jenkins-bot:

[mediawiki/extensions/ContentTranslation@master] CX3 Build 0.2.0+20230529

https://gerrit.wikimedia.org/r/923921

Pginer-WMF closed subtask T331837: Merge OpusMT service into the self hosted Ctranslate2 based MT service as Resolved.May 31 2023, 2:46 PM

Pginer-WMF updated the task description. (Show Details)Jun 30 2023, 11:37 AM

Pginer-WMF edited projects, added Language-Team (Language-2023-July-September); removed Language-Team (Language-2023-April-June).Jun 30 2023, 11:46 AM

Pginer-WMF moved this task from Quarter Backlog to In Progress on the Language-Team (Language-2023-July-September) board.

Joe closed subtask T337284: Remove Flores key from production as Resolved.Jul 6 2023, 8:55 AM

KartikMistry updated the task description. (Show Details)Jul 6 2023, 12:17 PM

Nikerabbit removed a project: Patch-For-Review.Sep 6 2023, 7:35 AM

Nikerabbit moved this task from In Progress to Done on the Language-Team (Language-2023-July-September) board.

Pginer-WMF closed subtask T329971: New Service Deployment Request: NNLB-200 for machine translation as Resolved.Sep 7 2023, 9:49 AM

Since MinT was launched the service has been running in support of Content and Section Translation.

For pending items in the task:

Regarding the QA test checkbox, we can mark it as resolved. As we enabled MinT to support each language their models support we have checked them individually (T326578, T339105, T340953, T336683).
The use of a better storing option (T335491) is a follow-up task to improve the service, but I don't see it as a blocker to close the current task since the service is up and running. So we can close the current ticket and leave T335491 as a follow-up.

Pginer-WMF updated the task description. (Show Details)Sep 7 2023, 9:58 AM

Self hosted machine translation service
Closed, ResolvedPublic
Actions

Description

Background

Plan

Details

Related Objects
Search...

Event Timeline

	F36908719: image.png
	Mar 13 2023, 7:20 AM

	F36897582: image.png
	Mar 8 2023, 7:56 AM

Self hosted machine translation serviceClosed, ResolvedPublicActions