As an SDAW product manager, I want to understand how many articles there is potential to illustrate with our bot writing partners on Cebuano and Arabic wikis, so that I can determine whether we should consider other features, partners and improvements to illustrate a larger number of articles to meet our SDAW grant requirements.
We know that (as of January 2020) there are 1,939,115 total unillustrated articles across Cebuano and Arabic wikis, but only 121,390 of them have a candidate from the image matching algorithm. How close to illustrating the full 1,939,115 is there potential to get with the addition of matches from MediaSearch?
Acceptance Criteria:
- Using the full list of unillustrated articles from Cebuano (ceb) and Arabic (ar) wikis generated from @Miriam's image matching algorithm, write a script to determine an estimate of what percentage of those articles have matches with elastic search scores over (score threshold TBD).
Note that: