Sunday, August 18, 2024

New languages, testers wanted: Catalan, Chinese, Irish, Kurdish

Improvements in DBnary allow WikDict to extend its support to four more languages:

  • Catalan
  • Chinese
  • Irish
  • Kurdish

This brings the total number of translations to an impressive 14.2 million!

Since I don't speak any of the new languages myself, I can't thoroughly check the results without your help. So if you use any of the new dictionaries, please report back with both successes and problems!

Thursday, March 14, 2024

Improved translation handling provides ~800k more translations

Different Wiktionaries store their data in different ways. WikDict knows this and handles the data accordingly. Unfortunately, the data structure is not fully consistent even within a single Wiktionary. Recent changes relax some assumptions about the structure of translations, which allows more translations to find their way into the public WikDict data set. The results are especially beneficial for Turkish, Russian and Bulgarian, but they have a noticeable effect on nearly all language pairs.

See the GitHub issue if you are interested in more technical details of this change.