pre-extracted data in .tsv format #140
Comments
Unfortunately, the data we provide is not suitable for straightforward use as .tsv or .csv. The JSON data is hierarchical, with big and reasonably sprawling word structures that contain smaller structures, dictionaries and lists, and translating that to .tsv needs to be done on a case-by-case basis. It's not a universal data format that is swappable between different programs (at least not yet, nor in the near future); it's just a bunch of data we've put into an ad hoc data structure as needed. To make what you want possible you need to:
We welcome any contributions to the project to make it more accessible.
I think pyglossary supports conversion of .json to .tsv/.tab
We would be happy to have someone implement a conversion utility for our .json to other formats, but someone has to code it first, and our data structure and format can change as time goes by.
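As a rough illustration of what such a conversion involves, here is a minimal Python sketch. It assumes the dump has one JSON object per line, each with a `word` field and a `senses` list whose items may carry a `glosses` list; that layout may not match every dump and, as noted above, can change over time. The function names are made up for this example, and everything except the headword and glosses is thrown away.

```python
import json


def clean(text):
    # Tabs and newlines would break the row layout, so collapse all whitespace.
    return " ".join(str(text).split())


def entries_to_rows(lines):
    """Yield (word, definition) pairs from an iterable of JSON lines.

    Assumed layout (not guaranteed): {"word": ..., "senses": [{"glosses": [...]}]}
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue
        entry = json.loads(line)
        word = entry.get("word")
        if not word:
            continue
        glosses = []
        for sense in entry.get("senses", []):
            glosses.extend(sense.get("glosses", []))
        # Entries without any gloss are skipped: there is nothing to display.
        if glosses:
            yield clean(word), "; ".join(clean(g) for g in glosses)


def write_tsv(lines, out):
    """Write one 'word<TAB>definition' row per entry; return the row count."""
    count = 0
    for word, definition in entries_to_rows(lines):
        out.write(f"{word}\t{definition}\n")
        count += 1
    return count
```

To use it, open the dump and an output file with `encoding="utf-8"` and pass both file objects to `write_tsv`. A word with several entries (for example, a noun and a verb) ends up as several rows; whether a given dictionary program merges those is something to check separately.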
I created a project that can create tsv/stardict/kindle dictionaries from the kaikki dump. It is not extremely well tested, but it may work: https://github.com/Vuizur/ebook_dictionary_creator
I also now have a repository with directly downloadable dictionaries for a lot of languages in 3 different formats: https://github.com/Vuizur/Wiktionary-Dictionaries
Here is a new tool: https://codeberg.org/GrimPixel/Text_to_Wordlist
Please give me pre-extracted data in .tsv format. Goldendict mobile cannot read .json dictionaries.