Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JW300 taken down from OPUS #77

Open
kpu opened this issue Oct 22, 2021 · 3 comments
Open

JW300 taken down from OPUS #77

kpu opened this issue Oct 22, 2021 · 3 comments

Comments

@kpu
Copy link
Collaborator

kpu commented Oct 22, 2021

Something related to copyright, they are trying to get proper permission.

@thammegowda
Copy link
Owner

It has been such a valuable resource!
https://jw.org has 1000 languages
https://glosbe.com has 6000 languages (most of them are dictionaries, but there are 2B+ sentence pairs)
If these two allow us to crawl their site text for NLP research+apps ....

@patelrajnath
Copy link

Do we have any idea if it will back online anytime soon?

@Hammyhamm89
Copy link

sorry for replying to this extremely old thread, but i just heard of this dataset, i'm sad it's down. i wish there was a backup somewhere.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants