Files structure inside dataset:
-
Oxford:
- clustering
- oxford
- test
- train
- vocabulary
- idf.txt
- vocabulary.dat
-
newsgroups:
- clustering
- retrieval
- test
- train
- vocabulary.txt
- vocbulary_idf.txt
-
cifrar:
- clustering
- retrieval
- test
- train
- vocabulary.dat
- vocbulary
- idf.txt
- vocabulary.dat