Bug description
marian embed includes a --compute-similarity option. I assume that if
    $MARIAN/marian embed -t data.ja -v vocab.ja.spm -m model.npz
works, then doubling up the test set and vocab (as hinted by the description of --compute-similarity):
    $MARIAN/marian embed -t data.ja paraphrase.ja -v vocab.ja.spm vocab.ja.spm -m model.npz
should work too.
Instead I get
Error: Number of corpus files and vocab files does not agree
Am I doing something wrong?
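Both error messages quoted in this report point at a count comparison between input files and vocabularies. A minimal sketch of such a check, assuming it only compares list lengths (this is not Marian's code, and `check_paired_inputs` is a hypothetical name), shows why the two-file, two-vocab call looks like it should pass:

```python
from typing import Sequence


def check_paired_inputs(test_sets: Sequence[str], vocabs: Sequence[str]) -> None:
    """Hypothetical stand-in for the validation the error messages describe."""
    if len(test_sets) != len(vocabs):
        raise ValueError(
            f"Number of corpus files ({len(test_sets)}) and vocab files "
            f"({len(vocabs)}) does not agree"
        )


# The doubled-up invocation: two inputs, two vocabularies, so no error here.
check_paired_inputs(["data.ja", "paraphrase.ja"], ["vocab.ja.spm", "vocab.ja.spm"])

# Dropping one vocab is the case that a length check would reject.
try:
    check_paired_inputs(["data.ja", "paraphrase.ja"], ["vocab.ja.spm"])
except ValueError as err:
    print(err)
```

If the real validation behaves like this sketch, the doubled-up command should not trigger the mismatch error, which suggests the counts are being compared against something other than the arguments as typed.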
Context
Marian version: v1.11.0 f00d062 2022-02-08 08:39:24 -0800
CMake command:
    cmake .. -DCMAKE_BUILD_TYPE=Release \
        -DUSE_SENTENCEPIECE=ON \
        -DCOMPILE_CPU=on \
        -DUSE_STATIC_LIBS=on \
        -DUSE_FBGEMM=on
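Full error log:
When I leave out one vocab or data file, it instead complains
    [2022-10-10 16:12:44] Error: There should be as many vocabularies as training files
    [2022-10-10 16:12:44] Error: Aborted from void marian::ConfigValidator::validateOptionsParallelData() const in /data/smt/dev/marian-dev/src/common/config_validator.cpp:83
There is no more output apart from the stack trace.
Thanks a lot,
Daniel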