
Using t5-large in t5 notebook, the translation result is invalid #135

Open
brevity2021 opened this issue Sep 8, 2022 · 1 comment

@brevity2021

Hi,

First, thank you for the great work!
I was playing with the t5 notebook in demo/generative-model. I built a Docker image through the Makefile and ran the notebook from the container.

I changed very little in the notebook, only a few print statements. With t5-small it runs fine, but when I switch to t5-large, the translation result in the Benchmark section becomes empty. I also printed out the generated tokens, and the results are

text generated by ONNX:
Onnx tokens:
tensor([0, 2, 0, 1], device='cuda:0')

which is obviously not correct.
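For reference, here is a minimal sketch of how I read those ids (assuming the standard Hugging Face t5-large tokenizer, where 0 is `<pad>`, 2 is `<unk>` and 1 is `</s>`), which decode to an empty translation:

```python
from transformers import T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-large")
ids = [0, 2, 0, 1]  # the ids printed above

# Show which tokens the ids map to
print(tokenizer.convert_ids_to_tokens(ids))
# ['<pad>', '<unk>', '<pad>', '</s>']

# Decoding with special tokens skipped yields an empty string
print(repr(tokenizer.decode(ids, skip_special_tokens=True)))
# ''
```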

I attach the notebook here for your reference. I suspect there may be numerical instability when converting to fp16, since that method depends on randomly generated data.
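One rough way to check this hypothesis (a sketch only, not code from the notebook; the ONNX file path and input names are assumptions) would be to compare the fp16 ONNX encoder output against the PyTorch fp32 reference and look for NaN/inf values or large drift:

```python
import numpy as np
import torch
import onnxruntime as ort
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("t5-large")
model = T5ForConditionalGeneration.from_pretrained("t5-large").eval()

inputs = tokenizer("translate English to French: The weather is nice.",
                   return_tensors="pt")

# fp32 reference from the PyTorch encoder
with torch.inference_mode():
    ref = model.encoder(input_ids=inputs["input_ids"]).last_hidden_state.numpy()

# Hypothetical path / input name for the exported fp16 encoder
sess = ort.InferenceSession("t5-large-encoder-fp16.onnx",
                            providers=["CUDAExecutionProvider"])
out = sess.run(None, {"input_ids": inputs["input_ids"].numpy()})[0]

# NaN/inf or a very large max-abs difference would point at fp16 overflow
print("has nan/inf:", bool(np.isnan(out).any() or np.isinf(out).any()))
print("max abs diff vs fp32:", np.abs(ref - out.astype(np.float32)).max())
```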

My experiment was run on a g5.2xlarge instance.

@ayoub-louati
Contributor

@brevity2021 We are working on adding support for T5 conversion through the convert script. I think it should handle precision correctly for the different T5 models (including t5-large).
