Could you explain your motivation for setting it to 0 instead of 1 << 30? 1 << 30 should work equivalently to 0, and it is friendlier to our kernel implementation.
System Info
- GPU: A100
- TensorRT-LLM version: tensorrt_llm-0.13.0.dev2024090300
- OS: Ubuntu
Who can help?
Hi @ncomly-nvidia, @byshiue,
I want to set no_repeat_ngram_size=0 for a Mistral model, but I get the following assertion error:
RuntimeError: [TensorRT-LLM][ERROR] Assertion failed: noRepeatNgramSize.value() > 0 (/home/jenkins/agent/workspace/LLM/main/L0_PostMerge/llm/cpp/tensorrt_llm/executor/samplingConfig.cpp:332)
As per the documentation, the default value is 1 << 30. Is there a way to set the value to 0? If not, can this feature be added?
Reproduction
Setting no_repeat_ngram_size=0 under SamplingParams for the Mistral model.
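A minimal sketch of the failing call, assuming TensorRT-LLM's high-level LLM API (tensorrt_llm.LLM and tensorrt_llm.SamplingParams); the checkpoint path and surrounding setup are placeholders and may differ from the actual script:

```python
# Sketch of the reproduction, not the exact script used.
# Assumes the high-level LLM API of TensorRT-LLM 0.13.0.dev;
# the model path below is a placeholder.
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="mistralai/Mistral-7B-Instruct-v0.2")  # hypothetical model path

# Passing 0 here raises:
#   RuntimeError: [TensorRT-LLM][ERROR] Assertion failed: noRepeatNgramSize.value() > 0
sampling_params = SamplingParams(no_repeat_ngram_size=0)

outputs = llm.generate(["Hello"], sampling_params)
```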
Expected behavior
The user should be allowed to set this value to 0.
Actual behavior
An assertion error is raised (see above).
Additional notes
We want to set it to 0, as we do when running inference with PyTorch in eager mode; see the sketch below.
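A possible interim workaround, based on the comment at the top of this thread that 1 << 30 behaves equivalently to 0 (i.e. the n-gram repetition penalty is effectively disabled); this is a sketch under the same API assumptions as the reproduction snippet above:

```python
from tensorrt_llm import SamplingParams

# 1 << 30 is the documented default for no_repeat_ngram_size; per the
# maintainer comment above it is treated as "no n-gram repetition
# penalty", so it can stand in for 0 until 0 is accepted directly.
sampling_params = SamplingParams(no_repeat_ngram_size=1 << 30)
```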