There are two ways to solve this problem. You can either reduce the input length to the model. For this, you might want to use our …
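For example, one way to reduce the input length is simply to retrieve fewer documents per query, so that the rendered prompt stays under the 512-token limit. A minimal sketch, assuming a Haystack 1.x pipeline built as in the tutorial with the retriever node named 'retriever' (adjust the node name and top_k to your own setup):
< Code >
# Sketch: pass a smaller top_k to the retriever at query time so that
# fewer documents are joined into the prompt.
# The node name "retriever" is an assumption taken from the tutorial.
output = pipe.run(
    query="What does Rhodes Statue look like?",
    params={"retriever": {"top_k": 2}},  # fewer documents -> shorter prompt
)
print(output["answers"][0].answer)
With fewer documents joined into the prompt, the rendered input is more likely to fit within the 512-token limit minus the 100 tokens reserved for the answer.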
-
Hello,
While working through the tutorial 'Creating a Generative QA Pipeline with Retrieval-Augmentation', I got the following warning about the maximum token length in the response:
< Code >
output = pipe.run(query="What does Rhodes Statue look like?")
print(output["answers"][0].answer)
< Response >
Token indices sequence length is longer than the specified maximum sequence length for this model (560 > 512). Running this sequence through the model will result in indexing errors
WARNING:haystack.nodes.prompt.invocation_layer.hugging_face:The prompt has been truncated from 560 tokens to 412 tokens so that the prompt length and answer length (100 tokens) fit within the max token limit (512 tokens). Shorten the prompt to prevent it from being cut off
The Colossus was a mythical figure, and the mythical figure is the mythical Colossus.
The model used in the PromptNode is 'google/flan-t5-large', and I know its maximum sequence length is 512 tokens.
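For context, the PromptNode is set up roughly as in the tutorial (a sketch, assuming Haystack 1.x; the prompt template text is shortened here, and max_length=100 is the default answer length):
< Code >
from haystack.nodes import AnswerParser, PromptNode, PromptTemplate

# Sketch of the tutorial-style setup; the real template text is longer.
rag_prompt = PromptTemplate(
    prompt="Synthesize an answer from the following text.\n"
           "Related text: {join(documents)}\nQuestion: {query}\nAnswer:",
    output_parser=AnswerParser(),
)
prompt_node = PromptNode(
    model_name_or_path="google/flan-t5-large",
    default_prompt_template=rag_prompt,
    max_length=100,  # tokens reserved for the generated answer (the "100 tokens" in the warning)
)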
I guess the input length exceeded the model's maximum length, which is why the warning above appeared.
In addition, I can see that this issue hurts the performance of the QA pipeline because of the indexing errors.
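For reference, the prompt length can be checked with the model's own tokenizer before running the pipeline (a minimal sketch; prompt_text is a placeholder for the fully rendered prompt, i.e. the template with the retrieved documents filled in, which is not shown here):
< Code >
from transformers import AutoTokenizer

# Same tokenizer that the PromptNode uses for google/flan-t5-large.
tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-large")

prompt_text = "..."  # placeholder: paste the fully rendered prompt here
n_tokens = len(tokenizer.encode(prompt_text))
print(n_tokens)  # 560 in the run above, which exceeds the 512-token limit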
How can I use 'google/flan-t5-large' without running into this maximum sequence length problem?
Thank you in advance for your help.