llama 3 tokenizer no longer works - updated eos token #44

avianion · 2024-05-13T13:12:23Z

The official llama 3 70b instruct repo has updated the eos token

"eos_token": "<|eot_id|>",

Yet when using this library and using that eos token, no output is outputted because it used the old eos token.

Suggesting to fix this @npuichigo

npuichigo · 2024-05-13T16:34:45Z

which part do you mean? Triton backend should have parameter like stop_words

avianion · 2024-05-13T22:35:40Z

"chat_template": "{% set loop_messages = messages %}{% for message in loop_messages %}{% set content = '<|start_header_id|>' + message['role'] + '<|end_header_id|>\n\n'+ message['content'] | trim + '<|eot_id|>' %}{% if loop.index0 == 0 %}{% set content = bos_token + content %}{% endif %}{{ content }}{% endfor %}{% if add_generation_prompt %}{{ '<|start_header_id|>assistant<|end_header_id|>\n\n' }}{% endif %}",

poddamatt98 · 2024-07-31T09:44:20Z

Hi @avianion, I am reporting the issue you are describing. Using the liquid template defined in the repo, the model is returning an empty response. I also tried converting your chat template from Jinja to Liquid, but without results.
I wonder if you have solved this issue.
@npuichigo can you help us with this?

npuichigo · 2024-08-01T01:51:31Z

@poddamatt98 I will take a look at this when I have time.

poddamatt98 · 2024-08-02T09:28:28Z

problem solved modifying </s> with <|eot_id|> both in row 245 in src/routes/chat.rs and in row 226 in src/routes/completions.rs.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama 3 tokenizer no longer works - updated eos token #44

llama 3 tokenizer no longer works - updated eos token #44

avianion commented May 13, 2024

npuichigo commented May 13, 2024

avianion commented May 13, 2024

poddamatt98 commented Jul 31, 2024

npuichigo commented Aug 1, 2024

poddamatt98 commented Aug 2, 2024 •

edited

Loading

llama 3 tokenizer no longer works - updated eos token #44

llama 3 tokenizer no longer works - updated eos token #44

Comments

avianion commented May 13, 2024

npuichigo commented May 13, 2024

avianion commented May 13, 2024

poddamatt98 commented Jul 31, 2024

npuichigo commented Aug 1, 2024

poddamatt98 commented Aug 2, 2024 • edited Loading

poddamatt98 commented Aug 2, 2024 •

edited

Loading