Fix perplexity calculation and resulting overcompression #208

cornzz · 2025-01-16T18:02:24Z

For longllmlingua this is not a 100% correct fix, especially for short prompts with longllmlingua errors occur.
I hope one of the maintainers can give some feedback whether this is semi-correct

Related: #195, #61

See this comment for details: #61 (comment)

Before the change (top: Llama 2, bottom: GPT-2):

After the change:

Note that the undercompression for 250 and 500 tokens is caused by the second bug / design flaw described in this issue #196 under "Semi-related bug", Also, this was measured including the bugfix in #198 otherwise 50 and 100 tokens would not be compressed.

I did not run the tests as compression outputs will change and tests will fail

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Was this discussed/approved via a Github issue? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

…ion (microsoft#195)

Fix(LLMLingua): fix perplexity calculation and resulting overcompress…

aabda13

…ion (microsoft#195)

cornzz mentioned this pull request Jan 16, 2025

Understanding the interplay between ratio and iterative_size #61

Closed

cornzz changed the title ~~Fix perplexity calculation and resulting overcompress…~~ Fix perplexity calculation and resulting overcompression Jan 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix perplexity calculation and resulting overcompression #208

Fix perplexity calculation and resulting overcompression #208

cornzz commented Jan 16, 2025 •

edited

Loading

Fix perplexity calculation and resulting overcompression #208

Are you sure you want to change the base?

Fix perplexity calculation and resulting overcompression #208

Conversation

cornzz commented Jan 16, 2025 • edited Loading

Before submitting

cornzz commented Jan 16, 2025 •

edited

Loading