Describe the Issue
Apologies, I am by no means an expert, and I am still learning.
Recently, after upgrading KoboldCPP, I have been seeing strange repetition and other issues with responses that frequently just don't quite make sense given the content.
I am well within the context limit (currently testing Qwen 2.5 32B with Q4 quantization), yet the responses frequently contradict what is in the context right before them. I did some digging and noticed that in the startup output I see:
llm_load_print_meta: model ftype = Q3_K - Large
even though this is a Q4 model. Out of curiosity, I downloaded llamacpp and ran it against the same model, and it immediately detected the correct Q4 quantization.
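For anyone else trying to narrow this down, here is a minimal sketch of how the quantization recorded in the file itself can be checked, independent of which frontend loads it. It assumes the `gguf` Python package that ships with the llama.cpp repo (`pip install gguf`); the filename is just a placeholder, not my actual file.

```python
# Minimal sketch: read the quantization info straight from a GGUF file.
# Assumes the `gguf` Python package from the llama.cpp repo (pip install gguf).
from collections import Counter

from gguf import GGUFReader

reader = GGUFReader("qwen2.5-32b-q4.gguf")  # placeholder path

# general.file_type is a single integer label (a llama.cpp LLAMA_FTYPE id)
# written at conversion time; as far as I can tell, the "model ftype" line
# in the startup output is derived from it.
ftype = reader.fields.get("general.file_type")
if ftype is not None:
    print("general.file_type =", int(ftype.parts[ftype.data[0]][0]))

# The per-tensor quantization types are what actually affect output quality,
# so count those as well rather than trusting the label alone.
counts = Counter(t.tensor_type.name for t in reader.tensors)
for type_name, n in counts.most_common():
    print(f"{type_name}: {n} tensors")
```

If llamacpp and KoboldCPP print different ftypes for the same file, comparing against this output should show which one is reading the metadata correctly, and whether the tensors really are Q4 or the file was actually converted as Q3_K.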
Additional Information:
Running in Podman, using a Quadro P6000, 24GB VRAM.
Seems to work otherwise, no errors or anything visible. Just curious if someone knew of a magic flag or something to try? Or maybe I am being stupid and missing something.
Fair enough! Thank you for the response. I am still exploring the various flags and options as I figure out the issues. I am pretty sure this is user error somehow...