
video-salmonn vicuna question #86

Open
HeChengHui opened this issue Nov 29, 2024 · 2 comments

@HeChengHui

@BriansIDP
Thank you for your work.

What is the VRAM requirement to run inference? I am hitting OOM with lmsys/vicuna-13b, but lmsys/vicuna-7b gives me a size-mismatch error.
Or am I using the wrong model?

@BriansIDP
Collaborator

Thank you for the question.
Video-SALMONN is trained with vicuna-13b, so the input dimension of the 7b model does not match the Q-Former output of video-SALMONN. It would be worth trying quantization (at the cost of a small performance loss).
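
For example, here is a minimal sketch of 4-bit loading with bitsandbytes, assuming the LLM is loaded through Hugging Face transformers; the exact hook in this codebase may differ, so you may need to adapt it to the repo's model-loading path:

```python
# Sketch: load vicuna-13b with 4-bit bitsandbytes quantization to cut VRAM use.
# Assumes the model is loaded via transformers' from_pretrained; adapting this
# into video-SALMONN's own loading code is left to the repo's config path.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # store weights in 4-bit NF4 format
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,  # run matmuls in fp16
)

model = AutoModelForCausalLM.from_pretrained(
    "lmsys/vicuna-13b-v1.5",
    quantization_config=bnb_config,
    device_map="auto",                     # place layers across available GPUs
)
tokenizer = AutoTokenizer.from_pretrained("lmsys/vicuna-13b-v1.5")
```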

@HeChengHui
Author

HeChengHui commented Nov 29, 2024

@BriansIDP
Does that mean I can use something like TheBloke/vicuna-13B-v1.5-16K-AWQ just by setting it in the config?
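
For context, this is roughly what I would expect the loading to reduce to if the config just points at an AWQ checkpoint — a sketch assuming the repo goes through transformers' from_pretrained and that the autoawq package is installed; since the AWQ checkpoint keeps the 13b hidden size, I would expect the Q-Former projection to still line up:

```python
# Sketch: AWQ checkpoints ship pre-quantized, packed 4-bit weights, so a plain
# path swap only works if the loader is AWQ-aware. Recent transformers versions
# detect AWQ from the checkpoint's config when autoawq is installed.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TheBloke/vicuna-13B-v1.5-16K-AWQ"
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",  # weights are already 4-bit packed on disk
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
```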
