
How to apply 3/4-bit quantization to vision-language model? #28

Closed
verigle opened this issue Jun 8, 2023 · 1 comment

verigle commented Jun 8, 2023

How do I apply 3/4-bit quantization to a vision-language model like BLIP2?

efrantar (Member) commented

In principle, GPTQ should be applicable to most types of models; see also #3 and #8 for advice on applying GPTQ to models other than the ones in this repository. In the case of BLIP2, I would guess that you want to compress the vision part first and then follow up with the language part, using calibration data that has passed through the already-quantized vision component.
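A minimal sketch of that staging, assuming the Hugging Face `Blip2ForConditionalGeneration` wrapper and its `vision_model` / `language_model` submodules (an assumption about that external API, not part of this repo). The `rtn_quantize` helper is a deliberately simple round-to-nearest stand-in so the sketch runs end to end; in practice you would substitute the per-layer GPTQ loop from this repository, which additionally uses the calibration activations:

```python
import torch
from transformers import Blip2ForConditionalGeneration

def rtn_quantize(module: torch.nn.Module, bits: int = 4) -> None:
    """Stand-in quantizer: per-tensor symmetric round-to-nearest on every
    Linear weight. GPTQ would instead minimize layer-wise error using
    calibration inputs; this stub only illustrates the two-stage flow."""
    qmax = 2 ** (bits - 1) - 1
    for m in module.modules():
        if isinstance(m, torch.nn.Linear):
            scale = m.weight.abs().max().clamp(min=1e-8) / qmax
            m.weight.data = (m.weight / scale).round().clamp(-qmax - 1, qmax) * scale

model = Blip2ForConditionalGeneration.from_pretrained("Salesforce/blip2-opt-2.7b")

# Stage 1: quantize the vision encoder on its own.
rtn_quantize(model.vision_model, bits=4)

# Stage 2: run calibration images through the *already quantized* vision
# encoder, so the language side is calibrated on the (slightly perturbed)
# features it will actually see at inference time. In the full BLIP2
# pipeline these features also pass through the Q-Former before the LM.
pixel_values = torch.randn(4, 3, 224, 224)  # placeholder calibration batch
with torch.no_grad():
    image_embeds = model.vision_model(pixel_values).last_hidden_state

# Stage 3: quantize the language model; with GPTQ, activations derived
# from `image_embeds` (plus text prompts) would be its calibration data.
rtn_quantize(model.language_model, bits=4)
```

The point of the ordering is that the language model then sees the same quantization-perturbed vision features during calibration as it will at inference, rather than being calibrated against the full-precision encoder.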
