
Add GPTQModel support for evaluating GPTQ models #2217

Open · wants to merge 21 commits into base: main

Conversation

@Qubitium commented Aug 16, 2024

This PR adds an option to use GPTQModel with lm_eval. GPTQModel is a replacement for AutoGPTQ for GPTQ quantization and inference, with broader model support and much faster inference speed out of the box. We have been using it internally with lm-eval for months without issue.
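As a rough sketch of what this might look like for users, assuming the option is exposed as a `gptqmodel` flag in `--model_args` on the `hf` backend (the flag name and the model ID below are assumptions, not confirmed by this PR text):

```shell
# Hypothetical invocation: evaluate a GPTQ-quantized checkpoint through
# GPTQModel instead of AutoGPTQ. The gptqmodel=True flag and the model
# repo name are illustrative assumptions.
lm_eval --model hf \
  --model_args pretrained=ModelCloud/Llama-3.1-8B-gptq-4bit,gptqmodel=True \
  --tasks hellaswag \
  --batch_size 8
```

If the flag is absent or set to `False`, the existing AutoGPTQ loading path would presumably be used, keeping the change backward compatible.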

@CLAassistant commented Aug 16, 2024

CLA assistant check
All committers have signed the CLA.

@Qubitium (Author)

@baberabb Hi, can we get some action on this? What do we need to do to get this reviewed and merged?

@Qubitium (Author)

@baberabb Ruff/Lint checks passed. Awaiting review. Thanks.

@Qubitium (Author)

@baberabb Ping. Please check our PR. We will push a unit test into test/modes/test_gptq.py later today to complete the PR. Let us know if there is anything else required of us.

@Qubitium Qubitium changed the title Add GPTQModel support for inferencing GPTQ models Add GPTQModel support for evaluating GPTQ models Oct 24, 2024
@Qubitium (Author)

@baberabb Unit test added.

@baberabb (Contributor)

Hi! Thanks for the PR, and sorry it took us ages to review. This looks good to me, but I want to run it by @haileyschoelkopf as well.

P.S. The Python 3.8 test is failing because of a recent transformers update.

4 participants