[Warning] `Merge lora module to 4-bit linear may get different generations` #2321

steveepreston · 2025-01-11T20:27:54Z

System Info

peft 0.14.0
transformers 4.48.0
bitsandbytes 0.45.0

Who can help?

@BenjaminBossan @sayakpaul

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder
My own task or dataset (give details below)

Reproduction

code:

base_model_id = "gemma-2-27b-it"

quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_use_double_quant=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_quant_storage=torch.bfloat16,
)

base_model = AutoModelForCausalLM.from_pretrained(
    base_model_id,
    quantization_config=quantization_config,
    attn_implementation="sdpa",
    torch_dtype=torch.bfloat16,
    use_cache=True,
)

peft_model = PeftModel.from_pretrained(base_model, adapter_path)

--> merged_model = peft_model.merge_and_unload()

Warning:


UserWarning: Merge lora module to 4-bit linear may get different generations due to rounding errors.

Expected behavior

merge_and_unload() correctly and without warning.

The text was updated successfully, but these errors were encountered:

steveepreston mentioned this issue Jan 11, 2025

Use AutoPeftModelForCausalLM to merge and save model cause crashed in the low VRAM situation #1083

Closed

4 tasks

steveepreston changed the title ~~[Bug] Merge lora module to 4-bit linear may get different generations~~ [Warning] Merge lora module to 4-bit linear may get different generations Jan 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Warning] `Merge lora module to 4-bit linear may get different generations` #2321

[Warning] `Merge lora module to 4-bit linear may get different generations` #2321

steveepreston commented Jan 11, 2025

[Warning] Merge lora module to 4-bit linear may get different generations #2321

[Warning] Merge lora module to 4-bit linear may get different generations #2321

Comments

steveepreston commented Jan 11, 2025

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

[Warning] `Merge lora module to 4-bit linear may get different generations` #2321

[Warning] `Merge lora module to 4-bit linear may get different generations` #2321