Replies: 1 comment
- For training, I usually use 4-bit GPTQ models downloaded from TheBloke. I load them with the Transformers model loader, with 'auto-devices' and 'disable_exllama' ticked, and then train a LoRA from the Training tab. To use the resulting LoRA, I apply it to the same model, but loaded with the ExLlama_v2 loader.
- Can anyone share a model and setup they've successfully used for training? Every combination I've tried so far has failed. Thanks!