
Does not support Flash Attention 2.0 yet #74

Open

moonriver0922 opened this issue Feb 11, 2025 · 1 comment
Comments

@moonriver0922
Thanks for the great work.
When I fine-tune the tiny model on my dataset, it shows:
ValueError: DeepseekVLV2ForCausalLM does not support Flash Attention 2.0 yet. Please request to add support where the model is hosted, on its model hub page: https://huggingface.co/deepseek-ai/deepseek-vl2-tiny/discussions/new or in the Transformers GitHub repo: https://github.com/huggingface/transformers/issues/new
Are there any fine-tuning tutorials available?
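
A minimal sketch of a possible workaround, assuming the error is raised because the fine-tuning script requests `attn_implementation="flash_attention_2"` when loading the model: falling back to the default eager attention avoids the unsupported-implementation check in transformers. The model path comes from the error message; loading via `AutoModelForCausalLM` with `trust_remote_code` is an assumption, not the repo's official fine-tuning recipe.

import torch
from transformers import AutoModelForCausalLM

# Hypothetical loading snippet: DeepseekVLV2ForCausalLM is a custom
# architecture, so trust_remote_code=True is required to load it from the hub.
model = AutoModelForCausalLM.from_pretrained(
    "deepseek-ai/deepseek-vl2-tiny",
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,
    # Passing "flash_attention_2" here triggers the ValueError above;
    # "eager" (or "sdpa", if the model supports it) sidesteps the check.
    attn_implementation="eager",
)

This only avoids the error; it does not add Flash Attention 2.0 support, so training will run with the slower standard attention path.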

@gromtang
I ran into the same problem.
