-Adapted from verses by Fawaz Al-Laboun (فواز اللعبون)-
This project aims to build a specialized Arabic model for generating Arabic poems only. Unfortunately, I trained it on free Colab, so the results are not that good. The plan is as follows:
- Train with more iterations (needs more compute)
- Use a bigger dataset: Link
- Separate the training by bahr (بحر القصيدة, the poem's meter) so the model can learn each one more easily
- Use a new model architecture:
  - GELU (as activation)
  - LLaMA & JAIS (as new LLM architectures)
- Use a PyTorch implementation instead of one written from scratch
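As a rough illustration of the GELU item in the plan above, here is a minimal sketch of the activation in plain Python. It is not taken from this repo's code; the function names `gelu` and `gelu_tanh` are chosen here for illustration, and the tanh form is the commonly used approximation.

```python
import math


def gelu(x: float) -> float:
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x: float) -> float:
    """Tanh approximation of GELU, often used in GPT-style models."""
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


if __name__ == "__main__":
    # Compare the exact and approximate forms on a few inputs.
    for value in (-2.0, -1.0, 0.0, 1.0, 2.0):
        print(f"{value:+.1f}  exact={gelu(value):+.4f}  tanh={gelu_tanh(value):+.4f}")
```

In a PyTorch model the built-in `torch.nn.GELU` module would normally be used instead of a hand-written function.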
The pre-trained weights are available with this code: Link
Note: to run the model you need to download the dataset (Link), which is used to build the tokens and the token encoder/decoder. The trained model itself is in this repo.
- Also, `Tarannom.ipynb` is the source code of the first model.
So far, only one model has been fully trained:
| Model # | Steps | Val loss |
|---|---|---|
| 1 | 10000 | 2.0209 |
Model architectures
| Model # | # layers | # heads | dim | # params |
|---|---|---|---|---|
| 1 | 12 | 8 | 128 | 2.4M |
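As a sanity check on the parameter count in the architecture table, the sketch below estimates the size of the transformer blocks alone. It assumes a standard GPT-style block (four `dim x dim` attention projections plus an MLP with a 4x hidden expansion) and ignores embeddings, biases, and layer norms. Those are assumptions, not details confirmed by this repo, and `estimate_block_params` is a hypothetical helper name.

```python
def estimate_block_params(n_layers: int, dim: int, mlp_ratio: int = 4) -> int:
    """Approximate weight count of the transformer blocks only.

    Assumes a GPT-style block; embeddings, biases and layer norms are ignored.
    The number of heads does not change the count, since the heads split `dim`.
    """
    attn = 4 * dim * dim             # Q, K, V and output projections
    mlp = 2 * mlp_ratio * dim * dim  # up-projection and down-projection
    return n_layers * (attn + mlp)


if __name__ == "__main__":
    total = estimate_block_params(n_layers=12, dim=128)
    print(f"{total:,} (~{total / 1e6:.1f}M)")  # 2,359,296 (~2.4M)
```

Under these assumptions the estimate lands close to the 2.4M figure in the table. Whether the reported number also includes the embedding tables is not stated here.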
- أم ذكرتِ قبلاً فكان رشداً ولنا ما براه السرور الدما
- وكفى الماء حُصْنُ النفس عليك
- أقولها فلا زَلَلتُ فتىً للعزم أَن تُمدي بالغر
This model is inspired by Andrej Karpathy's code: Link