Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tencent HunYuan MOE model #1100

Open
wants to merge 3 commits into
base: main
Choose a base branch
from
Open

Tencent HunYuan MOE model #1100

wants to merge 3 commits into from

Conversation

awni
Copy link
Member

@awni awni commented Nov 8, 2024

Currently too large to run on in 192GB even in 4-bit. Looking into mixed precision.

@awni
Copy link
Member Author

awni commented Nov 9, 2024

Kind of works in 2-bit, but is all chinese for some reason:

mlx_lm.convert --hf-path tencent-community/Hunyuan-A52B-Instruct -q --q-bits 2 --q-group-size 32
mlx_lm.generate --model mlx_model --prompt "Write a story about Einstein" -m 100 --trust-remote-code

Outputs:

==========
Prompt: <|startoftext|><|startoftext|>Write a story about Einstein<|extra_4|><|extra_0|>
<|startoftext|>给出的问题是关于爱因斯坦的故事。以下是一个关于爱因斯坦的虚构故事:

在一个风和日丽的午后,阿尔伯特·爱因斯坦正坐在他位于柏林的办公室里,埋头于一份关于光速不变原理的论文。爱因因斯坦是一个著名的理论物理学家,他的相对论已经改变了科学界对时间和空间的理解。

突然,爱因斯坦的助手急匆匆地闯进了办公室,手里还拿着一封来自国际物理研究协会的信
==========

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant