Optimization for MF #17

thwu1 · 2024-07-09T07:45:24Z

#9 I added precompution for collapsing three linear transformations without modifying the loading part. @iojw Do you want to do a speed test?

iojw

Thank you @thwu1! This will be an amazing improvement for MF - some comments.

iojw · 2024-07-20T20:59:08Z

routellm/routers/matrix_factorization/model.py

-        use_proj,
+        dim=128,
+        num_models=64,
+        text_dim=768,


Could we automatically set the text dim based on the selected embeddings?

iojw · 2024-07-20T21:01:17Z

routellm/routers/routers.py

        num_classes=1,
        use_proj=True,
+        embedding_model="all-mpnet-base-v2",


Maybe better to use OpenAI's embeddings by default to preserve our existing behavior? I'll add some docs to describe the different options here.

iojw · 2024-07-20T21:03:08Z

routellm/routers/matrix_factorization/model.py

+        num_classes=1,
+        use_proj=True,
+        collapse_linear=False,
+        embedding_model="all-mpnet-base-v2",


I think it would be better if we didn't set any default args here and specify default args only in routers.py so that it's easier to keep track of router configs.

thwu1 changed the title ~~Optimization for MF: Precompute collapsed linear transformation~~ Optimization for MF Jul 9, 2024

thwu1 requested a review from iojw July 9, 2024 18:46

precompute collapsed linear transformation

cc67b87

thwu1 force-pushed the optimize_mf branch from 5234fbb to cc67b87 Compare July 14, 2024 00:35

thwu1 added 2 commits July 14, 2024 01:06

support local embedding model

2a4328d

add stella_en_400M_v5 support

1316690

iojw reviewed Jul 20, 2024

View reviewed changes

iojw mentioned this pull request Aug 3, 2024

Can we use the matrix factorization model locally by downloading it to our local? #39

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimization for MF #17

Optimization for MF #17

thwu1 commented Jul 9, 2024 •

edited

Loading

iojw left a comment

iojw Jul 20, 2024

iojw Jul 20, 2024

iojw Jul 20, 2024

Optimization for MF #17

Are you sure you want to change the base?

Optimization for MF #17

Conversation

thwu1 commented Jul 9, 2024 • edited Loading

iojw left a comment

Choose a reason for hiding this comment

iojw Jul 20, 2024

Choose a reason for hiding this comment

iojw Jul 20, 2024

Choose a reason for hiding this comment

iojw Jul 20, 2024

Choose a reason for hiding this comment

thwu1 commented Jul 9, 2024 •

edited

Loading