completion() for fireworks #329

dineshyv · 2024-10-25T22:43:11Z

What does this PR do?

Implements the completion() API for the Fireworks provider.

Feature/Issue validation/testing/test plan

 PROVIDER_ID=test-fireworks MODEL_IDS="Llama3.1-8B-Instruct" PROVIDER_CONFIG=../configs/provider_config_example.yaml with-proxy pytest -s tests/inference/test_inference.py --tb=short --disable-warnings -rs
/home/dineshyv/.conda/envs/stack/lib/python3.10/site-packages/pytest_asyncio/plugin.py:208: PytestDeprecationWarning: The configuration option "asyncio_default_fixture_loop_scope" is unset.
The event loop scope for asynchronous fixtures will default to the fixture caching scope. Future versions of pytest-asyncio will default the loop scope for asynchronous fixtures to function scope. Set the default fixture loop scope explicitly in order to avoid unexpected behavior in the future. Valid fixture loop scopes are: "function", "class", "module", "package", "session"

  warnings.warn(PytestDeprecationWarning(_DEFAULT_FIXTURE_LOOP_SCOPE_UNSET))
================================================================================= test session starts =================================================================================
platform linux -- Python 3.10.15, pytest-8.3.3, pluggy-1.5.0
rootdir: /home/dineshyv/local/llama-stack
configfile: pyproject.toml
plugins: asyncio-0.24.0, anyio-4.6.2.post1
asyncio: mode=strict, default_loop_scope=None
collected 8 items

tests/inference/test_inference.py Resolved 4 providers
 inner-inference => test-fireworks
 models => __routing_table__
 inference => __autorouted__
 inspect => __builtin__

........

=========================================================================== 8 passed, 21 warnings in 9.24s ============================================================================

dineshyv · 2024-10-25T22:46:37Z

llama_stack/providers/tests/inference/test_inference.py

@@ -167,7 +168,7 @@ async def test_completion(inference_settings):
    ]

    assert all(isinstance(chunk, CompletionResponseStreamChunk) for chunk in chunks)
-    assert len(chunks) == 51
+    assert len(chunks) >= 1


@ashwinb, this change is needed because the number of chunks returned varies from provider to provider and is not determined by max tokens.
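To illustrate the point, here is a minimal, self-contained sketch; it is not the llama-stack test itself. `FakeChunk`, `fake_stream`, `collect`, and `check_stream` are hypothetical stand-ins invented for this example, with `FakeChunk` playing the role of `CompletionResponseStreamChunk`. It simulates two providers that split the same output into different numbers of chunks and shows that a check of the form `len(chunks) >= 1` holds for both, whereas a hard-coded count could match at most one of them.

```python
import asyncio
from dataclasses import dataclass
from typing import AsyncIterator, List


@dataclass
class FakeChunk:
    """Hypothetical stand-in for CompletionResponseStreamChunk."""

    delta: str


async def fake_stream(text: str, chunk_size: int) -> AsyncIterator[FakeChunk]:
    """Simulate a provider that streams `text` in pieces of `chunk_size` characters."""
    for start in range(0, len(text), chunk_size):
        yield FakeChunk(delta=text[start:start + chunk_size])


async def collect(stream: AsyncIterator[FakeChunk]) -> List[FakeChunk]:
    """Drain an async stream into a list, as a streaming test would."""
    return [chunk async for chunk in stream]


def check_stream(chunks: List[FakeChunk]) -> None:
    """Provider-agnostic checks: chunk type and a non-empty stream only."""
    assert all(isinstance(chunk, FakeChunk) for chunk in chunks)
    assert len(chunks) >= 1


text = "Roses are red, violets are blue"

# Two simulated providers: one emits a chunk per character, one batches.
per_char = asyncio.run(collect(fake_stream(text, chunk_size=1)))
batched = asyncio.run(collect(fake_stream(text, chunk_size=8)))

check_stream(per_char)
check_stream(batched)
```

Both streams reassemble to the same text even though their chunk counts differ, which is why the assertion targets the chunk type and a non-empty stream rather than an exact length.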

ashwinb

sweet

completion() for fireworks

d882d46

dineshyv requested review from ashwinb, yanxi0830, hardikjshah, dltn and raghotham as code owners October 25, 2024 22:43

facebook-github-bot added the CLA Signed label Oct 25, 2024

dineshyv linked an issue Oct 25, 2024 that may be closed by this pull request

[functionality] Implement completion() methods #168

Closed

dineshyv commented Oct 25, 2024


ashwinb approved these changes Oct 25, 2024


dineshyv merged commit 9b85d9a into main Oct 25, 2024
4 checks passed

dineshyv deleted the dineshyv/fireworks-completion branch October 25, 2024 23:12
