LLM Docker server documentation. #35
Conversation
```
LLM Endpoint: http://127.0.0.1:5000/v1
LLM API Key: Can be blank, or anything at all.
```
So here is where things get a bit scuffed. Regardless of which model you actually have loaded, enter gpt-3.5-turbo, because COVAS doesn't accept anything else here. Also, when you click Start AI, COVAS will complain that your model provider is not serving gpt-4o-mini. Ignore it and continue anyway; it still works fine despite the nag. I don't understand it either.
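If you want to see which model names your local server actually reports, you can query its OpenAI-compatible models endpoint directly. This is a minimal sketch assuming the default text-generation-webui address from the settings above; adjust the base URL if yours differs.

```python
import requests

# Default text-generation-webui endpoint used in the settings above (assumption).
BASE_URL = "http://127.0.0.1:5000/v1"

# OpenAI-compatible servers expose their model list at /models.
resp = requests.get(f"{BASE_URL}/models", timeout=10)
resp.raise_for_status()

# The response mirrors OpenAI's format: {"object": "list", "data": [{"id": ...}, ...]}
for model in resp.json().get("data", []):
    print(model["id"])
```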
What we do on the COVAS:NEXT side is a request to the models endpoint of the URL that was entered (https://platform.openai.com/docs/api-reference/models/list). The reason we do this is that for OpenAI, the 4o model is sometimes not available when the account is very young. If we detect that 4o-mini is not in that list, we downgrade the model to 3.5-turbo instead. The catch is that providers other than OpenAI may not offer a models endpoint, in which case our request errors out and we just hope that what was entered as the model name is correct.

What appears to happen in your case is that the text-gen webui does provide a models endpoint, and that endpoint probably does not include gpt-4o-mini, so we show the error that gpt-4o-mini is not available, even though the webui apparently doesn't care what model name you entered, which makes the message misleading. One way to correct this is to manually check which model names the webui actually claims to support and enter one of those, or to get the webui to state that it is indeed fine with providing a model called gpt-4o-mini...
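For reference, a minimal sketch of the fallback behavior described above. The function name and structure are illustrative, not the actual COVAS:NEXT code.

```python
import requests

def resolve_model(base_url: str, requested_model: str = "gpt-4o-mini") -> str:
    """Ask the provider's models endpoint which models exist; if the requested
    model is not listed, downgrade to gpt-3.5-turbo (sketch of the described check)."""
    try:
        resp = requests.get(f"{base_url.rstrip('/')}/models", timeout=10)
        resp.raise_for_status()
        available = {m["id"] for m in resp.json().get("data", [])}
    except requests.RequestException:
        # Provider has no models endpoint (or the request failed):
        # trust whatever the user entered.
        return requested_model

    if requested_model not in available:
        # Requested model not listed (e.g. a very young OpenAI account): downgrade.
        return "gpt-3.5-turbo"
    return requested_model
```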
Could we make the check itself optional, depending on the entered LLM endpoint?
The check is already optional, but we could disable the downgrade/upgrade when the provider is not OpenAI.
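A sketch of what that gating could look like: only apply the downgrade/upgrade when the configured endpoint is OpenAI itself, otherwise keep the user's model name as-is. Names and the detection heuristic are assumptions, not the project's actual implementation.

```python
def maybe_adjust_model(base_url: str, requested_model: str, available: set[str]) -> str:
    """Downgrade gpt-4o-mini to gpt-3.5-turbo only for OpenAI endpoints;
    any other provider keeps the user-entered model name untouched."""
    is_openai = "api.openai.com" in base_url  # simple heuristic (assumption)
    if not is_openai:
        return requested_model
    if requested_model == "gpt-4o-mini" and "gpt-4o-mini" not in available:
        return "gpt-3.5-turbo"
    return requested_model
```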
Adding documentation for a locally running LLM server.