Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

spike: model files pvc image hardening example #979

Open
Tracked by #750
justinthelaw opened this issue Sep 4, 2024 · 0 comments · May be fixed by #752
Open
Tracked by #750

spike: model files pvc image hardening example #979

justinthelaw opened this issue Sep 4, 2024 · 0 comments · May be fixed by #752
Assignees
Labels
enhancement New feature or request spike

Comments

@justinthelaw
Copy link
Contributor

justinthelaw commented Sep 4, 2024

Our new method of loading models into our inferencing engines requires an initContainer that uses Zarf Injection to mount a PV into the inferencing engine's container. This initContainer + PV is technically another container that needs to be approved through hardening processes.

This spike is to see how we can separate out the injection into its own container for hardening, to include the model files, and then run this package through the IronBank hardening process. The model we will use is the defenseunicorns/Hermes-2-Pro-Mistral-7B-4bit-32g-GPTQ, and the targeted backend will be the vLLM backend.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request spike
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant