Skip to content

linsun/gen-al-demo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

gen-al-demo

Gen AL Demo with Istio Ambient

Prerequisites

  • A Kubernetes cluster, for example a kind cluster.
  • kubectl

Startup

We have crafted a few scripts to make this demo run as quickly as possible on your machine once you've installed the prerequisites.

This script will:

  • Create a kind cluster
  • Install a simple curl client, an ollama service and the demo service.
    • Ollama is a Language Model as a Service (LMaaS) that provides a RESTful API for interacting with large language models. It's a great way to get started with LLMs without having to worry about the infrastructure.
./startup.sh

Pull the LLM models

The following two LLM models are used in the demo:

  • LLaVa (Large Language and Vision Assistant)
  • Llama (Large Language Model Meta AI) 3.2

Pull the two models:

kubectl exec -it deploy/client -- curl http://ollama.ollama:80/api/pull -d '{"name": "llama3.2"}'
kubectl exec -it deploy/client -- curl http://ollama.ollama:80/api/pull -d '{"name": "llava"}'

Install Istio ambient and enroll all the apps to Istio ambient

We use Istio to secure, observe and control the traffic among the microservices in the cluster.

./install-istio.sh

Access the demo app

Use port-forwarding to help us access the demo app:

kubectl port-forward svc/demo 8001:8001

To access the demo app, open your browser and navigate to http://localhost:8001

Cleanup

To clean up the demo, run the following command:

./shutdown.sh

Operating System Information

This demo has been tested on the following operating systems and will work if you have the prerequisites installed.

  • macOS M2

Credits

A portion of the demo in this repo was inspired by the github.com/cncf/llm-in-action repo.

About

Gen AL Demo with Istio Ambient

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages