Deploy DL Workspace cluster on Azure Container Service (ACS)

This document describes how to deploy a DL Workspace cluster on ACS. The ACS deployment procedure is still being improved, so please contact the authors if you encounter any issues.

  1. Follow this document to set up the DLWorkspace dev environment. Then log in to your Azure subscription on your dev machine via `az login`.

  2. Configure your ACS cluster.

  3. Set proper authentication.

  4. Run the deployment script on the dev machine under src/ClusterBootstrap, as follows:

./deploy.py acs
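
Before launching the deployment, it can help to confirm that the prerequisites from the steps above are in place. The following is a minimal sketch, not part of DLWorkspace: `have_cmd` and `preflight` are hypothetical helper names introduced here, and the check assumes that `az account show` succeeds only when `az login` has been completed and that the current directory is src/ClusterBootstrap.

```shell
# Hypothetical preflight helpers (not part of deploy.py).

# Returns 0 if the named command can be found by the shell.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# Checks the prerequisites described above and reports the first one missing.
preflight() {
  have_cmd az || { echo "az CLI not found; install it first" >&2; return 1; }
  # Assumption: 'az account show' fails when no login session exists.
  az account show >/dev/null 2>&1 || { echo "not logged in; run: az login" >&2; return 1; }
  # The deployment script is expected in the current directory.
  [ -f ./deploy.py ] || { echo "deploy.py not found; cd to src/ClusterBootstrap" >&2; return 1; }
  echo "preflight ok"
}
```

With these functions defined in the current shell, `preflight && ./deploy.py acs` starts the deployment only when every check passes.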

The deployment script executes the following commands in sequence.

1. Set up a basic K8S cluster on ACS:
```
./deploy.py acs deploy
```

2. Label nodes and deploy services:
```
./deploy.py acs postdeploy
```

3. Mount storage on nodes:
```
./deploy.py acs storagemount
```

4. Install GPU drivers on nodes (if needed):
```
./deploy.py acs gpudrivers
```

5. Install the network plugin:
```
./deploy.py acs freeflow
```

6. Build the Docker images and configuration files needed for restfulapi, jobmanager, and webui:
```
./deploy.py acs bldwebui
```

7. Start DLWorkspace pods:
```
./deploy.py acs restartwebui
```
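
When troubleshooting, running the sub-steps one at a time can make it easier to see which one failed. The sketch below walks through the seven subcommands listed above in order and stops at the first failure. It is an illustration under stated assumptions, not the logic inside `deploy.py`: the names `ACS_STEPS`, `run_acs_steps`, `DRY_RUN`, and `SKIP_STEPS` are hypothetical and introduced only for this example, and the script is assumed to be run from src/ClusterBootstrap.

```shell
# Sub-steps in the order given above.
ACS_STEPS="deploy postdeploy storagemount gpudrivers freeflow bldwebui restartwebui"

# Hypothetical wrapper (not part of deploy.py).
#   DRY_RUN=1 (default): only print the commands that would run.
#   DRY_RUN=0: run each command and stop at the first failure.
#   SKIP_STEPS: space-separated steps to leave out, e.g. "gpudrivers".
run_acs_steps() {
  for step in $ACS_STEPS; do
    # Skip any step named in SKIP_STEPS.
    case " ${SKIP_STEPS:-} " in
      *" $step "*) continue ;;
    esac
    if [ "${DRY_RUN:-1}" = "1" ]; then
      echo "./deploy.py acs $step"
    else
      echo "==> ./deploy.py acs $step" >&2
      ./deploy.py acs "$step" || { echo "step '$step' failed" >&2; return 1; }
    fi
  done
}
```

Calling `run_acs_steps` with the defaults prints the command list without touching the cluster. Setting `DRY_RUN=0` executes the steps, and `SKIP_STEPS="gpudrivers"` omits the GPU driver step on clusters that do not need it.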
  5. If you run into a deployment issue, please check here first.