HowToRun_vLLM_Models Readme Fixes #127

anirudTT · 2025-01-15T16:44:50Z

Various bugs / improvements in the HowToRun_vLLM_Models README


> Process SpawnProcess-1:
> Traceback (most recent call last):
>   File "/tt-metal/ttnn/ttnn/operations/core.py", line 622, in as_tensor
>     tensor = ttnn._ttnn.tensor.load_tensor(cache_file_name, device=device)
> RuntimeError: Cannot open "/home/user/cache_root/tt_metal_cache/cache_repacked-llama-3.1-70b-instruct/layers.0.attention.wqkv_fused.weight_multi_device_8_dtype_BFLOAT8_B_layout_TILE.bin"

The text was updated successfully, but these errors were encountered:

anirudTT mentioned this issue Jan 22, 2025

update instructions to get lamma model setup for tt studio #155

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HowToRun_vLLM_Models Readme Fixes #127

HowToRun_vLLM_Models Readme Fixes #127

anirudTT commented Jan 15, 2025 •

edited

Loading

HowToRun_vLLM_Models Readme Fixes #127

HowToRun_vLLM_Models Readme Fixes #127

Comments

anirudTT commented Jan 15, 2025 • edited Loading

anirudTT commented Jan 15, 2025 •

edited

Loading