You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
python finetune_hf_peft.py --environment=pypi run --smoke True
Full run
python finetune_hf_peft.py --environment=pypi run
Download your lora adapter and move it to $NIM_PEFT_SOURCE
importosfrommy_peft_toolsimportdownload_latest_checkpointdownload_latest_checkpoint(
lora_name="llama3-8b-instruct-alpaca-custom", # remember this name, you'll need it laterlora_dir=os.path.join(os.path.expanduser('~'), 'loras') # NOTE: this is the default
)
curl -X 'POST''http://0.0.0.0:8000/v1/chat/completions' -H 'accept: application/json' -H 'Content-Type: application/json' -d '{ "model": "llama3-8b-instruct-alpaca-custom", "messages": [ { "role":"user", "content":"Hello! How are you?" }, { "role":"assistant", "content":"Hi! I am quite well, how can I help you today?" }, { "role":"user", "content":"Can you write me a song?" } ], "top_p": 1, "n": 1, "max_tokens": 15,}'