We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Add requirement: fschat. Fix wrong parameter name: tensor-parallel-size
Add description about streaming output usage. Fix some error.
Updated api_calls_vllm_zh (markdown)
修改load_in_8bit为load_in_kbit及部分说明信息
add page 'api_calls_vllm_zh'