I can't seem to find any documentation on how to specify parameters such as maximum generation length, stop tokens, temperature, etc., for decoder-based models like GPT-2. Currently my API requests only generate a single token, and I'd obviously like to generate more (preferably up to a specified stop token).
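For context, this is roughly the kind of control I'm after, sketched with plain Hugging Face transformers (just to illustrate the parameters, not this project's API):

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

inputs = tokenizer("The quick brown fox", return_tensors="pt")
output_ids = model.generate(
    **inputs,
    max_new_tokens=64,                    # maximum generation length
    do_sample=True,                       # enable sampling so temperature takes effect
    temperature=0.7,                      # sampling temperature
    eos_token_id=tokenizer.eos_token_id,  # stop token
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```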
@tanmayb123 Currently we are not planning to expose those parameters. You can try either adding the parameters yourself via Triton, or passing the desired parameters as JSON.
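In case it helps, here is a rough sketch of what "passing the parameters as JSON" could look like from a Triton HTTP client, assuming the server-side model is extended with an extra BYTES input carrying a JSON-encoded generation config. The model name and tensor names (`gpt2_generate`, `TEXT`, `GENERATION_CONFIG`, `OUTPUT_TEXT`) are placeholders and would need to match your own `config.pbtxt`:

```python
import json
import numpy as np
import tritonclient.http as httpclient

# Hypothetical model/tensor names -- adjust to your Triton model configuration.
MODEL_NAME = "gpt2_generate"

client = httpclient.InferenceServerClient(url="localhost:8000")

# Prompt as a BYTES tensor.
text = np.array([["The quick brown fox"]], dtype=object)
text_input = httpclient.InferInput("TEXT", list(text.shape), "BYTES")
text_input.set_data_from_numpy(text)

# Generation parameters serialized as a JSON string in a second BYTES tensor.
params = np.array(
    [[json.dumps({"max_new_tokens": 64, "temperature": 0.7, "stop_token": "<|endoftext|>"})]],
    dtype=object,
)
params_input = httpclient.InferInput("GENERATION_CONFIG", list(params.shape), "BYTES")
params_input.set_data_from_numpy(params)

result = client.infer(
    MODEL_NAME,
    inputs=[text_input, params_input],
    outputs=[httpclient.InferRequestedOutput("OUTPUT_TEXT")],
)
print(result.as_numpy("OUTPUT_TEXT"))
```

The server-side Python backend (or equivalent) would then need to parse the JSON input and run the generation loop with those settings; the snippet above only covers the client side.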