
[Rate Type] Concurrencies #47

Open
philschmid opened this issue Sep 5, 2024 · 2 comments

@philschmid
Contributor

Hello,

I am trying to integrate guidellm into a benchmark suite, where we run different load tests based on user concurrencies. We define user concurrencies as "users" that send requests one after another: send request -> wait for response -> send next request.

I first assumed that's what "constant" and "rate" do, but far more requests are sent, since those rates are defined per second. Is there a way to customize the "user concurrency"? I assume that concurrency == the synchronous type. But it would be great if I could do something like

guidellm --target "http://localhost:8080/v1" --model "meta-llama/Meta-Llama-3.1-8B-Instruct"  --data-type emulated --data "prompt_tokens=550,generated_tokens=250" --max-seconds 60 --rate-type concurrent --rate 1 --rate 2 --rate 10 --rate 50 --output-path r.json
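
For context, the closed-loop behavior described here (each simulated user blocks on its own response before sending again) can be sketched with plain asyncio; the endpoint URL, payload shape, and prompt sizing below are illustrative assumptions, not guidellm internals:

```python
import asyncio
import aiohttp


async def user_loop(session: aiohttp.ClientSession, url: str,
                    payload: dict, deadline: float) -> int:
    """One 'user': send a request, wait for the response, then send the next."""
    loop = asyncio.get_running_loop()
    completed = 0
    while loop.time() < deadline:
        async with session.post(url, json=payload) as resp:
            await resp.read()  # wait for the full response before sending again
        completed += 1
    return completed


async def run(concurrency: int, max_seconds: float) -> None:
    # Hypothetical OpenAI-compatible completions endpoint and payload.
    url = "http://localhost:8080/v1/completions"
    payload = {
        "model": "meta-llama/Meta-Llama-3.1-8B-Instruct",
        "prompt": "x " * 550,  # rough stand-in for prompt_tokens=550
        "max_tokens": 250,     # generated_tokens=250
    }
    deadline = asyncio.get_running_loop().time() + max_seconds
    async with aiohttp.ClientSession() as session:
        counts = await asyncio.gather(
            *(user_loop(session, url, payload, deadline) for _ in range(concurrency))
        )
    print(f"concurrency={concurrency}: {sum(counts)} requests in {max_seconds:.0f}s")


if __name__ == "__main__":
    for users in (1, 2, 10, 50):  # mirrors --rate 1 --rate 2 --rate 10 --rate 50
        asyncio.run(run(users, max_seconds=60))
```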
@markurtz
Member

Hey @philschmid, I understand what you mean about this request. You'd specifically like to be able to keep a fixed number of concurrent requests over the life of the benchmark, where as soon as one finishes it immediately starts a new one, is that correct? You can't easily achieve that currently through the constant or poisson rate types, since those are set as a number of requests per second, so you'd have to adjust them until you hit the average number of concurrent users, right?

@markurtz markurtz self-assigned this Sep 10, 2024
@markurtz markurtz added the enhancement New feature or request label Sep 10, 2024
@philschmid
Contributor Author

Hey,

Yes. I am looking for a way to benchmark the load under e.g. 1, 2, 4, 8, 16, 32, 64, 128 concurrent users (send request -> wait for response, send again).

But looking into more benchmarks and dashboards, people seem to be switching to QPS (which the rate type should cover). So I am not sure how important this is.
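
For contrast with the closed-loop pattern above, a QPS-style (open-loop) generator fires requests at a fixed interval no matter how many responses are still outstanding; again a minimal sketch with the same illustrative endpoint and payload:

```python
import asyncio
import aiohttp


async def fire(session: aiohttp.ClientSession, url: str, payload: dict) -> None:
    # Open loop: each request is independent; the sender never waits for
    # an outstanding response before scheduling the next one.
    async with session.post(url, json=payload) as resp:
        await resp.read()


async def run_qps(qps: float, max_seconds: float) -> None:
    url = "http://localhost:8080/v1/completions"  # hypothetical endpoint, as above
    payload = {
        "model": "meta-llama/Meta-Llama-3.1-8B-Instruct",
        "prompt": "hello",
        "max_tokens": 250,
    }
    interval = 1.0 / qps
    tasks = []
    async with aiohttp.ClientSession() as session:
        loop = asyncio.get_running_loop()
        end = loop.time() + max_seconds
        while loop.time() < end:
            tasks.append(asyncio.create_task(fire(session, url, payload)))
            await asyncio.sleep(interval)  # fixed send rate, regardless of replies
        await asyncio.gather(*tasks)


asyncio.run(run_qps(qps=5.0, max_seconds=60))
```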
