-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Advice: Is there a way of sampling a % of spans #36876
Comments
Pinging code owners:
See Adding Labels via Comments if you do not have permissions to add labels yourself. |
Bot the tailsampling and the probabilistic sampling processor support this scenario I think. There is a short explanation when to choose which over here. Based on the ottl label I suspect the tail sampler might be the better fit 🙂 Hope that helps. |
Hi @mkromkamp - I have reviewed those two processors and to be honest I can't quite determine from the configuration examples in the readme how to achieve this. They appear to sample traces first and foremost. I can't see the specific configurations to target a span by say attribute value or name. Could you be more specific about the configuration or samplers you had in mind? Thanks |
Yeah sure. I might need a concrete example. But # Rule 2: low sampling for readiness/liveness probes sounds roughly like what you are potentially looking for. It keeps 10% (0.1) percent of traces for the Reformatted snippet to avoid having to search name: team_a-probe
type: and
and:
and_sub_policy:
- name: service-name-policy
type: string_attribute
string_attribute:
key: service.name
values:
- service-1
- service-2
- service-3
- name: route-live-ready-policy
type: string_attribute
string_attribute:
key: http.route
values:
- /live
- /ready
enabled_regex_matching: true
- name: probabilistic-policy
type: probabilistic
probabilistic:
sampling_percentage: 0.1 |
This would as I understand it sample the whole (sub)trace for that route. Per my original request I want to probability sample a particular span, I want to keep the rest of the spans for that given route. |
I see, that wasn't clear to me from the original post. And is indeed not possible with this approach 🙂 |
Pinging code owners for processor/tailsampling: @jpkrohling. See Adding Labels via Comments if you do not have permissions to add labels yourself. For example, comment '/label priority:p2 -needs-triaged' to set the priority and remove the needs-triaged label. |
Pinging code owners for processor/probabilisticsampler: @jpkrohling @jmacd. See Adding Labels via Comments if you do not have permissions to add labels yourself. For example, comment '/label priority:p2 -needs-triaged' to set the priority and remove the needs-triaged label. |
I take it you are interested in what the probabilistic sampler does (which is to sample individual spans), but that you need to conditionalize the logic by span name or other attribute. The tail sampler has the type of condition you require, but samples whole traces. While not exactly what you're asking for, there is work at the OTel specification level for composite samplers which includes a rule-based approach combined with consistent probability sampling. It would be natural to extend the probabilistic sampler processor to support such features. See open-telemetry/opentelemetry-specification#4321 |
Component(s)
pkg/ottl
Describe the issue you're reporting
I have some noisy spans that I want to filter. But equally I want to know if they're still being produced and from where.
Ideally I'd implement a filter with a percentage, so 1% of the spans still get through. Though I can't see a rand function or something like modulo, though I could possibly craft it using divide. In the absence of rand I could try to use something like the last digit component as my random source.
This doesn't seem like a novel idea though so potentially it's a solved problem and my searching skills aren't up to scatch.
These are leaf spans so it wouldn't impact any children.
I don't think this is worthy use of the tail sampling filter either.
The text was updated successfully, but these errors were encountered: