Skip to content

Static boundary / alerting for categorical features #139

Answered by mbaak
jeaninejuliettes asked this question in Q&A
Discussion options

You must be logged in to vote

Hello,

Thanks for the kind words.

  1. Yes, you can set fixed traffic light boundaries. Simply set the option monitoring_rules when generating the report. See for examples:
    https://github.com/ing-bank/popmon/blob/master/popmon/pipeline/report.py#L81

  2. For any categorical feature we compare the distribution of a new batch of data with the reference histogram. If the distributions are significantly different an alert will be raised. We don't explicitly check for empty bins, since these can also happen in low statistics data batches, but if an empty bin causes a significant difference between the two histograms, it will certainly show up in the test statistics. For example in the metric "max_pr…

Replies: 3 comments 3 replies

Comment options

You must be logged in to vote
2 replies
@jeaninejuliettes
Comment options

@jeaninejuliettes
Comment options

Answer selected by mbaak
Comment options

You must be logged in to vote
1 reply
@jeaninejuliettes
Comment options

Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants