Store a history of the process count #10

Open
duncandewhurst opened this issue Nov 24, 2021 · 5 comments

Comments

@duncandewhurst

When a data source has collection errors, it's difficult to tell how much data is missing. For example, the most recent Kyrgyzstan dataset reports one error and has ~144k processes, whereas previous Kyrgyzstan datasets had ~675k processes.

As an analyst, I would like to see the history of the process count for each dataset so that I can quickly identify if there is a significant amount of missing data.
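To illustrate the kind of check a stored history would make possible, here is a minimal sketch. It assumes a per-scrape record of the process count; the `Scrape` class, the `flag_drops` function, the 50% threshold, and the scrape dates are all hypothetical and are not part of this project. The counts are the approximate Kyrgyzstan figures quoted above.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Scrape:
    """One collection run for a dataset (hypothetical structure)."""
    scraped_on: date
    process_count: int


def flag_drops(history, threshold=0.5):
    """Return (scrape, fractional_drop) pairs for scrapes whose process
    count fell by more than `threshold` relative to the highest count
    seen in any earlier scrape."""
    flagged = []
    peak = 0
    for scrape in sorted(history, key=lambda s: s.scraped_on):
        if peak > 0:
            drop = 1 - scrape.process_count / peak
            if drop > threshold:
                flagged.append((scrape, drop))
        peak = max(peak, scrape.process_count)
    return flagged


# Dates are made up; counts are the approximate figures from this issue.
history = [
    Scrape(date(2021, 10, 1), 675_000),
    Scrape(date(2021, 11, 1), 144_000),
]

for scrape, drop in flag_drops(history):
    print(f"{scrape.scraped_on}: {scrape.process_count} processes, "
          f"{drop:.0%} below the previous peak")
```

Comparing against the earlier peak rather than only the preceding scrape means that a run of consecutive incomplete scrapes would still be flagged.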

cc @mrshll1001

@jpmckinney

Hi @duncandewhurst, can you please document your use of this project in terms of publisher feedback?

cc @yolile

@duncandewhurst
Author

I don't use it for publisher feedback. I use it for ad-hoc queries and for data use training as discussed in open-contracting/ocdskit#75 (comment), CRM-7149, https://opencontractingint.slack.com/archives/C01N5FPUWNM/p1626272242005000 and https://opencontractingint.slack.com/archives/C01N5FPUWNM/p1633382613005800.

@jpmckinney

jpmckinney commented Nov 25, 2021

👍 Since the GitHub issue was about a quality issue, I thought it might be related to publisher feedback.

@duncandewhurst
Author

Just something that I noticed whilst preparing for a training :-)

@kindly
Contributor

kindly commented Dec 7, 2021

@duncandewhurst sorry, I missed this.

We actually keep not just the process count but the count for every field for every scrape. We just do not expose that to the user yet.
