-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Suggestion: leverage data formats #2
Comments
The audience I chose the input "format" for is academic and data science groups, and a lot of them have preference for formats like text / csv, even over json (and definitely yaml), so based on the fact that people will most readily copy paste a GitHub url (including organization and repository name) and then we just need a list of tags after that, a single url then comma delimited tags is a simple and logical solution. I think the formats you shared would be reasonable if there were more needed than that. What kind of use cases do you have in mind for just data output? |
Writing yaml is already a requirement for setting up the GH action. Copy pasting URLs rather than pulling them apart is a fair point. I'm interpreting that as a deliberate choice to prioritize simple to update, over self-documenting. You could have that same priority with something very close to the custom txt you're using, while still being valid json/yml. # yaml style, closest to txt format
https://github.com/organization/example-repo: [hello, world]
# json style
{
"https://github.com/organization/example-repo": ["hello", "world"]
}
# yaml alternative, with some empty lines would be nice to read
https://github.com/organization/example-repo:
- hello
- world
Raw data in a well-supported and simple format like JSON, is by far the best interoperability choice you could make 😄 Take the other issue: #3 Having the scores.json format, allowed me to create SourceCred widgets before I had a clue about how any of the SourceCred code worked or what sort of APIs or tooling it had. Likewise, I think giving the option to output JSON, allows people to use this action as input for whatever else they can come up with. Maybe they would like to load the json into some javascript browser tool rather than static-rendering. Maybe they'll load it into their own script and add bounties / cred scores for these issues. Maybe they want to store it in a database. Whatever the use-case is, having "just data" will make it significantly easier to do. |
I am in total support! I would suggest that we wait for someone actually asking for this, before designing something that isn't asked for. |
Noticing some parsing of the txt file is happening:
good-first-issues/scripts/generate-first-issues.py
Lines 39 to 41 in 266448b
JSON/YAML could probably make this easier and self-documenting:
Likewise, I think it would be really useful to have an option to produce data output. Rather than directly going to front matter + markdown. Even if just internally in the python scripts.
Here's some pseudo code of what I mean in terms of interfaces.
This would make it trivial to support writing JSON data out instead of MD files :]
The text was updated successfully, but these errors were encountered: