Clone repos from da-server #1

Open
robobenklein opened this issue Feb 4, 2020 · 6 comments

@robobenklein
Member

We need some way for workers to clone from WoC repos.

Perhaps WoC could use a token, or we could create a single SSH key shared by all CA workers?

@agrant16

agrant16 commented Feb 5, 2020

One SSH key should probably work, though I don't know if they have rate limiting set up, which might interfere with our efforts. I still don't have access to WoC. I've emailed Dr. Mockus to see if he can go ahead and get my access set up this week.

@robobenklein
Member Author

Sounds like an SSH key would work, but I think it would be better practice to get a separate account for the workers to use, since there doesn't seem to be any protection on the MongoDB databases on the da servers.
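
For reference, a minimal sketch of how a worker might clone with one dedicated key, assuming the key is stored at a path the worker can read. The helper names `clone_env` and `clone_repo` and the example key path are hypothetical; `GIT_SSH_COMMAND` and the `ssh` options `-i` and `IdentitiesOnly` are standard git/OpenSSH features.

```python
import os
import shlex
import subprocess


def clone_env(key_path, base_env=None):
    """Return an environment that makes git use only the given SSH key."""
    env = dict(os.environ if base_env is None else base_env)
    # IdentitiesOnly=yes stops ssh from also offering keys held by an agent.
    env["GIT_SSH_COMMAND"] = "ssh -i {} -o IdentitiesOnly=yes".format(
        shlex.quote(key_path)
    )
    return env


def clone_repo(url, dest, key_path):
    """Clone url into dest using the dedicated worker key (hypothetical helper)."""
    subprocess.run(["git", "clone", url, dest], env=clone_env(key_path), check=True)
```

Pairing the key with a separate worker account, as suggested above, would keep the workers' access limited to what that account can reach.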

@robobenklein
Member Author

ok new plan from today:

build a repo list based on the thruMaps for each target language:

  • get blob
  • find repo for commit that had that blob
  • export repo information

export to JSON, copy it from the server, and import it into the db using a model changer
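
A rough sketch of the blob -> commit -> repo walk and the JSON export, not the actual WoC interface: the two dictionaries stand in for whatever lookups the maps provide, and the function names and JSON layout are assumptions for illustration.

```python
import json


def repos_for_blobs(blobs, blob_to_commits, commit_to_repos):
    """Collect every repo containing a commit that introduced one of the blobs.

    blob_to_commits and commit_to_repos are plain dicts here; they are
    hypothetical stand-ins for the real map lookups.
    """
    repos = set()
    for blob in blobs:
        for commit in blob_to_commits.get(blob, ()):
            repos.update(commit_to_repos.get(commit, ()))
    return sorted(repos)


def export_repo_list(repos, language):
    """Serialize the repo list for one target language as JSON text."""
    return json.dumps({"language": language, "repos": repos}, indent=2)
```

The resulting JSON text can be written to a file, copied off the server, and loaded by whatever import step feeds the db.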

@agrant16

We can get repos directly from the output of the thruMaps. We can just cut the repo out of that output directly and filter it through uniq. Or is there some advantage to collecting blobs from the thruMaps and going blob->commit->repo that I'm missing?
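
Roughly what the cut-and-uniq approach could look like, as a sketch only. The thruMaps output format is not shown in this thread, so the semicolon separator and the repo column index are assumptions passed in as parameters, and `unique_repos` is a hypothetical name.

```python
def unique_repos(lines, repo_field=0, sep=";"):
    """Return the distinct repo names from delimited lines, in first-seen order.

    repo_field and sep are assumptions about the thruMaps output layout.
    """
    seen = set()
    repos = []
    for line in lines:
        fields = line.rstrip("\n").split(sep)
        if len(fields) <= repo_field:
            continue  # skip malformed or short lines
        repo = fields[repo_field]
        if repo and repo not in seen:
            seen.add(repo)
            repos.append(repo)
    return repos
```

Unlike piping through `uniq`, this does not need sorted input, because it tracks every repo seen so far rather than only the previous line.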

@robobenklein
Member Author

we can get the list of repos for a language, but I think there's also some useful metadata stored elsewhere that we'd otherwise want to pull from the GitHub API, and for non-GH (GL, BB, etc.) repos we should come up with some kind of common set of stats

@agrant16

That's true. I'm not sure what metadata we can pull from WoC, but I'll look into it more and see. If we can get what we want from there instead, that would make things easier.
