Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Discover collaborating orgs via contributor activity feeds #32

Open
andrew opened this issue Jun 15, 2020 · 6 comments
Open

Discover collaborating orgs via contributor activity feeds #32

andrew opened this issue Jun 15, 2020 · 6 comments

Comments

@andrew
Copy link
Collaborator

andrew commented Jun 15, 2020

For issues and pull requests that are opened by non-core non-collab folk, we can look in their personal activity feeds for PushEvent events to see which orgs that user has commit access to, and so is likely to be a core contributor of.

We have the contributors table now so we can store the etag of the users latest activity poll to keep api requests to a minimum, we can also store the events in the existing events table.

This can then be used to generate some kind of weekly report of suggested orgs as potential new collabs, perhaps even speculatively indexing some of their repositories to look for dependencies on internal packages.

andrew added a commit that referenced this issue Jun 16, 2020
@andrew
Copy link
Collaborator Author

andrew commented Jun 16, 2020

Did a quick spike of this using recent activity from the top 5% external contributors to ipfs projects (who've opened issues or pull requests), there are definitely some orgs that are using ipfs in there, also some related software orgs like bazil, d3, npm, golang that get used a lot by ipfs projects, so it's tending to find both up and downstream projects when we really are just looking for downstream (things that use ipfs rather that things that ipfs uses).

Interestingly there was no cross-over between those top 5% orgs, which suggests that there is often one person within an org that is handling contributions and bug reporting to ipfs:

@andrew
Copy link
Collaborator Author

andrew commented Jun 16, 2020

Further filtering by these orgs but ones that have mentions of "ipfs" in their codebase works well, 40% of possible orgs include at least one result and 20% had over ten results:

Organization GitHub code search for "ipfs" result count
livepeer 1407
proximax-storage 281
berty 267
quorumcontrol 249
glvd 183
ipfs-search 146
jboss-fuse 92
AdExNetwork 78
rs-ipfs 66
simpleaswater 65
IBM 52
rsksmart 49
iotaledger 40
ipfs-rust 37
PolkaX 33
pravahio 32
sunshine-protocol 26
p2p-cdn 24
0xProject 23
neocities 20
wildfly-extras 19
hashmatter 19
NovakDistributed 17
prysmaticlabs 16
ssbc 12
Cryptorado-Community 11
threefoldfoundation 11
YPII 10
gcash 10
rootchain 10
JaquerEspeis 10
ZeroNetJS 9
threefoldtech 9
Caelumlabs 8
lorena-ssi 8
MRH-io 8
PegaSysEng 6
demonovel 6
OpenZeppelin 6
runtimeverification 4
mercode-org 3
opencollective 3
ipws 3
comit-network 3
nomic-io 2
tauri-apps 2
npm 2
dsrp 2
tasitlabs 2
edgi-govdata-archiving 2
rocicorp 2
golang 2
LavaMoat 1
hackforthesea 1
Qm64-tech 1
sigp 1
arborchat 1
auto-mat 1
subuser-security 1
appliedzkp 1
ssd-solar 0
ZeroNerds 0
cabal-club 0
digidem 0
kappa-db 0
ox-inet-resilience 0
scuttlebutt-eu 0
cryptoscope 0
vdfresearch 0
planetary-social 0
cantaverde 0
Safe2COVIDApp 0
finalitylabs 0
tozd 0
wlanslovenija 0
meteor-vue 0
fastify 0
pinojs 0
webview 0
npm-wharf 0
digitalbitbox 0
metadata-research 0
tikv 0
nimona 0
dorkbotpdx 0
hashcloak 0
thoughtram 0
rebase-book 0
cyclestreets 0
wmfs 0
overviewer 0
CCI-MOC 0
rust-ethereum 0
flathub 0
reinfer 0
rust-windowing 0
bazil 0
matsuri-rb 0
LegendsOfLearning 0
CommunityOfCoders 0
journeymidnight 0
greenmobility 0
copenhagenjs 0
tox-rs 0
getuni 0
grafana 0
RadikalSternchen 0
luzifer-docker 0
luzifer-aur 0
nohelloinfo 0
chainx-org 0
pantareijs 0
desidera 0
chain76org 0
jeditools 0
sipb 0
battlecode 0
linkchecker 0
beloitcollegecomputerscience 0
rust-native-ui 0
LiberalTomorrow 0
sfcpc 0
vdom-rs 0
parchiff 0
ipfnio 0
d3 0
visionscarto 0
portmapping 0
javscrape 0
Piraattinuoret 0
coblox 0
testcontainers 0
gumcast 0
little-core-labs 0
hypermodules 0

@andrew
Copy link
Collaborator Author

andrew commented Jun 16, 2020

Here's the output of a similar query for libp2p, finding lots of the expected collabs and doing a good job of filtering out false positives:

Organization GitHub code search for "libp2p" result count
ipfs 1816
filecoin-project 385
status-im 212
PegaSysEng 128
ethereum 122
ZeroNetJS 45
MetaMask 29
drand 12
open-services 7
protocol 6
ipfs-cluster 2
mntnr 1
hashcloak 1
openethereum 1
SearchAndRescue2 0
async-email 0
rpgp 0
async-rs 0
deltachat 0
RustCrypto 0
ssd-solar 0
mercode-org 0
ZeroNerds 0
sustainers 0
foss-responders 0
IndigoUnited 0
LavaMoat 0
traveltomadeira 0
vdfresearch 0
tikv 0
cyclestreets 0
wmfs 0

@andrew
Copy link
Collaborator Author

andrew commented Jun 16, 2020

One note here for both ipfs and libp2p: collabs that had already been imported into those dashboards where filtered out from suggestions, incase anyone was worried that those collabs had been missed.

@andrew
Copy link
Collaborator Author

andrew commented Jun 16, 2020

Filecoin didn't find many non-protocol orgs as potential collabs.

Organization GitHub code search for "filecoin" result count
ipld 322
ipfs 135
ProtoSchool 25
protocol 8
ipfs-shipyard 8
libp2p 7
drand 4
multiformats 1
rocicorp 0
quarantenalive 0
ipfs-rust 0
soramitsu 0
celo-org 0

@autonome
Copy link

autonome commented Jun 17, 2020 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants