Replies: 4 comments
-
This is a great idea. I'd like to work with you to flesh this idea out. At the moment, the first thing that comes to mind is the work that @Jack005 was working on last semester. He was looking at betweeness centrality, which is the measure of how often a node lies on the shortest path between all other nodes. In this case you are looking at a bipartite graph, so you may want to use the bipartite variation on this metric: I don't know if this metric will capture the idea that you are looking for, but it would be an easy first shot at it. You might also want to look up the idea of "structural holes", as it's related to what you are getting at I think. http://www.econ.upf.edu/docs/seminars/burt.pdf Besides looking at existing work in this area, I encourage you to flesh out why this "Connector Ranking" interests you. There are infinite possible mathematical expressions we could compute over this graph. The question is which ones represent meaningful constructs. If there were some other variable that we were trying to predict in terms of this 'Connector Ranking', then we could compare different ranking metrics in their ability to predict that dependent variable. This is certainly the direction this research is going. |
Beta Was this translation helpful? Give feedback.
-
This paper by Pohl and Diehl (2008) directly pertains to this and should be used as a foundation for this work. |
Beta Was this translation helpful? Give feedback.
-
I wonder if this is related to some of the things we've been discussing regarding people's participation in groups, @npdoty |
Beta Was this translation helpful? Give feedback.
-
Ah, yes, I have been doing this kind of work with bipartite graphs and seeing the overlaps between participation, but with mailing lists rather than git repositories. I'm using the bitpartite-specific measure, although I evaluated closeness centrality rather than betweenness centrality in my research notebook: |
Beta Was this translation helpful? Give feedback.
-
One of the things that I'd like to work on for the end of the semester would be to explore the specific people that repositories depend on.
I'd like to start with an Author->Repo graph, where the edge weights represent how many commits that author has made to a given repo.
I'd then like to give each author a "Connector Rating" (we'll find a better name later) that basically represents how much of a role that author plays in connecting all of those repos. However, what I need help figuring out is how to calculate this value.
Some factors that will probably play a role:
I'd like to draw the author->repo graph but mark the author nodes based on their Connector Rating (make them bigger or something.)
Beta Was this translation helpful? Give feedback.
All reactions