-
Notifications
You must be signed in to change notification settings - Fork 340
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Identifying Enterprise Owners in Github #1373
Conversation
When first creating this PR, all the checks worked except the DCO check, which prompted me to sign off on my commits. In doing that, I rebased and, I think I did something wrong. This PR here, the actual contents of the code and changes, look as expected. But I seemed to have touched / pulled in commits other than my own. 😭 I am chatting with Alex a little later, and will hold off on trying to fix this lest I make it worse. UPDATE: we chatted, it is okay 🤣 |
Also after discussion, it was determined this PR could benefit from us trying to add in the newer-style models, and using those. Github User does not have those models yet, but we can reference what was done for Github Team. I will do that — aiming for sometime next week. |
@danbrauer lemme know if you'd like to get on a call to talk through how to adopt the model |
@danbrauer - I thought about this and I think we have a way forward. In our conversation, you made a good point where the data model is interesting for GitHub users. Here,
In this case, the So, my proposal to model GitHub users correctly would be to have a GitHubUserNodeSchema where @dataclass(frozen=True)
class GitHubUserSchema(CartographyNodeSchema):
label: str = 'GitHubUser'
properties: GitHubUserNodeProperties = GitHubUserNodeProperties()
other_relationships: OtherRelationships = OtherRelationships([
GitHubUserMemberOfOrg(),
GitHubUserUnaffiliatedOrg(),
GitHubUserAdminOfOrg(),
... <others> ...
]) This means that GitHubUsers will not have automatic cleanup jobs apply to them, so when a user gets offboarded from an organization, their node may appear to be an "orphan" node in the graph with no other associations. I think this is fine and should work. Let me know if I missed something. cc: @ramonpetgrave64 since you were working with me when we were designing the data model. |
@achantavy in a way, every user is their own GitHub org, so I think we could attach every user node to their own org node. This also opens up the possibility of syncing the repos that may belong to a user. Do we automatically cleanup these sub resource/tenancy nodes? |
100%
We do not, and that's fine since there are relatively few of them compared to the others in the graph.
Syncing the repos that belong to a user is a valid scenario. I'm not sure I like the idea of attaching every user node to their own "virtual tenancy" node though because that is an entity that doesn't actually exist. Zooming out, I think we need to have the data model support automatic cleanup jobs where a sub_resource relationship is not defined. In this case, we would not delete stale nodes but we would delete stale relationships. I'll think on this a little bit. For now, I think @danbrauer can continue working on the design described here while I figure out how to handle the cleanup story for objects without sub_resource relationships. |
😅 I am cleaning up some stuff in our fork. Please ignore for now. I will re-open this PR (perhaps as a new PR but otherwise it will be identical here, while being cleaner on our end of things). I'll link for any new PR to here. Sorry for the noise. |
The new PR is here: #1378 Also thank you for the discussion above. I will consider what @achantavy said and if I have a comment I will make it in the new PR but link back to here. |
Summary
This PR adds to the Github graph, marking users as enterprise owners.
We think this is a valuable addition to the graph in general, because these users are not all necessarily visible in the graph at the moment but have broad access. Less generally (but still maybe relevant to others) our analysts at Etsy need to review these users as part of our UAR (User Access Review) process, which we hope Cartography will eventually help to power.
We wanted to do this in a light-touch way, without breaking existing relationships or removing properties. We also wanted to follow how similar properties are graphed on the user node: org ownership, for example, is noted by the 'user.role' property; similarly, the 'user.is_site_admin' property notes whether a user is a site admin). To that end, we did the following:
Here is an illustration of before/after (I will also add some screencap below but thought the high-level illustration might help):
Other notes on the PR
Related issues or links
None.
Screencaps
(I could get other screencaps... if anything would be helpful, please let me know.)
In this case was helpful that we had an enterprise owner who was also a user in one of our orgs, but not another. I highlighted them specifically in a query here, showing both the new property and relationship type.
Before
After
Checklist
Provide proof that this works (this makes reviews move faster). Please perform one or more of the following:
If you are changing a node or relationship:
If you are implementing a new intel module: