Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Network Statistics #118

Open
simisimon opened this issue Feb 23, 2022 · 1 comment · May be fixed by #120
Open

Network Statistics #118

simisimon opened this issue Feb 23, 2022 · 1 comment · May be fixed by #120

Comments

@simisimon
Copy link
Member

I want to investigate the evolution of the configuration space in modern software development.
To this end, we need to calculate statistics during the the analysis of the commit history of a software system.

Statistics of interests are:

  • size of the network in terms of the number of nodes
  • total number of each node type
  • number of links
  • how often are values changed
  • which configuration file are changed
  • how often are links added/destroyed
  • what is the value range of options

We probably cannot calculate all statistics in the network. In this case, we need to prepare and store the necessary data somewhere (e.g., csv file), so that we can calculate those statistics later, for example, with an external script.

@simisimon
Copy link
Member Author

Since we want to analyze the commit history of the software system, we have to calculate these numbers for each commit. However, since configuration changes are not made in every commit, it would make sense to analyze only those commits in which configuration files are changed. This way, we would decrease the number of csv files we have to create.

@simisimon simisimon linked a pull request Mar 18, 2022 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant