Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ScanCode: Extract more information #184

Open
5 tasks
abraemer opened this issue Jan 16, 2025 · 0 comments
Open
5 tasks

ScanCode: Extract more information #184

abraemer opened this issue Jan 16, 2025 · 0 comments

Comments

@abraemer
Copy link
Contributor

abraemer commented Jan 16, 2025

ScanCode has a lot of options of what it should output. On one hand, we need to think about what options we absolutely need and document that. OTOH, if there is additional information (such as the license information) it would be nice to also use this. This has the potential that there could be a lot of combinations of information. It would be nice if our tool was "smart" and always could use the information present in the most sensible way.

In Opossum we have the additional fields: frequentLicenses and baseUrlsForSources

Example: If ScanCode was run with -license-text --license-references then it would be easy to populate frequentLicenses

Steps:

  • Learn about the different options of ScanCode
  • Create a test catalog of files using as many different argument combinations as possible
  • Ensure that the pydantic models can handle all of these properly, i.e. add them to the Options model and introduce aditional model if necessary
  • adapt the faker models accordingly - depending on the chosen options, we need to generate different fields
  • Go option by option and consider what value the information has and whether it is sensible to include into the opossum output

Note collection:

  • ScanCode provides copyright and license information separately, i.e. a file could have detected copyrights but no license_detections. Can we preserve the copyright information in this case?
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant