In this release we introduces a new, .NET Core-based parser which seems to be at least 2x faster than the python one, while also being easier to use: just download the binary for your platform, unzip, and point it to one or all discogs .xml.gz.
Give it a shot. Its performance is so impressive it's likely to become the new de facto parser soon.
Meanwhile, we've updated the python version to allow you to point it directly to xml.gz files. As long as they contain the type of the file in the name -- _artist
or _label
, etc, -- the parser will know what to do with it.
We're also recommending that a Python virtual environment is created to install the requirements. No more having to deal with sudo
or elevated permissions.