Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Hi everyone,
as mentioned in #14 , I've added a command line interface to standartization from SMILES strings (namely, from input files containing SMILES as their first column). Also, I added an option to filter compounds using PAINS filters in RDKit as here -- it might be useful to switch it off by default, if you think it's more appropriate for this package.
The interface is following:
So in order to filter
test.smi
, one should do the following:The downside is that it prints a lot of logging messages to stdout, and I could not completely disable them. For example, if I do
chembl_std --header test.smi > out.smi
, I'd get:The current workaround is to do
chembl_std --header test.smi | grep -v Normalizer > out.smi
. If someone knows how to manage it better, I'd appreciate.