Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HMDB 65k positive mode CID MS/MS #3

Open
tobigithub opened this issue Sep 14, 2024 · 5 comments
Open

HMDB 65k positive mode CID MS/MS #3

tobigithub opened this issue Sep 14, 2024 · 5 comments

Comments

@tobigithub
Copy link

Hi,
can you predict those please?
HMDB 65k metabolites in positive [M+H]+ mode.
Thank you!

HMDB-65k-positive.csv

@NTuan-Nguyen
Copy link
Member

Hello,

The generated spectra is available at:
https://zenodo.org/records/13772810

@tobigithub
Copy link
Author

@NTuan-Nguyen @barupal Thank you!!

@tobigithub
Copy link
Author

Actually only 4064 unique InChIKeys in input file.

@barupal
Copy link
Member

barupal commented Sep 21, 2024

Too many duplicate entries in the original csv file. It should be only ~4000 rows -
HMDB-65k-positive (1).csv

@NTuan-Nguyen , please remove duplicates rows and re-create the MSP file.

@tobigithub
Copy link
Author

Actually the order might be important or the names, but in terms of computational efficiency the duplicates (by full INCHIKEY) could be an issue. For small data sets maybe this is not a problem, but for very large ones (millions of compounds) this could be an issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants