-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CGE phenotype not populated for E. faecium pbp5 when all essential mutations are present #210
Comments
What's happening is we have a process that explicitly looks for complex mutations (as defined in the complex mutations file) that replaces a set of individually reported point mutations with a single complex mutation if there's an appropriate match. Individual Enterococcus faecium pbp5 Point MutationsIn your example with 18 Enterococcus faecium pbp5 point mutations, there aren't enough mutations to perform one of the replacements defined in the complex mutations file. So the 18 pbp5 point mutations remain listed individually, and information associated with each of them is pulled from the CGE Pointfinder Enterococcus faecium database and provided to the user of StarAMR. The CGE Pointfinder database treats these 18 mutations the same as every other point mutation (listing
We show this note from CGE in StarAMR's
So in this case we're showing the CGE-predicted phenotype as Complex Enterococcus faecium pbp5 Point MutationsIn your example with 19 Enterococcus faecium pbp5 point mutations, there are enough mutations to perform one of the replacements, so the 19 pbp5 mutations are replaced with a single complex mutation. These complex mutations are defined in the complex mutations file. The resulting complex mutation does not list ampicillin as a
In this case, meaning exactly the following mutations:
With (to my interpretation) V586L being non-essential:
Whereas our complex mutations file defines either:
or
as mandatory and then grabs all other related mutations and collapses them into a single complex mutation file. Importantly, the CGE Pointfinder database does NOT make the same predictions we use in the complex mutations file (3 vs 19), so it would be wrong to list SummaryStarAMR is trying to wrap around CGE's databases (Pointfinder in this case) and is trying to relay this information back to the user of StarAMR. The complex mutation functionality implemented by StarAMR is an attempt to allieviate some issues with how CGE reports this data by default (expecting users to interpret it themselves). In your example with the complex mutation replacement, CGE is not predicting the mutations we've manually defined in the complex mutations file, so we report it under the In your example with the incomplete complex mutation, CGE reports these mutations like any other, showing There are of course solutions to these problems, but a risk with writing many specific exceptions and special cases is that it makes it harder to update the databases used by StarAMR when CGE updates the databases on their end. It's ultimately matter of priorities though. |
Hello,
I have two Enterococcus faecium assemblies that I have run through staramr v0.10.0 using the default databases with
--pointfinder-organism enterococcus_faecium --pid-threshold 90
. Assembly1 has all 19 pbp5 mutations that contribute to ampicillin resistance according to the CGE key. Assembly2 has 18/19 mutations, and is missing the pbp5 (P667S) mutation, which from my understanding is a mandatory mutation listed in the complex pbp5 mutations file.The detailed summary output for assembly 1 (all 19/19 pbp5 mutations) show the pbp5 mutations get aggregated to a single line and the CGE Phenotype is left blank:
The detailed summary output for assembly 2 (18/19 pbp5 mutations, missing one mandatory P667S) has the pbp5 mutations listed separately one per line and CGE phenotype is populated:
It's not clear to me why the CGE phenotype column for assembly1 is left blank when according to the CGE key these 19 mutations are required for ampicillin resistance, and yet the CGE phenotype column is populated "Ampicillin" for assembly2 when one of the essential mutations is missing. It seems the Predicted Phenotype column is calling them as I would expect: "ampicillin" for assembly1, and "unknown[pbp5 (mutation)]" for assembly 2.
The text was updated successfully, but these errors were encountered: