Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a BA.1 reference for the web nextclade version #1426

Open
murallcl opened this issue Feb 23, 2024 · 4 comments
Open

Add a BA.1 reference for the web nextclade version #1426

murallcl opened this issue Feb 23, 2024 · 4 comments
Labels
t:feat Type: request of a new feature, functionality, enchancement

Comments

@murallcl
Copy link

Hello!
We would like to run some datasets against a BA.1 reference using the web version of Nextclade.
Currently there are Wuhan, BA.2 and some of BA.2's offspring.
Could a BA.1 reference dataset be added?
Thank you!

@murallcl murallcl added good first issue Good for newcomers help wanted Extra attention is needed needs triage Mark for review and label assignment t:feat Type: request of a new feature, functionality, enchancement labels Feb 23, 2024
@ivan-aksamentov
Copy link
Member

ivan-aksamentov commented Feb 23, 2024

Hi @murallcl,

Might be possible if our scientists still have some time and forces!

In the meantime, do you know that we now have a guide for people to create, use and share their own datasets: https://github.com/nextstrain/nextclade_data/blob/master/docs/README.md ?

And you can find the machinery used for preparing our SC2 datasets here: https://github.com/neherlab/nextclade_data_workflows

It would be great if you can help!

@ivan-aksamentov ivan-aksamentov removed help wanted Extra attention is needed good first issue Good for newcomers needs triage Mark for review and label assignment labels Feb 23, 2024
@corneliusroemer
Copy link
Member

Hi @murallcl - would you be able to explain your use case for the BA.1 reference dataset? That would help us understand whether there might be another way to achieve your goal without requiring a new dataset.

I'm not sure we will add new datasets for clades that have died out. However, you could patch the existing Wuhan-Hu-1 dataset to make your own BA.1 reference dataset.

In addition, we've been thinking about allowing mutation reporting relative to arbitrary references in the future - but this feature is still some time out.

@murallcl
Copy link
Author

Hello Ivan and Cornelius,
Thanks for your prompt responses. We're using it to run some retrospective analyses (i.e. we're comparing sets of sequences from the period of time when BA.1 was circulating) and this is also doubling as genomic epi training for non-bioinformaticians (so, not commandline users of nextclade).
More generally though, I think a BA.1 reference (or an OG Omicron B.1.1.529 reference) may still be relevant for those investigating long-lasting chronic infections or zoonosis.

I tried adding the BA.1 fasta consensus sequence from your lineage library (https://github.com/corneliusroemer/pango-sequences) by dropping it into 'customize' the wuhan but it gave me an error when it came time to comparing it to the sequences. I assumed I was missing input data for the customization but I did not investigate further.
I can appreciate this isn't high priority, but if there's any further suggestions on how to customize the wuhan with the BA.1 SNPs, I'd be happy to try it.
Thanks!
Carmen Lia

@rneher
Copy link
Member

rneher commented May 9, 2024

just replacing the reference without replacing the tree won't work since mutations in the tree are coded relative to the reference. but you could just use a dataset without a tree. this won't give you any clades. You could use this minimal dataset

https://github.com/nextstrain/nextclade_data/tree/master/docs/minimal-dataset

and replace the reference by BA.1 and the annotation by the one from the SARS-CoV-2 dataset. this then will call mutations relative to BA.1, but won't assign any lineages (it needs a tree for that)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
t:feat Type: request of a new feature, functionality, enchancement
Projects
None yet
Development

No branches or pull requests

4 participants