Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add G groups, frequency data, and update protein sequence to #90

Draft
wants to merge 42 commits into
base: master
Choose a base branch
from

Conversation

apmody
Copy link

@apmody apmody commented Mar 30, 2021

  1. Add three and four field HLA alleles ("MHC gene allele") and corresponding nucleotide sequences.
  2. Add terms for G groups
  3. Choose "best" protein sequence for chain-sequences.tsv so that the longest protein sequence is chosen from all alleles that have the same first two-fields (e.g. choose the longest protein sequence between HLA-A*01:140:01 and HLA-A*01:140:02)
  4. Add frequency data based on CIWD 3.0 data for different HLA alleles (data, paper)

@rvita
Copy link
Collaborator

rvita commented Mar 30, 2021 via email

@apmody
Copy link
Author

apmody commented Apr 1, 2021

For item #3, do you also update the accession and the resource name to match the new sequence?

Yes, I have added the accession and the resource name as it appears in IMGT.

@jamesaoverton
Copy link
Collaborator

This make mro.owl task must pass before this can be merged. It's failing for me locally and here in GitHub Actions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants