-
Notifications
You must be signed in to change notification settings - Fork 4
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Handle importing GFF3s where exons are implicit (#492)
* Start adding test and possibly some fixes for #481 Conversion to GFF3Feature includes source and score and sets ID and Parent attributes. However, export to GFF from UI fails. * Fix imports * Test expanded * Process CDSs, tests extended * Temp commit: Need to fix handling of missing refSeq and proper testing * Add test - need to test with spliced UTRs * Fix header and incorrect phase (not used by Apollo anyway) * Handle spliced UTRs * Fix tests * Re-enable skipped tests --------- Co-authored-by: Garrett Stevens <[email protected]>
- Loading branch information
1 parent
8f1c5d7
commit 06695e9
Showing
10 changed files
with
442 additions
and
9 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
##gff-version 3 | ||
##sequence-region 53cb6b9b4f4ddef1ad47f943 1050 9000 | ||
ctgA example gene 1050 9000 . + . ID=eden | ||
ctgA example mRNA 1050 9000 . + . ID=eden.1;Parent=eden | ||
ctgA example five_prime_UTR 1050 1210 . + 0 ID=five1;Parent=eden.1 | ||
ctgA example CDS 1211 1510 . + 0 ID=cds2;Parent=eden.1 | ||
ctgA example CDS 1611 1710 . + 0 ID=cds2;Parent=eden.1 | ||
ctgA example three_prime_UTR 1711 1800 . + 0 ID=three1;Parent=eden.1 | ||
ctgA example exon 1201 1500 . + 0 ID=exon1;Parent=eden.1 | ||
ctgA example CDS 1601 1700 . + 0 ID=cds1;Parent=eden.1 | ||
ctgA example CDS 1201 1500 . + 0 ID=cds1;Parent=eden.1 | ||
ctgA example TF_binding_site 1050 1100 . + . Parent=eden |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,74 @@ | ||
{ | ||
"_id": "67581b7d5890a8eb1bedab6e", | ||
"refSeq": "ctgA", | ||
"type": "gene", | ||
"min": 1049, | ||
"max": 9000, | ||
"strand": 1, | ||
"children": { | ||
"67581b7d5890a8eb1bedab6c": { | ||
"_id": "67581b7d5890a8eb1bedab6c", | ||
"refSeq": "ctgA", | ||
"type": "mRNA", | ||
"min": 1049, | ||
"max": 9000, | ||
"strand": 1, | ||
"children": { | ||
"67581b7d5890a8eb1bedab66": { | ||
"_id": "67581b7d5890a8eb1bedab66", | ||
"refSeq": "ctgA", | ||
"type": "exon", | ||
"min": 1200, | ||
"max": 1500, | ||
"strand": 1, | ||
"attributes": { "gff_source": ["example"], "gff_id": ["exon1"] } | ||
}, | ||
"67581b7d5890a8eb1bedab67": { | ||
"_id": "67581b7d5890a8eb1bedab67", | ||
"refSeq": "ctgA", | ||
"type": "CDS", | ||
"min": 1210, | ||
"max": 1710, | ||
"strand": 1, | ||
"attributes": { "gff_source": ["example"], "gff_id": ["cds2"] } | ||
}, | ||
"67581b7d5890a8eb1bedab68": { | ||
"_id": "67581b7d5890a8eb1bedab68", | ||
"refSeq": "ctgA", | ||
"type": "CDS", | ||
"min": 1200, | ||
"max": 1700, | ||
"strand": 1, | ||
"attributes": { "gff_source": ["example"], "gff_id": ["cds1"] } | ||
}, | ||
"67581b7d5890a8eb1bedab69": { | ||
"_id": "67581b7d5890a8eb1bedab69", | ||
"refSeq": "ctgA", | ||
"type": "exon", | ||
"min": 1049, | ||
"max": 1510, | ||
"strand": 1 | ||
}, | ||
"67581b7d5890a8eb1bedab6b": { | ||
"_id": "67581b7d5890a8eb1bedab6b", | ||
"refSeq": "ctgA", | ||
"type": "exon", | ||
"min": 1600, | ||
"max": 1800, | ||
"strand": 1 | ||
} | ||
}, | ||
"attributes": { "gff_source": ["example"], "gff_id": ["eden.1"] } | ||
}, | ||
"67581b7d5890a8eb1bedab6d": { | ||
"_id": "67581b7d5890a8eb1bedab6d", | ||
"refSeq": "ctgA", | ||
"type": "TF_binding_site", | ||
"min": 1049, | ||
"max": 1100, | ||
"strand": 1, | ||
"attributes": { "gff_source": ["example"] } | ||
} | ||
}, | ||
"attributes": { "gff_source": ["example"], "gff_id": ["eden"] } | ||
} |
14 changes: 14 additions & 0 deletions
14
packages/apollo-shared/test_data/cds_without_exon_spliced_utr.gff
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
##gff-version 3 | ||
chr1 . gene 1000 9000 . + . ID=gene00001 | ||
#chr1 . mRNA 1300 9000 . + . ID=mRNA00001;Parent=gene00001 | ||
#chr1 . five_prime_UTR 3000 3300 . + . Parent=mRNA00001 | ||
#chr1 . CDS 5000 5500 . + 1 ID=cds00001;Parent=mRNA00001 | ||
#chr1 . three_prime_UTR 7601 8000 . + . Parent=mRNA00001 | ||
chr1 . mRNA 1300 9000 . + . ID=mRNA00003;Parent=gene00001 | ||
chr1 . five_prime_UTR 1300 1500 . + . Parent=mRNA00003 | ||
chr1 . five_prime_UTR 3000 3300 . + . Parent=mRNA00003 | ||
chr1 . CDS 3301 3902 . + 0 ID=cds00003;Parent=mRNA00003 | ||
chr1 . CDS 5000 5500 . + 1 ID=cds00003;Parent=mRNA00003 | ||
chr1 . CDS 7000 7600 . + 2 ID=cds00003;Parent=mRNA00003 | ||
chr1 . three_prime_UTR 7601 8000 . + . Parent=mRNA00003 | ||
chr1 . three_prime_UTR 8501 8900 . + . Parent=mRNA00003 |
Oops, something went wrong.