Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding more RNA-Seq predictions #15

Open
mictadlo opened this issue Apr 14, 2023 · 1 comment
Open

Adding more RNA-Seq predictions #15

mictadlo opened this issue Apr 14, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@mictadlo
Copy link

Description of feature

Hi,
I recently came across Mikado which is a pipeline to identify the most useful or “best” set of transcripts from multiple transcript assemblies. Our approach leverages transcript assemblies generated by multiple methods to define expressed loci, assign a representative transcript and return a set of gene models that selects against transcripts that are chimeric, fragmented or with short or disrupted CDS. Loci are first defined based on overlap criteria and each transcript therein is scored based on up to 50 available metrics relating to ORF and cDNA size, relative position of the ORF within the transcript, UTR length and presence of multiple ORFs. Mikado can also utilize blast data to score transcripts based on proteins similarity and to identify and split chimeric transcripts. Optionally, junction confidence data as provided by Portcullis can be used to improve the assessment. The best-scoring transcripts are selected as the primary transcripts of their respective gene loci; additionally, Mikado can bring back other valid splice variants that are compatible with the primary isoform.

giy093fig1

Mikado uses GTF or GFF files as mandatory input. Non-mandatory but highly recommended input data can be generated by obtaining a set of reliable splicing junctions with Portcullis_, by locating coding ORFs on the transcripts using either Transdecoder or Prodigal, and by obtaining homology information through either BLASTX or DIAMOND.

Could the output from Mikado be used to train Augustus, GlimmerHMM and SNAP?

Than you for considering.

Best wishes,

Michal

@mictadlo mictadlo added the enhancement New feature or request label Apr 14, 2023
@marchoeppner
Copy link
Collaborator

Well, shame on me for missing this...

I'll have a look at this once I have managed to get release 1.0 out; the nf-core linting template is breaking my will to live at the moment ;) But after that, happy to make the overall workflow a bit more sophisticated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants