-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add bwa #103
Conversation
relates to Replace minimap2 with another mapper for short-read data? #102
Nice! Looks good, I just built a new container w/ the latest RUN conda install bwa=0.7.18 samtools=1.20 && conda clean -a --> so it matches the conda environment! If we can have this in |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I didn't run the updated pipeline but the changes look reasonable. I'll leave testing on real data to yourself and @hoelzer
|
||
script: | ||
""" | ||
INDEX=`find -L ./ -name "*.amb" | sed 's/\\.amb\$//'` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Isn't the index passed in as a parameter (db_index)? If you only want to strip off the .amb
you can use something like:
index_root=${db_index%.amb}
(not tested)
see https://pubs.opengroup.org/onlinepubs/9699919799/utilities/V3_chap02.html#tag_18_06_02
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The BWA processes are copied from nf-core 😇
The index process outputs a folder called bwa
, and with this line, you find the folder. Maybe amb
is BWA specific? 🤔
configs/node.config
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I know this is a Nextflow thing, but vertically aligning curly braces makes for unnecessarily larger diffs with meaningless changes. I'd consider it bad style but that's just me. I can ignore them in GitHub but it's more of a pain when using command line git (e.g. git blame especially).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've seen too much nf-core code lately 😅
https://nf-co.re/docs/contributing/code_editors_and_styling/harshil_alignment
Great! I would like to test it “real live” on the data where I also saw such discrepancies between |
bwa mem \\ | ||
-t $task.cpus \\ | ||
\$INDEX \\ | ||
$input \\ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
bwa mem
does automatically take care of single-end or paired-end read input, or? So you just pass $input
and then this will be either a SE FASTQ or two PE FASTQs, right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, for single-end, it would be one fastq file; for paired-end, $input
would be foo_1.fastq foo_2.fastq
(space-separated)
Did you use |
No description provided.