Add support for variant scoring by MIVMIR, GICAM models#812
Conversation
|
Warning Newer version of the nf-core template is available. Your pipeline is using an old version of the nf-core template: 3.5.1. For more documentation on how to update your pipeline, please see the nf-core documentation and Synchronisation documentation. |
|
I'd appreciate your input here @ramprasadn thanks : ) |
262d52c to
543604f
Compare
|
@ramprasadn @peterpru Appreciate your review here. Thanks : ) |
5d2f14b to
f986c17
Compare
fellen31
left a comment
There was a problem hiding this comment.
Nice! Some comments/questions.
f986c17 to
982c61f
Compare
|
982c61f to
504e895
Compare
|
504e895 to
b27e058
Compare
|
b27e058 to
81ea672
Compare
|
fellen31
left a comment
There was a problem hiding this comment.
Looks good overall! There are some tests failing though https://github.com/nf-core/raredisease/actions/runs/25107174156/job/73586432922?pr=812.
I think I'll leave it to @ramprasadn to do a final review if I missed something :)
| - CSQ/SpliceAI_pred_DS_AG | ||
| - most_severe_consequence | ||
| - CADD | ||
| - SWEGENAF |
There was a problem hiding this comment.
Since this is a very specific resource needed for an open source nf-core pipeline, maybe we could add some guidance on where this can be accessed?
There was a problem hiding this comment.
@jemten @ramprasadn Can you please shed some light on how to get a hold on data required for annotating with SWEGENAF, grch37_anon-swegen_str_nsphs_-1000samples-.vcf.gz ?
##INFO=<ID=SWEGENAF,Number=A,Type=Float,Description="Allele Frequency, for each ALT allele, in the same order as listed (from grch37_anon-swegen_str_nsphs_-1000samples-.vcf.gz)">
| with (path(workflow.out.vcf.get(0).get(1)).vcf.variants) { | ||
| for (int i = 0; i < size(); i++) { | ||
| assert snapshot(get(i).getAttribute('MivmirScore')).match('MIVMIR_GICAM-' + get(i).getChr() + '-' + get(i).getStart() + '-MivmirScore') | ||
| assert snapshot(get(i).getAttribute('MivmirExplanation')).match('MIVMIR_GICAM-' + get(i).getChr() + '-' + get(i).getStart() + '-MivmirExplanation') | ||
| assert snapshot(get(i).getAttribute('GicamScore')).match('MIVMIR_GICAM-' + get(i).getChr() + '-' + get(i).getStart() + '-GicamScore') |
There was a problem hiding this comment.
Sure! I think it would be okay to snapshot the first few lines also to see that it contains the correct info keys.
For loops are being removed though, is it possible to use each?
https://docs.seqera.io/nextflow/strict-syntax#for-and-while-loops
There was a problem hiding this comment.
I'll leave it to @ramprasadn to explain how to snapshot ch_publish, since I'm not sure what the best way is.
81ea672 to
92eb9e0
Compare
|
Rebased on latest dev |
9e1d573 to
e2786a6
Compare
|
|
Tests failing now is related to:
|
be6047f to
ab95221
Compare
|
aebd98f to
d2e69e9
Compare
8f4b477 to
9916853
Compare
|
| """ | ||
| } | ||
|
|
||
| process MIVMIR_INTERNAL_UNIT_TEST { |
There was a problem hiding this comment.
Move this inside main process and have env as enabler flag.
There was a problem hiding this comment.
Prefer set env MIVMIR_INTERNAL_TEST in main test config.
9916853 to
cdc427e
Compare
Signed-off-by: Tor Björgen <tor.bjorgen@scilifelab.se>
cdc427e to
7fa6051
Compare
Add support for SNV variant ranking using MIVMIR, GICAM models.
TODOs
PR checklist
nf-core pipelines lint).nextflow run . -profile test,docker --outdir <OUTDIR>).nextflow run . -profile test_singleton,docker --outdir <OUTDIR>).nextflow run . -profile debug,test,docker --outdir <OUTDIR>).docs/usage.mdis updated.docs/output.mdis updated.CHANGELOG.mdis updated.README.mdis updated (including new tool citations and authors/contributors).