Skip to content
Merged
Show file tree
Hide file tree
Changes from 7 commits
Commits
Show all changes
41 commits
Select commit Hold shift + click to select a range
91803e9
Updates for issue 90:
xiaoliz0 Apr 22, 2026
62d3f99
Remove some comment lines.
xiaoliz0 Apr 22, 2026
d3b2ccf
New updates from molecular biology group: The column "Filter_rescued"…
xiaoliz0 Apr 24, 2026
2c29b0a
Merge branch 'main' into develop_issue90
xiaoliz0 Apr 30, 2026
170bfbe
Implement new updates for the request: Only print last column for the…
xiaoliz0 May 4, 2026
df6490e
Merge branch 'develop_issue90' of https://github.com/InPreD/PRONTO in…
xiaoliz0 May 4, 2026
00b0738
Fix some codes after merging. Fix some updated codes from Martin.
xiaoliz0 May 5, 2026
59815d0
Update Script/PRONTO.py
xiaoliz0 May 6, 2026
d831764
Add a fake sample for testing PRONTO new functions.
xiaoliz0 May 18, 2026
fc9187c
Merge branch 'develop_issue90' of https://github.com/InPreD/PRONTO in…
xiaoliz0 May 18, 2026
873c97a
Add files via upload
xiaoliz0 May 20, 2026
be10379
Delete test_data/ous/test_files_for_PRONTO_newFunctions/IPD2225-D01-P…
xiaoliz0 May 28, 2026
2e13ca1
Add files via upload
xiaoliz0 May 28, 2026
76724a1
Delete test_data/ous/test_files_for_PRONTO_newFunctions/ouput/IPD2225…
xiaoliz0 May 28, 2026
0759ae7
Add files via upload
xiaoliz0 May 28, 2026
3fd5541
Rename test_data/ous/test_files_for_PRONTO_newFunctions/IPD2225-D01-P…
xiaoliz0 May 28, 2026
f4d4087
feat: refactor topfilter parsing into new function
marrip May 7, 2026
37bcabb
test: add unittest for topfilter function
marrip May 7, 2026
83debb1
feat: use pandas for read_tsv function
marrip May 7, 2026
c4ec36c
test: add test for new read_tsv function
marrip May 7, 2026
7038a3a
feat: refactor by using pandas for consolidating
marrip May 7, 2026
59ee58f
chore: rm unused value in config file
marrip May 7, 2026
12d693c
fix: correct pandas.read_csv, thanks @xiaoliz0
marrip May 12, 2026
8e016fd
fix: ignore comment lines in small variant table file
marrip May 12, 2026
7148413
fix: use correct dict key
marrip May 21, 2026
ff86c6f
fix: use correct function to write csv
marrip May 21, 2026
7ce228b
fix: correct output file path
marrip May 21, 2026
f1d7870
fix: correct column name Sample_ID
marrip May 21, 2026
3e179b1
fix: try to get rid of future pandas error by using .loc
marrip May 21, 2026
6b661d3
test: update tests for changed functions
marrip May 21, 2026
23777a0
test: structure test data according to pronto's expectations
marrip May 21, 2026
6181c62
chore: update version in Dockerfile
marrip May 21, 2026
6c571ef
chore: update column formatting
May 21, 2026
2d0162f
chore: correct column assignment
May 21, 2026
cc0dc47
chore: update expected col type in test
May 21, 2026
709abfe
test: add missing png files for ppt
marrip May 21, 2026
43ad19e
chore: add silversearcher-ag feature
marrip May 21, 2026
7fb2566
fix: replace index in pandas dataframe to ensure proper deduplication
marrip May 21, 2026
0340f32
fix: correct format in pronto metadata files
marrip May 26, 2026
6c623ad
fix: correct format in pronto metadata files
marrip May 26, 2026
111c9f3
Merge pull request #106 from marrip/refactor-issue90
xiaoliz0 Jun 2, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
61 changes: 41 additions & 20 deletions Config/configure_PRONTO.ini
Original file line number Diff line number Diff line change
Expand Up @@ -7,72 +7,93 @@ inpred_node = OUS
data_path = /data/sample_data/analysis_results/
;Please modify this if you need to specify file encoding as "ISO-8859-1" or other encoding standards in your local environment to read special Norwegian characters.
encoding_sys = utf-8
;Specify the number of columns you want to do the filtering (NB: this will also make the script to generate the number of output tables):
filter_col_nu = 5
;Specify the number of top filter sections with highest priority, these sections will be named staring with FILTER0 in this configure. (NB: the other filter sections will use the comnbined results from all FILTER0):
top_filter = 3
;Specify the number of filter sections with low priority. (NB: this will also make the script to generate the number of output tables):
filter_col_nu = 4
;Specify the number of max rows of the table per slide starting from the 8th slide in report. This is used to split long tables.
table_max_rows_per_slide = 14
;Please modify this for local env if you use MTF files to import the clinical data into meta file. Specify the version of year of the MTF files.
material_file_version = 2026

[FILTER1]
[FILTER0-1]
;pecify the column name need to be filtered:
filter_column = CPSR_ACMG_class,CPSR_ClinVar_class
;Specify the key word for the filter column. If there are multiple keywords need to be filtered at the same time, please split them with ','. You can use ' && ' to combine the keywords startswith '!' as one filter condition:
key_word = Likely_Pathogenic,Pathogenic
;Specify the column names need to be transfered to new data file. Split the column names with ','. NB: the column contents must be the same as they are in the sheet.
columns = Gene_symbol,Change_summary,Genomic_location,DNA_change,cDNA_change,Protein_change_short,Coding_status,Depth_tumor_DNA,AF_tumor_DNA,Depth_normal_DNA,AF_normal_DNA,Depth_tumor_RNA,AF_tumor_RNA,TSO500_LocalApp_class,Class_judgement_comments
;Specify the column name to add in the right end of the table to verify this filter condition. NB: if this value is given, a new column will be added into the tables in the report. Same column name in different filter sections will be merged into one filter column.
filter_column_add = Filter_rescued
;Specify the output result file name. Please note that you type should not exceed 31 characters and not contain any of the characters ':' ' \' '/' '?' '*' '[' or ']':
output_table = pathogenic_germline

[FILTER0-2]
;pecify the column name need to be filtered:
filter_column = Gene_symbol,Protein_change_short
;Specify the key word for the filter column. If there are multiple keywords need to be filtered at the same time, please split them with ','. You can use ' && ' to combine the keywords startswith '!' as one filter condition:
key_word = TERT
;Specify the column names need to be transfered to new data file. Split the column names with ','. NB: the column contents must be the same as they are in the sheet.
columns = Gene_symbol,Change_summary,Genomic_location,DNA_change,cDNA_change,Protein_change_short,Coding_status,Depth_tumor_DNA,AF_tumor_DNA,Depth_normal_DNA,AF_normal_DNA,Depth_tumor_RNA,AF_tumor_RNA,TSO500_LocalApp_class,Class_judgement_comments
;Specify the column name to add in the right end of the table to verify this filter condition. NB: if this value is given, a new column will be added into the tables in the report. Same column name in different filter sections will be merged into one filter column.
filter_column_add = Filter_rescued
;Specify the output result file name. Please note that you type should not exceed 31 characters and not contain any of the characters ':' ' \' '/' '?' '*' '[' or ']':
output_table = TERT

[FILTER0-3]
;In this filter section, the input data is the combined table generated from FILTER0.
;Specify the column name need to be filtered:
filter_column = Class_judgement
;Specify the key word for the filter column. If there are multiple keywords need to be filtered at the same time, please split them with ','. You can use ' && ' to combine the keywords startswith '!' as one filter condition:
key_word = LikSOM,SOM,LikGL,incon,include
;Specify if you want to output all columns based on the filter. 'True' means yes, and a default output table 'workingTable' will be generated in the output folder extra_files:
all_col_output = True
;Specify the column names need to be transfered to new data file. Split the column names with ','. NB: the column contents must be the same as they are in the sheet.
columns = Gene_symbol,Change_summary,Genomic_location,DNA_change,cDNA_change,Protein_change_short,Coding_status,Depth_tumor_DNA,AF_tumor_DNA,Depth_normal_DNA,AF_normal_DNA,Depth_tumor_RNA,AF_tumor_RNA,TSO500_LocalApp_class,Class_judgement_comments
;Specify the miniment number of Depth_tumor for data to choose:
min_depth_tumor_DNA = 0
;Specify the output result file name. Please note that you type should not exceed 31 characters and not contain any of the characters ':' ' \' '/' '?' '*' '[' or ']':
output_table = preMTBTable_Appendix

[FILTER2]
;In this filter section, the input data is the first table generated from FILTER1.
[FILTER1]
;In this filter section, the input data is the combined table named 'workingTable' generated from all sections of FILTER0.
;Specify the column name need to be filtered:
filter_column = Coding_status
;Specify the key word for the filter column. If there are multiple keywords need to be filtered at the same time, please split them with ','. You can use ' && ' to combine the keywords startswith '!' as one filter condition. And '!' means NOT:
key_word = !x:noncoding_variant && !synonymous_variant
all_col_output = False
;Specify the column names need to be transfered to new data file. Split the column names with ','. NB: the column contents must be the same as they are in the sheet.
columns = Gene_symbol,Ensembl_transcript_ID,Exon_number,Protein_change_short,Coding_status,AF_tumor_DNA
columns = Gene_symbol,Ensembl_transcript_ID,Exon_number,Protein_change_short,Coding_status,AF_tumor_DNA,Filter_rescued
;Specify the output result file name. Please note that you type should not exceed 31 characters and not contain any of the characters ':' ' \' '/' '?' '*' '[' or ']':
output_table = sequence_summary

[FILTER3]
;In this filter section, the input data is the first table generated from FILTER1.
[FILTER2]
;In this filter section, the input data is the combined table named 'workingTable' generated from all sections of FILTER0.
;Specify the column name need to be filtered:
filter_column = Coding_status
;Specify the key word for the filter column. If there are multiple keywords need to be filtered at the same time, please split them with ','. You can use ' && ' to combine the keywords startswith '!' as one filter condition. And '!' means NOT:
key_word = !x:noncoding_variant
all_col_output = False
;Specify the column names need to be transfered to new data file. Split the column names with ','. NB: the column contents must be the same as they are in the sheet.
columns = Gene_symbol,Genomic_location,DNA_change,cDNA_change,Protein_change_short,Coding_status,Depth_tumor_DNA,AF_tumor_DNA
columns = Gene_symbol,Genomic_location,DNA_change,cDNA_change,Protein_change_short,Coding_status,Depth_tumor_DNA,AF_tumor_DNA,Filter_rescued
;Specify the output result file name. Please note that you type should not exceed 31 characters and not contain any of the characters ':' ' \' '/' '?' '*' '[' or ']':
output_table = AllReporVariants_CodingRegion

[FILTER4]
;In this filter section, the input data is the first table generated from FILTER1.
[FILTER3]
;In this filter section, the input data is the combined table named 'workingTable' generated from all sections of FILTER0.
;Specify the column name need to be filtered:
filter_column = Coding_status
;Specify the key word for the filter column. If there are multiple keywords need to be filtered at the same time, please split them with ','. You can use ' && ' to combine the keywords startswith '!' as one filter condition. And '!' means NOT:
key_word = !x:noncoding_variant && !synonymous_variant
all_col_output = False
;Specify the column names need to be transfered to new data file. Split the column names with ','. NB: the column contents must be the same as they are in the sheet.
columns = Gene_symbol,Protein_change_short,Coding_status,AF_tumor_DNA
;Specify the output result file name. Please note that you type should not exceed 31 characters and not contain any of the characters ':' ' \' '/' '?' '*' '[' or ']':
output_table = AllReporVariants_AltProtein

[FILTER5]
;In this filter section, the input data is the first table generated from FILTER1.
[FILTER4]
;In this filter section, the input data is the combined table named 'workingTable' generated from all sections of FILTER0.
;Specify the column name need to be filtered:
filter_column = Reporting_judgement
;Specify the key word for the filter column. If there are multiple keywords need to be filtered at the same time, please split them with ','. You can use ' && ' to combine the keywords startswith '!' as one filter condition. And '!' means NOT:
key_word = BioMar,VUSpot,AddRes,GLrel
all_col_output = False
;Specify the column names need to be transfered to new data file. Split the column names with ','. NB: the column contents must be the same as they are in the sheet.
columns = Gene_symbol,Coding_status,Functional_domain,AF_tumor_DNA,CPSR_ACMG_class,Reporting_judgement
columns = Gene_symbol,Coding_status,Functional_domain,AF_tumor_DNA,CPSR_ACMG_class,Reporting_judgement,Filter_rescued
;Specify the output result file name. Please note that you type should not exceed 31 characters and not contain any of the characters ':' ' \' '/' '?' '*' '[' or ']':
output_table = Main_Findings

Expand Down
Loading
Loading