graph LR
classDef component fill:#decbe4,stroke:#333,color:#000
classDef anndata fill:#d9d9d9,stroke:#333,color:#000
normalization:::group
dataset_processors:::group
raw_dataset["Raw dataset"]:::anndata
common_dataset[Common<br/>dataset]:::anndata
dataset_loader[/Dataset<br/>loader/]:::component
subgraph normalization [Normalization methods]
log_cpm[/"Log CPM"/]:::component
l1_sqrt[/"L1 sqrt"/]:::component
log_scran_pooling[/"Log scran<br/>pooling"/]:::component
sqrt_cpm[/Sqrt CPM/]:::component
end
subgraph dataset_processors[Dataset processors]
pca[/PCA/]:::component
hvg[/HVG/]:::component
knn[/KNN/]:::component
end
dataset_loader --> raw_dataset --> log_cpm & l1_sqrt & log_scran_pooling & sqrt_cpm --> pca --> hvg --> knn --> common_dataset
Originally posted by @rcannood in #247 (comment)
For instance:
Common dataset workflow
graph LR classDef component fill:#decbe4,stroke:#333,color:#000 classDef anndata fill:#d9d9d9,stroke:#333,color:#000 normalization:::group dataset_processors:::group raw_dataset["Raw dataset"]:::anndata common_dataset[Common<br/>dataset]:::anndata dataset_loader[/Dataset<br/>loader/]:::component subgraph normalization [Normalization methods] log_cpm[/"Log CPM"/]:::component l1_sqrt[/"L1 sqrt"/]:::component log_scran_pooling[/"Log scran<br/>pooling"/]:::component sqrt_cpm[/Sqrt CPM/]:::component end subgraph dataset_processors[Dataset processors] pca[/PCA/]:::component hvg[/HVG/]:::component knn[/KNN/]:::component end dataset_loader --> raw_dataset --> log_cpm & l1_sqrt & log_scran_pooling & sqrt_cpm --> pca --> hvg --> knn --> common_datasetTask-specific benchmarking workflow
Discussion
However, this workflow might not be applicable for all tasks.