You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I am opening this issue as a follow up from #43 as I am encountering the exact same problems as mentioned here with BatchtoolsParam() in fitGAM() on a slurm cluster and can bring some insight after some tests
the error names(res) <- nms is because res is length of cells and nms is length of genes. To test this I had an sds object and counts matrix of 100 genes and 37 cells and got the error
Error in names(res) <- nms :
'names' attribute [100] must be the same length as the vector [37]
When increasing to 200 genes it became
Error in names(res) <- nms :
'names' attribute [200] must be the same length as the vector [37]
Then I tried 37 cells and 37 genes and it gave the same second error as trebbiano :
Now I am pretty sure this is purely an error on how batchtools sends jobs and collects them, as using any other option such as MulticoreParam() does not throw any error, I think it might come from the fact that in bplapply(), .fitGAM() sets the first parameter as as.data.frame(t(as.matrix(counts)[id, ])) while in most use case of bplapply() it is a simple integer vector such as 1:6 and I think that it completely throws off batchtools called by BiocParallel.
I am entirely dependent on bioconda for all packages version unfortunately, versions I use are
r-batchtools 0.9.17
bioconductor-biocparallel 1.36.0
bioconductor-tradeseq 1.16.0
Which look like they are the latest as these versions are the same in RStudio
What seems very strange is that while my object has for example 100 genes and 37 cells, I would expect that 100 jobs would be submitted to the cluster, 1 for each GAM fitting, instead it submits 37 jobs, 1 for each cell.
The text was updated successfully, but these errors were encountered:
Alexis-Varin
changed the title
Using BatchtoolsParam in fitGAM()
Using BatchtoolsParam() in fitGAM()
Jul 4, 2024
Hello, I am opening this issue as a follow up from #43 as I am encountering the exact same problems as mentioned here with
BatchtoolsParam()
infitGAM()
on a slurm cluster and can bring some insight after some teststhe error
names(res) <- nms
is because res is length of cells and nms is length of genes. To test this I had an sds object and counts matrix of 100 genes and 37 cells and got the errorWhen increasing to 200 genes it became
Then I tried 37 cells and 37 genes and it gave the same second error as trebbiano :
Now I am pretty sure this is purely an error on how
batchtools
sends jobs and collects them, as using any other option such asMulticoreParam()
does not throw any error, I think it might come from the fact that inbplapply()
,.fitGAM()
sets the first parameter asas.data.frame(t(as.matrix(counts)[id, ]))
while in most use case ofbplapply()
it is a simple integer vector such as1:6
and I think that it completely throws offbatchtools
called byBiocParallel
.I am entirely dependent on bioconda for all packages version unfortunately, versions I use are
r-batchtools 0.9.17
bioconductor-biocparallel 1.36.0
bioconductor-tradeseq 1.16.0
Which look like they are the latest as these versions are the same in RStudio
What seems very strange is that while my object has for example 100 genes and 37 cells, I would expect that 100 jobs would be submitted to the cluster, 1 for each GAM fitting, instead it submits 37 jobs, 1 for each cell.
The text was updated successfully, but these errors were encountered: