-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Solver functions give "no kernel image is available for execution on the device" #318
Comments
This works for me with a fresh nightly build install but fails with stable (0.14). It seems to come from cuml prims: cluster status running
cluster infomarion LocalCUDACluster('tcp://127.0.0.1:45959', workers=8, threads=8, memory=1.08 TB)
client information <Client: 'tcp://127.0.0.1:45959' processes=8 threads=8, memory=1.08 TB>
Input Matrix
[[-1.74721092 2.14871726 6.56769612]
[-1.02317438 2.55977184 5.86181299]
[-4.19011622 4.88020029 6.81418569]
[-2.8100962 2.54433874 8.15304742]
[-3.78735068 4.28623727 7.86674681]
[-3.32177127 2.30468961 7.09697537]]
distributed.worker - WARNING - Compute Failed
Function: _func_fit
args: (PCAMG(), [array([[-2.8100962 , 2.54433874, 8.15304742],
[-3.78735068, 4.28623727, 7.86674681],
[-3.32177127, 2.30468961, 7.09697537]])], 6, 3, [3], 1, False)
kwargs: {}
Exception: RuntimeError("Exception occured! file=/conda/conda-bld/libcumlprims_1591198485167/work/cpp/build/cuml/src/cuml/cpp/src_prims/stats/sum.cuh line=94: FAIL: call='cudaPeekAtLastError()'. Reason:no kernel image is available for execution on the device\nObtained 37 stack frames\n#0 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/pointer_utils.cpython-37m-x86_64-linux-gnu.so(_ZN8MLCommon9Exception16collectCallStackEv+0x3e) [0x7f0fd8e0913e]\n#1 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/pointer_utils.cpython-37m-x86_64-linux-gnu.so(_ZN8MLCommon9ExceptionC1ERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE+0x80) [0x7f0fd8e09c50]\n#2 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN8MLCommon5Stats3opg9mean_implIdLi256EEEvRNS_6Matrix4DataIT_EERKSt6vectorIPS6_SaIS9_EERKNS3_14PartDescriptorERKNS_16cumlCommunicatorESt10shared_ptrINS_15deviceAllocatorEEPP11CUstream_stiP13cublasContext+0xf70) [0x7f0fa317a790]\n#3 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN8MLCommon5Stats3opg4meanERNS_6Matrix4DataIdEERKSt6vectorIPS4_SaIS7_EERKNS2_14PartDescriptorERKNS_16cumlCommunicatorESt10shared_ptrINS_15deviceAllocatorEEPP11CUstream_stiP13cublasContext+0x4e) [0x7f0fa317960e]\n#4 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg8fit_implIdEEvRNS_10cumlHandleERSt6vectorIPN8MLCommon6Matrix4DataIT_EESaISB_EERNS7_14PartDescriptorEPS9_SH_SH_SH_SH_SH_NS_9paramsPCAEPP11CUstream_stib+0x12a) [0x7f0fa312448a]\n#5 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg8fit_implIdEEvRNS_10cumlHandleEPPN8MLCommon6Matrix12RankSizePairEmPPNS6_4DataIT_EEPSB_SF_SF_SF_SF_SF_NS_9paramsPCAEb+0x284) [0x7f0fa3124a14]\n#6 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg3fitERNS_10cumlHandleEPPN8MLCommon6Matrix12RankSizePairEmPPNS5_4DataIdEEPdSD_SD_SD_SD_SD_NS_9paramsPCAEb+0x75) [0x7f0fa310b065]\n#7 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/pca_mg.cpython-37m-x86_64-linux-gnu.so(+0x969e) [0x7f0f8017269e]\n#8 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/pca_mg.cpython-37m-x86_64-linux-gnu.so(+0xad9b) [0x7f0f80173d9b]\n#9 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0x6210) [0x7f0f8015b210]\n#10 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0xbbcb) [0x7f0f80160bcb]\n#11 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0xe90d) [0x7f0f8016390d]\n#12 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyObject_FastCallKeywords+0x15c) [0x562d053d6bec]\n#13 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181661) [0x562d053d7661]\n#14 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x48a2) [0x562d0541d762]\n#15 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x562d0538e138]\n#16 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x562d0541ab8d]\n#17 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x562d0538e138]\n#18 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x562d0541ab8d]\n#19 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x562d0538ed37]\n#20 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x562d053d7455]\n#21 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x562d054194d1]\n#22 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x562d0538e138]\n#23 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x562d0541ab8d]\n#24 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x562d0538ed37]\n#25 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x562d053d7455]\n#26 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x562d054194d1]\n#27 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x562d0538ed37]\n#28 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x562d053d7455]\n#29 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x562d054194d1]\n#30 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyObject_FastCallDict+0x1b6) [0x562d05370ab6]\n#31 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x12f071) [0x562d05385071]\n#32 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(PyObject_Call+0xb4) [0x562d05371214]\n#33 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x2218e3) [0x562d054778e3]\n#34 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x1dac27) [0x562d05430c27]\n#35 in /lib/x86_64-linux-gnu/libpthread.so.0(+0x76db) [0x7f10d5a326db]\n#36 in /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f) [0x7f10d575b88f]\n")
distributed.worker - WARNING - Compute Failed
Function: _func_fit
args: (PCAMG(), [array([[-1.74721092, 2.14871726, 6.56769612],
[-1.02317438, 2.55977184, 5.86181299],
[-4.19011622, 4.88020029, 6.81418569]])], 6, 3, [3], 0, False)
kwargs: {}
Exception: RuntimeError("Exception occured! file=/conda/conda-bld/libcumlprims_1591198485167/work/cpp/build/cuml/src/cuml/cpp/src_prims/stats/sum.cuh line=94: FAIL: call='cudaPeekAtLastError()'. Reason:no kernel image is available for execution on the device\nObtained 37 stack frames\n#0 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/pointer_utils.cpython-37m-x86_64-linux-gnu.so(_ZN8MLCommon9Exception16collectCallStackEv+0x3e) [0x7fbaa193913e]\n#1 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/pointer_utils.cpython-37m-x86_64-linux-gnu.so(_ZN8MLCommon9ExceptionC1ERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE+0x80) [0x7fbaa1939c50]\n#2 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN8MLCommon5Stats3opg9mean_implIdLi256EEEvRNS_6Matrix4DataIT_EERKSt6vectorIPS6_SaIS9_EERKNS3_14PartDescriptorERKNS_16cumlCommunicatorESt10shared_ptrINS_15deviceAllocatorEEPP11CUstream_stiP13cublasContext+0xf70) [0x7fba92a98790]\n#3 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN8MLCommon5Stats3opg4meanERNS_6Matrix4DataIdEERKSt6vectorIPS4_SaIS7_EERKNS2_14PartDescriptorERKNS_16cumlCommunicatorESt10shared_ptrINS_15deviceAllocatorEEPP11CUstream_stiP13cublasContext+0x4e) [0x7fba92a9760e]\n#4 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg8fit_implIdEEvRNS_10cumlHandleERSt6vectorIPN8MLCommon6Matrix4DataIT_EESaISB_EERNS7_14PartDescriptorEPS9_SH_SH_SH_SH_SH_NS_9paramsPCAEPP11CUstream_stib+0x12a) [0x7fba92a4248a]\n#5 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg8fit_implIdEEvRNS_10cumlHandleEPPN8MLCommon6Matrix12RankSizePairEmPPNS6_4DataIT_EEPSB_SF_SF_SF_SF_SF_NS_9paramsPCAEb+0x284) [0x7fba92a42a14]\n#6 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg3fitERNS_10cumlHandleEPPN8MLCommon6Matrix12RankSizePairEmPPNS5_4DataIdEEPdSD_SD_SD_SD_SD_NS_9paramsPCAEb+0x75) [0x7fba92a29065]\n#7 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/pca_mg.cpython-37m-x86_64-linux-gnu.so(+0x969e) [0x7fba8407969e]\n#8 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/pca_mg.cpython-37m-x86_64-linux-gnu.so(+0xad9b) [0x7fba8407ad9b]\n#9 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0x6210) [0x7fba84062210]\n#10 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0xbbcb) [0x7fba84067bcb]\n#11 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0xe90d) [0x7fba8406a90d]\n#12 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyObject_FastCallKeywords+0x15c) [0x564ad9cc2bec]\n#13 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181661) [0x564ad9cc3661]\n#14 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x48a2) [0x564ad9d09762]\n#15 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x564ad9c7a138]\n#16 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x564ad9d06b8d]\n#17 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x564ad9c7a138]\n#18 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x564ad9d06b8d]\n#19 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x564ad9c7ad37]\n#20 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x564ad9cc3455]\n#21 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x564ad9d054d1]\n#22 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x564ad9c7a138]\n#23 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x564ad9d06b8d]\n#24 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x564ad9c7ad37]\n#25 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x564ad9cc3455]\n#26 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x564ad9d054d1]\n#27 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x564ad9c7ad37]\n#28 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x564ad9cc3455]\n#29 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x564ad9d054d1]\n#30 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyObject_FastCallDict+0x1b6) [0x564ad9c5cab6]\n#31 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x12f071) [0x564ad9c71071]\n#32 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(PyObject_Call+0xb4) [0x564ad9c5d214]\n#33 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x2218e3) [0x564ad9d638e3]\n#34 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x1dac27) [0x564ad9d1cc27]\n#35 in /lib/x86_64-linux-gnu/libpthread.so.0(+0x76db) [0x7fbbc534f6db]\n#36 in /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f) [0x7fbbc507888f]\n")
Traceback (most recent call last):
File "dask-cuda-318.py", line 52, in <module>
XT = cumlModel.fit_transform(X_cudf)
File "/datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/dask/decomposition/pca.py", line 189, in fit_transform
return self.fit(X).transform(X)
File "/datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/dask/decomposition/pca.py", line 173, in fit
self._fit(X)
File "/datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/dask/decomposition/base.py", line 101, in _fit
raise_exception_from_futures(list(pca_fit.values()))
File "/datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/dask/common/utils.py", line 144, in raise_exception_from_futures
len(errs), len(futures), ", ".join(map(str, errs))
RuntimeError: 2 of 2 worker jobs failed: Exception occured! file=/conda/conda-bld/libcumlprims_1591198485167/work/cpp/build/cuml/src/cuml/cpp/src_prims/stats/sum.cuh line=94: FAIL: call='cudaPeekAtLastError()'. Reason:no kernel image is available for execution on the device
Obtained 37 stack frames
#0 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/pointer_utils.cpython-37m-x86_64-linux-gnu.so(_ZN8MLCommon9Exception16collectCallStackEv+0x3e) [0x7fbaa193913e]
#1 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/pointer_utils.cpython-37m-x86_64-linux-gnu.so(_ZN8MLCommon9ExceptionC1ERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE+0x80) [0x7fbaa1939c50]
#2 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN8MLCommon5Stats3opg9mean_implIdLi256EEEvRNS_6Matrix4DataIT_EERKSt6vectorIPS6_SaIS9_EERKNS3_14PartDescriptorERKNS_16cumlCommunicatorESt10shared_ptrINS_15deviceAllocatorEEPP11CUstream_stiP13cublasContext+0xf70) [0x7fba92a98790]
#3 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN8MLCommon5Stats3opg4meanERNS_6Matrix4DataIdEERKSt6vectorIPS4_SaIS7_EERKNS2_14PartDescriptorERKNS_16cumlCommunicatorESt10shared_ptrINS_15deviceAllocatorEEPP11CUstream_stiP13cublasContext+0x4e) [0x7fba92a9760e]
#4 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg8fit_implIdEEvRNS_10cumlHandleERSt6vectorIPN8MLCommon6Matrix4DataIT_EESaISB_EERNS7_14PartDescriptorEPS9_SH_SH_SH_SH_SH_NS_9paramsPCAEPP11CUstream_stib+0x12a) [0x7fba92a4248a]
#5 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg8fit_implIdEEvRNS_10cumlHandleEPPN8MLCommon6Matrix12RankSizePairEmPPNS6_4DataIT_EEPSB_SF_SF_SF_SF_SF_NS_9paramsPCAEb+0x284) [0x7fba92a42a14]
#6 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg3fitERNS_10cumlHandleEPPN8MLCommon6Matrix12RankSizePairEmPPNS5_4DataIdEEPdSD_SD_SD_SD_SD_NS_9paramsPCAEb+0x75) [0x7fba92a29065]
#7 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/pca_mg.cpython-37m-x86_64-linux-gnu.so(+0x969e) [0x7fba8407969e]
#8 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/pca_mg.cpython-37m-x86_64-linux-gnu.so(+0xad9b) [0x7fba8407ad9b]
#9 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0x6210) [0x7fba84062210]
#10 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0xbbcb) [0x7fba84067bcb]
#11 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0xe90d) [0x7fba8406a90d]
#12 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyObject_FastCallKeywords+0x15c) [0x564ad9cc2bec]
#13 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181661) [0x564ad9cc3661]
#14 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x48a2) [0x564ad9d09762]
#15 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x564ad9c7a138]
#16 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x564ad9d06b8d]
#17 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x564ad9c7a138]
#18 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x564ad9d06b8d]
#19 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x564ad9c7ad37]
#20 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x564ad9cc3455]
#21 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x564ad9d054d1]
#22 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x564ad9c7a138]
#23 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x564ad9d06b8d]
#24 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x564ad9c7ad37]
#25 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x564ad9cc3455]
#26 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x564ad9d054d1]
#27 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x564ad9c7ad37]
#28 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x564ad9cc3455]
#29 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x564ad9d054d1]
#30 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyObject_FastCallDict+0x1b6) [0x564ad9c5cab6]
#31 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x12f071) [0x564ad9c71071]
#32 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(PyObject_Call+0xb4) [0x564ad9c5d214]
#33 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x2218e3) [0x564ad9d638e3]
#34 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x1dac27) [0x564ad9d1cc27]
#35 in /lib/x86_64-linux-gnu/libpthread.so.0(+0x76db) [0x7fbbc534f6db]
#36 in /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f) [0x7fbbc507888f]
, Exception occured! file=/conda/conda-bld/libcumlprims_1591198485167/work/cpp/build/cuml/src/cuml/cpp/src_prims/stats/sum.cuh line=94: FAIL: call='cudaPeekAtLastError()'. Reason:no kernel image is available for execution on the device
Obtained 37 stack frames
#0 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/pointer_utils.cpython-37m-x86_64-linux-gnu.so(_ZN8MLCommon9Exception16collectCallStackEv+0x3e) [0x7f0fd8e0913e]
#1 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/pointer_utils.cpython-37m-x86_64-linux-gnu.so(_ZN8MLCommon9ExceptionC1ERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE+0x80) [0x7f0fd8e09c50]
#2 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN8MLCommon5Stats3opg9mean_implIdLi256EEEvRNS_6Matrix4DataIT_EERKSt6vectorIPS6_SaIS9_EERKNS3_14PartDescriptorERKNS_16cumlCommunicatorESt10shared_ptrINS_15deviceAllocatorEEPP11CUstream_stiP13cublasContext+0xf70) [0x7f0fa317a790]
#3 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN8MLCommon5Stats3opg4meanERNS_6Matrix4DataIdEERKSt6vectorIPS4_SaIS7_EERKNS2_14PartDescriptorERKNS_16cumlCommunicatorESt10shared_ptrINS_15deviceAllocatorEEPP11CUstream_stiP13cublasContext+0x4e) [0x7f0fa317960e]
#4 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg8fit_implIdEEvRNS_10cumlHandleERSt6vectorIPN8MLCommon6Matrix4DataIT_EESaISB_EERNS7_14PartDescriptorEPS9_SH_SH_SH_SH_SH_NS_9paramsPCAEPP11CUstream_stib+0x12a) [0x7f0fa312448a]
#5 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg8fit_implIdEEvRNS_10cumlHandleEPPN8MLCommon6Matrix12RankSizePairEmPPNS6_4DataIT_EEPSB_SF_SF_SF_SF_SF_NS_9paramsPCAEb+0x284) [0x7f0fa3124a14]
#6 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/common/../../../../libcumlprims.so(_ZN2ML3PCA3opg3fitERNS_10cumlHandleEPPN8MLCommon6Matrix12RankSizePairEmPPNS5_4DataIdEEPdSD_SD_SD_SD_SD_NS_9paramsPCAEb+0x75) [0x7f0fa310b065]
#7 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/pca_mg.cpython-37m-x86_64-linux-gnu.so(+0x969e) [0x7f0f8017269e]
#8 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/pca_mg.cpython-37m-x86_64-linux-gnu.so(+0xad9b) [0x7f0f80173d9b]
#9 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0x6210) [0x7f0f8015b210]
#10 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0xbbcb) [0x7f0f80160bcb]
#11 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/lib/python3.7/site-packages/cuml/decomposition/base_mg.cpython-37m-x86_64-linux-gnu.so(+0xe90d) [0x7f0f8016390d]
#12 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyObject_FastCallKeywords+0x15c) [0x562d053d6bec]
#13 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181661) [0x562d053d7661]
#14 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x48a2) [0x562d0541d762]
#15 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x562d0538e138]
#16 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x562d0541ab8d]
#17 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x562d0538e138]
#18 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x562d0541ab8d]
#19 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x562d0538ed37]
#20 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x562d053d7455]
#21 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x562d054194d1]
#22 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallDict+0x118) [0x562d0538e138]
#23 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x1ccd) [0x562d0541ab8d]
#24 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x562d0538ed37]
#25 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x562d053d7455]
#26 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x562d054194d1]
#27 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyFunction_FastCallKeywords+0x187) [0x562d0538ed37]
#28 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x181455) [0x562d053d7455]
#29 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyEval_EvalFrameDefault+0x611) [0x562d054194d1]
#30 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(_PyObject_FastCallDict+0x1b6) [0x562d05370ab6]
#31 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x12f071) [0x562d05385071]
#32 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(PyObject_Call+0xb4) [0x562d05371214]
#33 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x2218e3) [0x562d054778e3]
#34 in /datasets/pentschev/miniconda3/envs/rn-102-0.14/bin/python(+0x1dac27) [0x562d05430c27]
#35 in /lib/x86_64-linux-gnu/libpthread.so.0(+0x76db) [0x7f10d5a326db]
#36 in /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f) [0x7f10d575b88f] |
"no kernel image is available for execution on the device" typically means that you're using an unsupported GPU architecture. Could you dump the output of |
Thanks for your replies.
|
This appears to be an issue with the compilation options for the cumlprims supporting library in 0.14, which is compiled for newer GPUs. We are working on a fix now. Thank you for reporting this! |
All good! Thanks for taking time to look into it! |
@davidnoz123 a new libcumlprims version (0.14.1) has been released for the release version that has fixed the issue. For the nightly 0.15 version there are upcoming packages that should be fixed as well in the next couple of days at the latest. |
Wow! You guys do fast service! Thanks loads! ( : |
@davidnoz123 thanks for reporting the issue, I'll tentatively close this as it's probably resolved, feel free to reopen if you still experience problems after updating libcumlprims. |
Tested and works with a new conda environment and:-
on:-
with:-
Great guys! Thanks again! |
I've cobbled together a couple of dask_cuda examples I've found on the web (see below).
I believe it should work but I get a "no kernel image is available for execution on the device" exception on line "XT = cumlModel.fit_transform(X_cudf)".
Can anyone get this working on their setup?
If so, then can you reply with your "conda list" output?
Thanks in advance!
The text was updated successfully, but these errors were encountered: