[BUG] Error message "Boolean is not supported" from NYCTaxi-E2E notebook #213

Open
tym1062 opened this issue Oct 22, 2019 · 3 comments
Labels
bug Something isn't working

Comments


tym1062 commented Oct 22, 2019

Describe the bug
The following stack trace is reported after executing cell 14 of the NYCTaxi-E2E notebook in the RAPIDS 0.10 container:

XGBoostError: [14:31:50] /conda/conda-bld/xgboost_1571337679414/work/src/data/simple_csr_source.cu:161: Boolean is not supported.
Stack trace:
  [bt] (0) /opt/conda/envs/rapids/lib/libxgboost.so(+0xc9594) [0x7fe4c8a83594]
  [bt] (1) /opt/conda/envs/rapids/lib/libxgboost.so(xgboost::data::SimpleCSRSource::FromDeviceColumnar(std::vector<xgboost::Json, std::allocator<xgboost::Json> > const&, bool, float)+0x743) [0x7fe4c8c66443]
  [bt] (2) /opt/conda/envs/rapids/lib/libxgboost.so(xgboost::data::SimpleCSRSource::CopyFrom(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, bool, float)+0xc74) [0x7fe4c8ade9e4]
  [bt] (3) /opt/conda/envs/rapids/lib/libxgboost.so(XGDMatrixCreateFromArrayInterfaces+0x1c8) [0x7fe4c8a91b08]
  [bt] (4) /opt/conda/envs/rapids/lib/python3.6/lib-dynload/../../libffi.so.6(ffi_call_unix64+0x4c) [0x7fe7e0569630]
  [bt] (5) /opt/conda/envs/rapids/lib/python3.6/lib-dynload/../../libffi.so.6(ffi_call+0x22d) [0x7fe7e0568fed]
  [bt] (6) /opt/conda/envs/rapids/lib/python3.6/lib-dynload/_ctypes.cpython-36m-x86_64-linux-gnu.so(_ctypes_callproc+0x2ce) [0x7fe7e058000e]
  [bt] (7) /opt/conda/envs/rapids/lib/python3.6/lib-dynload/_ctypes.cpython-36m-x86_64-linux-gnu.so(+0x13a45) [0x7fe7e0580a45]
  [bt] (8) /opt/conda/envs/rapids/bin/python(_PyObject_FastCallDict+0x8b) [0x558414ded7bb]

Steps/Code to reproduce bug
Download https://github.com/rapidsai/notebooks-contrib/blob/master/intermediate_notebooks/E2E/taxi/NYCTaxi-E2E.ipynb to /raid

docker run --gpus all --rm -it -p 8888:8888 -p 8787:8787 -p 8786:8786 -v /raid:/rapids/data rapidsai/rapidsai:cuda10.1-runtime-ubuntu18.04

Once inside the container, start JupyterLab:

cd /rapids; ./utils/start_jupyter.sh

Open a browser at IP-Address:8888, then open the NYCTaxi-E2E notebook and execute its cells. After cell 14 is executed, the traceback above appears.

Expected behavior
I suspect an extra step needs to be added to the NYCTaxi notebook to update the data passed to the DMatrix before the XGBoost train method is called.

Environment details (please complete the following information):

Here is the conda package list for rapids packages from inside the RAPIDS 0.10 container:

conda list |grep rapids
cudf                      0.10.0                   py36_0    rapidsai
cugraph                   0.10.0                   py36_0    rapidsai
cuml                      0.10.0          cuda10.1_py36_0    rapidsai
cupy                      6.3.0            py36hf7e11e9_0    rapidsai
cuspatial                 0.10.0                   py36_0    rapidsai
dask-cuda                 0.10.0                   py36_0    rapidsai
dask-cudf                 0.10.0                   py36_0    rapidsai
dask-xgboost              0.2.0.dev28      cuda10.1py36_0    rapidsai
libcudf                   0.10.0               cuda10.1_0    rapidsai
libcugraph                0.10.0               cuda10.1_0    rapidsai
libcuml                   0.10.0               cuda10.1_0    rapidsai
libcumlprims              0.10.0a191017       cuda10.1_15    rapidsai-nightly
libcuspatial              0.10.0               cuda10.1_0    rapidsai
libnvstrings              0.10.0               cuda10.1_0    rapidsai
librmm                    0.10.0               cuda10.1_0    rapidsai
libxgboost                1.0.0.SNAPSHOT       cuda10.1_1    rapidsai
nvstrings                 0.10.0                   py36_0    rapidsai
py-xgboost                1.0.0.SNAPSHOT   cuda10.1py36_1    rapidsai
rapids                    0.10.0          cuda10.1_py36_0    rapidsai
rapids-xgboost            0.10.0          cuda10.1_py36_0    rapidsai
rmm                       0.10.0                   py36_0    rapidsai
xgboost                   1.0.0.SNAPSHOT   cuda10.1py36_1    rapidsai
  • Environment location: [Docker]
  • Method of RAPIDS libraries install: [Docker]
    • If method of install is [Docker], provide docker pull & docker run commands used
      docker pull rapidsai/rapidsai:cuda10.1-runtime-ubuntu18.04
      docker run --gpus all --rm -it -p 8888:8888 -p 8787:8787 -p 8786:8786 \
        rapidsai/rapidsai:cuda10.1-runtime-ubuntu18.04
    • If other methods, provide the commands used


tym1062 added the bug label Oct 22, 2019

vilmara commented Oct 23, 2019

Hi @tym1062, take a look at issue #214.

@taureandyernv
Contributor

@tym1062 @vilmara this has been escalated, and we may have a fix in the nightlies by the end of the week. Thanks so much!!

@trivialfis

Currently XGBoost doesn't support consuming boolean values from cuDF, because the boolean type is specialized to a bitset, following Arrow. The workaround is to convert the column to an integer or float type first. For those curious why all data types are supported for pandas input: there we convert the data to float in Python. GPU memory is quite precious, so I decided not to do that conversion for cuDF input, to avoid implicit memory consumption.
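
The workaround described above could be sketched roughly as follows. This is only an illustration, not the notebook's actual fix: `cast_bool_columns` is a hypothetical helper, `float32` is an arbitrary choice of target type, and the helper assumes the DataFrame exposes the pandas-style `copy()`, `dtypes`, and `astype()` interface (which cuDF is assumed to mirror here).

```python
def cast_bool_columns(df, target="float32"):
    """Return a copy of `df` with every boolean column cast to `target`.

    Hypothetical helper: works on any DataFrame-like object that offers
    pandas-style `copy()`, `dtypes` and column `astype()`.
    """
    out = df.copy()
    # Collect the names first so the frame is not modified while iterating.
    bool_cols = [name for name, dtype in out.dtypes.items() if str(dtype) == "bool"]
    for name in bool_cols:
        # Explicit cast, so the extra memory use is visible to the caller.
        out[name] = out[name].astype(target)
    return out


# Possible usage in the notebook before building the DMatrix
# (X_train and y_train are placeholder names, not taken from the notebook):
#
#   import xgboost as xgb
#   X_train = cast_bool_columns(X_train)
#   dtrain = xgb.DMatrix(X_train, label=y_train)
```

Doing the cast explicitly keeps the memory cost under the user's control, which matches the reasoning given for not converting implicitly.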
