Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

using pytorch1.11, I still can not compile DCNv2 because of #include <ATen/ceil_div.h> #56

Open
xiao-pengyou opened this issue May 27, 2022 · 11 comments

Comments

@xiao-pengyou
Copy link

gpu 3080ti

nvidia-smi show Driver version 510.47.03 CUDA Version: 11.6
nvcc -V show Cuda compilation tools, release 11.4, V11.4.120

when I create conda env I use create -n FairMOT python=3.8
when I install pytorch ,use conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch
then I check the version , input python, import torch
torch.version 1.11 torch.version.cuda 11.3 torch.backends.cudnn.version() 8200

then input sudo python3 setup.py build develop
fatal error: ATen/ceil_div.h: 没有那个文件或目录
#include <ATen/ceil_div.h>

Is there something I do wrong?

@Natsu-Akatsuki
Copy link
Contributor

try python3 setup.py build develop without sudo...

@xiao-pengyou
Copy link
Author

try python3 setup.py build develop without sudo...

Thank you very much. I find when add sudo,the compiler use the default environment(python3.6) instead of python3.8 which I bulid for the project(python3.8)

@JehanJay
Copy link

JehanJay commented Jun 15, 2022

I do have the same error as @xiao-pengyou
fatal error: ATen/ceil_div.h: No such file or directory
In my case, I am using a conda environment developed with Pytorch LTS 1.8.2 and Cudatoolkit 11.1 Nvidia 3050Ti

try python3 setup.py build develop without sudo...

@Natsu-Akatsuki Tried this way, but still, the issue remains.

What exactly would be the bug causing this error?

@Natsu-Akatsuki
Copy link
Contributor

Try to use the previous version. i.e. git checkout fa9b2fd
Please read the README carefully.

@JehanJay
Copy link

I do have the same error as @xiao-pengyou fatal error: ATen/ceil_div.h: No such file or directory In my case, I am using a conda environment developed with Pytorch LTS 1.8.2 and Cudatoolkit 11.1 Nvidia 3050Ti

try python3 setup.py build develop without sudo...

@Natsu-Akatsuki Tried this way, but still, the issue remains.

What exactly would be the bug causing this error?

However, DCnv2 got successfully compiled with Cudatoolkit 11.3 with PyTorch stable 1.11.0.

@Natsu-Akatsuki
Copy link
Contributor

What I mean is that the lateset version of DCNv2 only support pytorch 1.11.+.
On the other hand, when you use pytorch 1.8.2 you need to use previous version.

@xiao-pengyou
Copy link
Author

What I mean is that the lateset version of DCNv2 only support pytorch 1.11.+. On the other hand, when you use pytorch 1.8.2 you need to use previous version.

You can find the latest DCNv2 in the issue ,which support Pytorch 1.12, or other version support 1.6, 1.7, 1.8.

@JehanJay
Copy link

What I mean is that the lateset version of DCNv2 only support pytorch 1.11.+. On the other hand, when you use pytorch 1.8.2 you need to use previous version.

For the previous version of DCNv2, PyTorch 1.8.2 was not supported and supported only versions <=1.7.0.
The issue with that was, PyTorch 1.7.0 is not supported with Cudatoolkit 11.1 which is the oldest compatible version with CUDA SDK 8.6.
So as far as I understood the issue, conceptually the previous DCNv2 is no longer supported with newer GPU architectures.

@DhruvKoolRajamani
Copy link

Just add #ifdef macro flags to check for the version of PyTorch before the ATen library include.

@Albert337
Copy link

What I mean is that the lateset version of DCNv2 only support pytorch 1.11.+. On the other hand, when you use pytorch 1.8.2 you need to use previous version.

For the previous version of DCNv2, PyTorch 1.8.2 was not supported and supported only versions <=1.7.0. The issue with that was, PyTorch 1.7.0 is not supported with Cudatoolkit 11.1 which is the oldest compatible version with CUDA SDK 8.6. So as far as I understood the issue, conceptually the previous DCNv2 is no longer supported with newer GPU architectures.

my docker environment is pytorch 1.8.0 and cuda 11.1, i met the same error
image
as you mean,i need change pytorch to 1.7.0 version?

@cvcvcccq0912
Copy link

What I mean is that the lateset version of DCNv2 only support pytorch 1.11.+. On the other hand, when you use pytorch 1.8.2 you need to use previous version.

For the previous version of DCNv2, PyTorch 1.8.2 was not supported and supported only versions <=1.7.0. The issue with that was, PyTorch 1.7.0 is not supported with Cudatoolkit 11.1 which is the oldest compatible version with CUDA SDK 8.6. So as far as I understood the issue, conceptually the previous DCNv2 is no longer supported with newer GPU architectures.

my docker environment is pytorch 1.8.0 and cuda 11.1, i met the same error image as you mean,i need change pytorch to 1.7.0 version?

I'm having the same problem as you, as are the pytorch and cuda versions. But I can't lower my pytoch version any more. Have you solved your problem? How to solve it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants