You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
hello,
I'm trying training the feature extraction block with the code you have released,
but i find if we use (4, 3, 10000) as the input,
the PointNetSetAbstraction layer could takes too much gpu memory (over 12GB by 3080Ti),
as the dist_matrix could be (4, 10000, 10000) if the npoints set to 10000,
I'm wondering if you have any good idea to solve this problem.
thanks too much !
The text was updated successfully, but these errors were encountered:
I have find the solution for this problem;
main question is the inefficiency of the multiply matrix and QueryAndGroup operation,
so delicate CUDA ops may helps for this;
This version of Pointnet2 can solve this:
git clone https://github.com/sshaoshuai/Pointnet2.PyTorch.git
if you use CUDA 11, a pull request version of this may also be used.
After replaced the code, a tensor with shape of [16, 10000, 10000] can be trained in 12GB device.
thanks a lot !
hello,
I'm trying training the feature extraction block with the code you have released,
but i find if we use (4, 3, 10000) as the input,
the PointNetSetAbstraction layer could takes too much gpu memory (over 12GB by 3080Ti),
as the dist_matrix could be (4, 10000, 10000) if the npoints set to 10000,
I'm wondering if you have any good idea to solve this problem.
thanks too much !
The text was updated successfully, but these errors were encountered: