You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for the great library.
I am not so familiar with the DMRG algorithm and block2 implementation so I am curious if there is a GPU implementation of the code. Is there any effort on it from some other branches/developers or maybe the GPU-enhanced code would solve no problem even if it exists ...?
Thank you for your time.
The text was updated successfully, but these errors were encountered:
Currently there is no GPU implementation of the code. We may consider it at some later time, maybe next year.
Internally, block2 relies heavily on the block-sparse tensor algebra, this data structure is essential for achieving high efficiency in CPU. In these block-sparse tensors, the blocks can have very different (big and small) sizes. For GPU, it is more advantageous to have either very big tensors (so that one can directly call cuTENSOR functions) or many small tensors with the same size (so that it can be easily parallelized using a GPU kernel). Unfortunately, these are not the cases for block2.
Therefore, a GPU support in block2 is a non-trivial thing. But clearly it is an interesting research project. We will put some efforts to make it possible. If you have a long term interest on this, you may leave this issue open and I will keep you informed of any progress.
Hello,
Thank you for the great library.
I am not so familiar with the DMRG algorithm and block2 implementation so I am curious if there is a GPU implementation of the code. Is there any effort on it from some other branches/developers or maybe the GPU-enhanced code would solve no problem even if it exists ...?
Thank you for your time.
The text was updated successfully, but these errors were encountered: