When an attn_qkv `Layer` is set with `n_fused > 1` and `reversed=False`, the shape of its sliced weight is incorrect. The root cause seems to be here:
parallelformers/parallelformers/parallel/slicing.py, lines 79 to 95 in 436573b
For an attn_qkv weight, the `dim` argument is 0. So when `reversed=False` and `n_fused > 1`, the tensor is chunked along dim 0 but then concatenated along dim 1, which makes its shape incorrect.