
change TextTokenizer 2DConvolution to 1D #52

Open · wants to merge 1 commit into main

Conversation

simonlevine

Hello,

It seems more intuitive to use a 1D convolution here over the embedding, with the channel count equal to the word embedding dimension, rather than the edge case of a 2D convolution as currently implemented. I would personally make this change to match other networks that apply similar convolutions over `nn.Embedding` outputs. I believe this does not change performance; it is proposed for clarity. Thank you
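To illustrate the claim, here is a minimal pure-Python sketch (not the repo's actual code; shapes and names are illustrative assumptions) checking that a 2D convolution whose kernel spans the full embedding dimension produces the same values as a 1D convolution with `in_channels` equal to the embedding dimension:

```python
# Sketch: a 2D conv over a (seq_len, emb_dim) "image" with kernel height equal
# to emb_dim is numerically the same as a 1D conv with emb_dim input channels.
import random

random.seed(0)
seq_len, emb_dim, kernel = 6, 4, 3

# Embedded token sequence: seq_len x emb_dim.
x = [[random.random() for _ in range(emb_dim)] for _ in range(seq_len)]

# One output filter, shared by both formulations: shape (kernel, emb_dim).
w = [[random.random() for _ in range(emb_dim)] for _ in range(kernel)]

def conv2d_edge_case(x, w):
    """2D conv on a 1-channel (seq_len, emb_dim) input whose kernel width
    spans emb_dim; the output collapses to a sequence of length
    seq_len - kernel + 1."""
    out = []
    for t in range(len(x) - len(w) + 1):
        s = 0.0
        for i in range(len(w)):
            for j in range(len(w[0])):
                s += x[t + i][j] * w[i][j]
        out.append(s)
    return out

def conv1d_channels(x, w):
    """Equivalent 1D conv: emb_dim input channels, kernel size `kernel`;
    channel c of the 1D weight at tap i is w[i][c]."""
    xt = list(map(list, zip(*x)))  # transpose to (channels, time)
    out = []
    for t in range(len(x) - len(w) + 1):
        s = 0.0
        for c in range(len(xt)):
            for i in range(len(w)):
                s += xt[c][t + i] * w[i][c]
        out.append(s)
    return out

a = conv2d_edge_case(x, w)
b = conv1d_channels(x, w)
assert all(abs(p - q) < 1e-9 for p, q in zip(a, b))
```

The two loops accumulate exactly the same products in a different order, which is why the change should be a pure refactor with no effect on the computed features.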

@alihassanijr
Member

Hello,

Thank you, yes, it would not result in any significant difference; at least that is what we observed in our experiments.
I'll check this PR and create separate models with 1D convs, because I think we would hit a key mismatch with our old checkpoints if we replaced the existing models.
