I know that n_vocab is the total number of tokens in the encoder dictionary. But when I saw vocab = n_vocab + n_special + n_ctx, I was confused. Maybe n_special is for the start, delimiter, and classify tokens, but what is n_ctx? Why add these 3 things? (Also, why is there so little commenting on variables and functions? Is there somewhere else to see an explanation of the code?) I am new to learning about the transformer.
n_ctx is the maximum number of tokens in an input sequence.
n_special is the number of special tokens used to format the input properly. For example, in the ROCStories problem we use 3 additional tokens: start, delimiter, and classify.
n_vocab is the number of actual vocabulary tokens (the encoder dictionary).
The reason for adding the three is that each token in a sequence also needs a position encoding. Here learned position embeddings are used (similar to word embeddings) and they are stored in the same embedding matrix, so the matrix needs n_ctx extra rows, one per position.
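As a minimal sketch of the idea (not the repository's actual code; the sizes, the matrix name we, and the helper embed are illustrative assumptions), the snippet below shows why the embedding matrix has n_vocab + n_special + n_ctx rows: word/special tokens index the first rows, and position ids index the last n_ctx rows of the same matrix, with the two lookups summed.

```python
import numpy as np

# Hypothetical sizes for illustration only
n_vocab   = 40000   # BPE vocabulary tokens
n_special = 3       # start, delimiter, classify
n_ctx     = 77      # maximum sequence length
n_embd    = 768     # embedding dimension

# One matrix holds word, special, AND position embeddings,
# hence the first dimension of n_vocab + n_special + n_ctx.
we = np.random.randn(n_vocab + n_special + n_ctx, n_embd) * 0.02

def embed(token_ids):
    """Sum token embeddings with position embeddings looked up
    from the same matrix (a sketch of the idea)."""
    seq_len = len(token_ids)
    # Position ids point at the last n_ctx rows of the matrix.
    position_ids = n_vocab + n_special + np.arange(seq_len)
    return we[token_ids] + we[position_ids]

# Example: a 5-token input (including two special-token ids)
h = embed(np.array([11, 42, 7, n_vocab, n_vocab + 1]))
print(h.shape)  # (5, 768)
```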