Reply: Hi, for question (1), the exponent 2*(dim_t//2) is divided by 128 because the position embedding is computed on the x and y directions separately and then concatenated, so each direction uses 128 of the 256 dimensions.
Hi, thanks for sharing your wonderful work.
I have a question here:
ConditionalDETR/models/transformer.py
Line 33 in ead865c
which embeds positional information in the query_pos.
However, I don't understand why the exponent 2*(dim_t//2) has to be divided by 128 instead of the actual dimension of pos_tensor (e.g., 256 by default).
ConditionalDETR/models/transformer.py
Line 38 in ead865c
Does it still work correctly when dim_t is divided by 128? I would appreciate being corrected!
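For context, here is a simplified NumPy sketch of this style of 2D sinusoidal embedding (the function name and details are illustrative, not the repo's exact code). Each normalized coordinate is embedded into 128 dimensions, and the x/y halves are concatenated into 256 — which is why the exponent is scaled by 128 rather than by the full dimension of pos_tensor:

```python
import numpy as np

def sine_embed_2d(pos, num_pos_feats=128, temperature=10000.0):
    """Illustrative sketch of a 2D sinusoidal position embedding.

    pos: array of shape (N, 2) with normalized (x, y) in [0, 1].
    Each axis is embedded into num_pos_feats dims, then the two halves
    are concatenated, giving 2 * num_pos_feats (= 256 by default) total.
    """
    scale = 2 * np.pi
    dim_t = np.arange(num_pos_feats, dtype=np.float64)
    # 2 * (dim_t // 2) pairs up adjacent sin/cos channels at the same
    # frequency; dividing by num_pos_feats (128) spreads the frequencies
    # over the per-axis dimension, not the full concatenated dimension.
    dim_t = temperature ** (2 * (dim_t // 2) / num_pos_feats)
    x = pos[:, 0:1] * scale / dim_t  # (N, num_pos_feats)
    y = pos[:, 1:2] * scale / dim_t
    # sin on even channels, cos on odd channels
    emb_x = np.empty_like(x)
    emb_x[:, 0::2] = np.sin(x[:, 0::2])
    emb_x[:, 1::2] = np.cos(x[:, 1::2])
    emb_y = np.empty_like(y)
    emb_y[:, 0::2] = np.sin(y[:, 0::2])
    emb_y[:, 1::2] = np.cos(y[:, 1::2])
    return np.concatenate([emb_y, emb_x], axis=1)  # (N, 2 * num_pos_feats)
```

With the default num_pos_feats=128, a batch of (x, y) positions maps to 256-dim embeddings, matching the default model dimension.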
And another question: when we compute equation (1) in the paper,
ConditionalDETR/models/conditional_detr.py
Line 89 in ead865c
can I understand that the model learns "offsets" from the corresponding reference points?
What is the precise role of the reference points?
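On the offsets question: as I understand it, the box head predicts unbounded offsets in the unsigmoided (logit) space, which are added to the inverse-sigmoid of the reference point and then squashed back into [0, 1], so the reference point acts as an anchor the prediction is relative to. A minimal NumPy sketch of that idea (function names are my own, not the repo's):

```python
import numpy as np

def inverse_sigmoid(x, eps=1e-5):
    # Map a probability in (0, 1) back to logit space.
    x = np.clip(x, eps, 1 - eps)
    return np.log(x / (1 - x))

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def predict_box_center(pred_offset_logits, reference_points):
    """reference_points: (N, 2) in [0, 1].

    The head's offsets are added to the reference point in logit space,
    then squashed back to [0, 1]; a zero offset reproduces the
    reference point itself.
    """
    return sigmoid(pred_offset_logits + inverse_sigmoid(reference_points))
```

So the reference points give each query a spatial starting location, and the network only needs to learn a relative correction around it.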
Thank you!