Bounding boxes: VOC to darknet transform - why minus 1? #4565

umbrellait-maksim-titarenko · 2019-12-21T13:05:23Z

Hello!

Exploring the issue of converting annotations for custom model training, I discovered that we apply the following transform to the VOC (see: scripts/voc_label.py):

    x = (box[0] + box[1])/2.0 - 1
    y = (box[2] + box[3])/2.0 - 1

where:

bbox[:4] are "bndbox"-values: xmin, xmax, ymin, ymax - parsed from VOC xml-files,
x, y assumed to be a coordinate of the center of "bndbox".

I'm a bit confused with that since:
e.g. we have a box with xmin, xmax = 10, 20
in this case, intuitively, the x-center should be at the point = 15
however, the code above will give us the value 14.0

My question is:

Why do we apply those "minus 1"?
Do the VOC-annotation coordinates differ from the generally accepted ones (0,0 - for the left top corner)?

The text was updated successfully, but these errors were encountered:

umbrellait-maksim-titarenko · 2019-12-21T16:27:03Z

I see there was the commit to apply the "minus 1" operations:

@AlexeyAB, can you please comment on why was this done?

Sarah20187 · 2024-11-24T07:02:42Z

same question here， Thanks！

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bounding boxes: VOC to darknet transform - why minus 1? #4565

Bounding boxes: VOC to darknet transform - why minus 1? #4565

umbrellait-maksim-titarenko commented Dec 21, 2019

umbrellait-maksim-titarenko commented Dec 21, 2019

Sarah20187 commented Nov 24, 2024

Bounding boxes: VOC to darknet transform - why minus 1? #4565

Bounding boxes: VOC to darknet transform - why minus 1? #4565

Comments

umbrellait-maksim-titarenko commented Dec 21, 2019

umbrellait-maksim-titarenko commented Dec 21, 2019

Sarah20187 commented Nov 24, 2024