How to use the pre-trained model to generate images from specific captions? #8

Astatine-213-Tian · 2020-06-15T13:58:21Z

For example, how can I generate an image that corresponds to the caption "a person skateboarding in the street with some people looking on"?

Astatine-213-Tian · 2020-06-16T09:39:01Z

Could you please upload the code you use to get the images presented in your paper? Thanks!

tohinz · 2020-06-17T12:05:20Z

Hi, the easiest way of doing this is to go through the captions in the validation set (captions.pickle) until you find the sentence for which you want to generate images. If you load the captions the same way the dataloader does, you can iterate through them directly and decode each one into a sentence until you find the correct caption:

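# Decode the first caption of the batch from word indices back into text
cap = captions[0].data.cpu().numpy()
sentence = ""
for j in range(len(cap)):
    # Index 0 marks the end of the caption, so stop there
    if cap[j] == 0:
        break
    # Look up the word and drop any non-ASCII characters
    word = self.ixtoword[cap[j]].encode('ascii', 'ignore').decode('ascii')
    sentence += word + " "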
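
The search described above can be sketched as a standalone helper. This is a minimal sketch, not code from the repository: `decode_caption` and `find_caption` are hypothetical names, the toy `ixtoword` and `captions` stand in for the real vocabulary and validation captions, and it assumes captions are stored as lowercase word indices terminated by index 0.

```python
def decode_caption(cap, ixtoword):
    """Turn a sequence of word indices into a sentence, stopping at index 0."""
    words = []
    for ix in cap:
        if ix == 0:
            break
        words.append(ixtoword[int(ix)])
    return " ".join(words)


def find_caption(captions, ixtoword, target):
    """Return the positions of all captions whose decoded text equals target."""
    # Normalize case and whitespace so the comparison is not thrown off by them
    target = " ".join(target.lower().split())
    return [i for i, cap in enumerate(captions)
            if decode_caption(cap, ixtoword) == target]


# Toy data (assumption) standing in for the real vocabulary and captions
ixtoword = {0: "<end>", 1: "a", 2: "person", 3: "skateboarding", 4: "dog"}
captions = [[1, 4, 0, 0], [1, 2, 3, 0]]

matches = find_caption(captions, ixtoword, "a person skateboarding")
print(matches)
```

The returned positions can then be used to pick the matching sample, together with its bounding boxes and labels, from the validation set.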

Astatine-213-Tian · 2020-06-19T08:58:52Z

Hi, thanks for your response. Can I generate images from my own sentences in the same way?

tohinz · 2020-06-22T13:47:32Z

Hi, have a look at how they do it in the original AttnGAN here: https://github.com/taoxugit/AttnGAN/blob/0d000e652b407e976cb88fab299e8566f3de8a37/code/main.py#L146
You can use it in the same way for our model since we use the same text encoder. You'll need to provide the bounding boxes and object labels for our model though, or use another network that predicts these from the caption.
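
For a custom sentence, the first step is turning the text into word indices that the text encoder understands. The sketch below illustrates that step only and is not the linked AttnGAN code: `encode_sentence` and `pad_captions` are hypothetical helpers, the toy `wordtoix` stands in for the vocabulary stored with the captions, and a plain regular expression is used as the tokenizer (an assumption). Words missing from the vocabulary are skipped, so they have no effect on the generated image.

```python
import re


def encode_sentence(sentence, wordtoix):
    """Map a raw sentence to vocabulary indices, skipping unknown words."""
    tokens = re.findall(r"\w+", sentence.lower())
    indices = []
    for tok in tokens:
        # Drop non-ASCII characters, mirroring the decoding snippet above
        tok = tok.encode("ascii", "ignore").decode("ascii")
        if tok and tok in wordtoix:
            indices.append(wordtoix[tok])
    return indices


def pad_captions(encoded, pad_index=0):
    """Right-pad index lists to a common length and return their true lengths."""
    lengths = [len(c) for c in encoded]
    max_len = max(lengths)
    batch = [c + [pad_index] * (max_len - len(c)) for c in encoded]
    return batch, lengths


# Toy vocabulary (assumption); the real one is loaded with the captions
wordtoix = {"<end>": 0, "a": 1, "person": 2, "skateboarding": 3, "street": 4}

sentences = ["A person skateboarding!", "a zebra in the street"]
encoded = [encode_sentence(s, wordtoix) for s in sentences]
batch, lengths = pad_captions(encoded)
print(batch, lengths)
```

The padded batch and the lengths would then be converted to tensors and passed to the text encoder. The bounding boxes and object labels still have to be supplied separately, as noted above.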

RoshanTanisha · 2020-06-22T13:51:13Z

Hello,

Thanks for the code. Do you have any pointers to a model that predicts bounding boxes and labels from a caption? For bounding boxes, I found this in another issue: https://github.com/jamesli1618/Obj-GAN/, but I have not been able to find anything for labels.

Thanks.

tohinz · 2020-06-23T09:53:49Z

I haven't used it personally, but something like LayoutVAE or Seq-SG2SL and their related work should help with this. I don't know which papers have available implementations, so you might have to check the related work of these papers, too.

savitha91 · 2020-08-04T16:14:05Z

Hi, neither of the papers you mentioned (LayoutVAE and Seq-SG2SL) has code available. Can you please suggest how to get bounding boxes (a semantic layout) from the caption?

tohinz · 2020-08-05T08:51:41Z

Hi, you can use the code from https://github.com/jamesli1618/Obj-GAN/ to get bounding boxes + object labels from the captions. They have a model pretrained on COCO, which should be a good starting point for most settings.

savitha91 · 2020-08-05T09:13:21Z

Hi, I checked the code at https://github.com/jamesli1618/Obj-GAN/tree/master/box_generation/bbox_proc, which is described as extracting the bounding box information. I am looking into it. Thanks.

savitha91 · 2020-08-06T08:15:01Z

Hi Tobias, I would like to know whether you have checked the bbox_proc code for bounding box generation. I have raised a query in the Obj-GAN repo: jamesli1618/Obj-GAN#24. It would be great if you could help me with sample code to generate a file similar to 'input_val2014.txt', so that I can use the semantic object model.

Astatine-213-Tian changed the title ~~How to use the pre-trained model to generate image from specific captions?~~ How to use the pre-trained model to generate images from specific captions? Jun 15, 2020

tohinz mentioned this issue Jul 3, 2020

generate images from arbitrary strings? tohinz/multiple-objects-gan#8

Open

gitraffica mentioned this issue Apr 27, 2021

About sampling #17

Closed
