[FEATURE] Training details for MLP-based models #708

dalistarh · 2021-06-17T09:38:10Z

Hi,

Thanks for your useful work!

I would like to fine-tune the MLP-based models (gMLP and ResMLP), and I was wondering if you could please provide some additional details on training parameters, in particular:

- augmentations used
- optimizer and hyper-parameters (if possible)

I plan to do training on GPUs.

Many thanks,
Dan

rwightman · 2021-06-17T19:38:05Z

Maintainer

@dalistarh I've trained a ResMLP from scratch and got close to the paper's results. My gMLP attempt failed, probably due to init issues; I've since improved the init but have yet to rerun. The hparams were based on the gMLP paper, which basically uses the DeiT hparams minus 'repeated aug': https://github.com/facebookresearch/deit
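
For orientation, here is a minimal sketch of what "DeiT hparams minus repeated aug" could look like as a config. The values are the defaults published in the DeiT paper as commonly cited, not the settings actually used in this thread. The dictionary keys and the `scaled_lr` helper are hypothetical names chosen for illustration, so treat the whole block as an assumption to check against the DeiT repository.

```python
# Sketch only: DeiT-paper defaults (assumed), with repeated augmentation disabled.
# Key names are illustrative and do not correspond to any specific CLI flags.
DEIT_STYLE_HPARAMS = {
    "optimizer": "adamw",
    "base_lr": 5e-4,           # reference LR, defined for a batch size of 512
    "base_batch_size": 512,
    "weight_decay": 0.05,
    "epochs": 300,
    "warmup_epochs": 5,
    "lr_schedule": "cosine",
    "label_smoothing": 0.1,
    "drop_path": 0.1,          # stochastic depth rate
    "rand_augment": {"magnitude": 9, "magnitude_std": 0.5},
    "mixup": 0.8,
    "cutmix": 1.0,
    "random_erasing_prob": 0.25,
    "repeated_aug": False,     # the one change mentioned in the comment above
}


def scaled_lr(batch_size, hparams=DEIT_STYLE_HPARAMS):
    """Linear LR scaling rule: lr = base_lr * batch_size / base_batch_size."""
    return hparams["base_lr"] * batch_size / hparams["base_batch_size"]


if __name__ == "__main__":
    # Learning rate for a global batch size of 1024 under the scaling rule.
    print(scaled_lr(1024))
```

The linear scaling rule matters when fine-tuning on fewer GPUs: a smaller global batch size implies a proportionally smaller learning rate.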

1 reply

dalistarh Jun 19, 2021
Author

Thank you, I will give this a try. Can I ask what your params were for the ResMLP scratch run?

rwightman · 2021-06-23T18:35:43Z

Maintainer

@dalistarh https://gist.github.com/rwightman/d6c264a9001f9167e06c209f630b2cc6

Aguin · 2021-08-11T07:26:24Z

facebookresearch/deit#106 (comment)

BuaaAlban · 2021-08-31T10:26:20Z

I have a question about using ResMLP for machine translation. The dimension of the cross-patch FC layer depends on the fixed size of the input images in CV, but in MT the sequence length is not fixed. How can this be handled, and what should be done during training and inference?
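
To make the constraint concrete, here is a small dependency-free sketch of a cross-patch (token-mixing) linear layer. Its weight matrix has shape `(seq_len, seq_len)`, so it only accepts inputs of exactly that length. The `pad_or_truncate` helper shows one possible workaround, padding or truncating every sequence to a fixed maximum length. That workaround is an assumption for illustration, not an answer given in this thread, and all function names are hypothetical.

```python
import random


def make_cross_patch_weights(seq_len, seed=0):
    """Weight matrix of shape (seq_len, seq_len) that mixes across positions."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(seq_len)]
            for _ in range(seq_len)]


def cross_patch_mix(tokens, weights):
    """Apply the cross-patch linear layer.

    tokens: list of seq_len vectors, each of length dim.
    Returns a list with the same shape; output position i is a weighted
    sum over all input positions j, computed independently per channel.
    """
    seq_len = len(weights)
    if len(tokens) != seq_len:
        raise ValueError(
            f"expected {seq_len} tokens, got {len(tokens)}: "
            "the layer is tied to one sequence length"
        )
    dim = len(tokens[0])
    return [
        [sum(weights[i][j] * tokens[j][c] for j in range(seq_len))
         for c in range(dim)]
        for i in range(seq_len)
    ]


def pad_or_truncate(tokens, max_len, dim):
    """Assumed workaround: force every sequence to max_len using zero vectors."""
    tokens = tokens[:max_len]
    return tokens + [[0.0] * dim for _ in range(max_len - len(tokens))]


if __name__ == "__main__":
    MAX_LEN, DIM = 8, 4
    weights = make_cross_patch_weights(MAX_LEN)
    sentence = [[1.0] * DIM for _ in range(5)]  # a 5-token "sentence"
    mixed = cross_patch_mix(pad_or_truncate(sentence, MAX_LEN, DIM), weights)
    print(len(mixed), len(mixed[0]))
```

Note that in this sketch the padded positions still receive mixed-in content from real tokens, and no masking is applied, so a real MT model would need to decide how to treat them.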
