Can GRU be replaced with Conv layers? #258

Open
Stephen-K1 opened this issue Nov 27, 2023 · 1 comment

Comments

@Stephen-K1

  1. In the RVM model, the GRU layer accounts for a large share of the computation. It is natural to ask: would it be better to replace the GRU layer with a Conv layer of the same computational cost? A simple 'yes' or 'no' would be greatly appreciated.

  2. I have recently been trying to implement a high-performance matting model. I have read many recently proposed video matting papers and tested their matting performance. Even though RVM was proposed two years ago, it is the best open-source model (with training code included) in my tests. Could you share some tips for improving RVM's performance? I believe you have many good ideas worth trying, and any insights you can share here would be greatly appreciated. Thank you very much!

@PeterL1n
Owner

  1. No. The whole point of our research is to replace conv with GRU. The GRU's recurrent architecture allows the model to analyze the video sequence with temporal memory. If you replace it with Conv, the model will treat each frame independently, and the output will flicker.

  2. I have not been following matting research lately, but here are some ideas off the top of my head:

  • Use a transformer instead of ConvGRU to model temporal relations.
  • Use a better backbone based on ViT, such as DINOv2.
  • Treat matting as a generative task, e.g. with a diffusion objective.
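
To make point 1 concrete, here is a minimal sketch of the difference between a recurrent layer and a layer applied per frame. It is not RVM's actual ConvGRU: it uses a scalar GRU cell in plain Python, and every weight (all set to 1.0), the `per_frame` stand-in for a conv layer, and the two toy clips are assumptions chosen purely for illustration. The two clips end on the same frame but have different histories.

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def gru_step(x: float, h: float) -> float:
    """One step of a scalar GRU. All weights are 1.0 (hypothetical values)."""
    z = sigmoid(x + h)                    # update gate
    r = sigmoid(x + h)                    # reset gate
    h_candidate = math.tanh(x + r * h)    # candidate state
    return (1.0 - z) * h + z * h_candidate


def per_frame(x: float) -> float:
    """Stateless stand-in for a conv layer that sees only the current frame."""
    return math.tanh(x)


def run_gru(frames):
    """Feed frames in order, carrying the hidden state across time."""
    h = 0.0
    outputs = []
    for x in frames:
        h = gru_step(x, h)
        outputs.append(h)
    return outputs


def run_per_frame(frames):
    """Process every frame on its own, with no memory of earlier frames."""
    return [per_frame(x) for x in frames]


# Two toy "clips" with the same last frame but different earlier frames.
clip_a = [0.9, 0.9, 0.9, 0.1]
clip_b = [-0.9, -0.9, -0.9, 0.1]

stateless_a = run_per_frame(clip_a)[-1]
stateless_b = run_per_frame(clip_b)[-1]
recurrent_a = run_gru(clip_a)[-1]
recurrent_b = run_gru(clip_b)[-1]

print("per-frame outputs on last frame:", stateless_a, stateless_b)
print("recurrent outputs on last frame:", recurrent_a, recurrent_b)
```

The per-frame outputs on the last frame are identical because that layer cannot see the history, while the recurrent outputs differ because the hidden state carries information from earlier frames. That carried state is what lets a recurrent model keep its predictions consistent over time.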
