-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About grad-cam visualization #2
Comments
Thank you for your interest in our work. We referred to the Grad-CAM visualization code in this work: https://github.com/salesforce/ALBEF/blob/main/visualization.ipynb. If you have further questions about Grad-CAM visualization, I will also be glad to help you. |
Thank you very much for your effective response! I had previously referred to the work you provided, but due to my limited coding skills, I didn't know where to start making modifications. Although I feel this might be asking for something without much effort on my part, I still want to ask if you could share your source code. This would greatly help me understand and apply the concepts in the future. Of course, if it's inconvenient for you to share, that's perfectly fine. And thank you very much for your response :) |
@gordonhu608 Following the GradCam Questions.
I would reproduce GradCam on your brilliant work next week. I just want to make sure my pipeline is the same as yours. |
1.For our work, we chose the layer of cross-attention, so I assume for llava it would be the projection layer. 2. We tested on a QA example, we computed the loss on the 'answer' part's output tokens. 3. I happened to test on llava's gradcam results, it's sometimes noisy. I conjecture each of the 576 visual tokens is attending to very different image information and studying some kind of relations. |
Thanks for explanation ! |
Yes, Correct. |
@Oscar860601 Hello! I am struggling with GradCAM visualization, could you please provide me with your code for visualizing on llava 1.5? It would be a great help for me. |
@Oscar860601 Same here! Looking for a piece of code! |
1 similar comment
@Oscar860601 Same here! Looking for a piece of code! |
How to get the output logit distribution of llava model? There is some wrong below the code: |
@Oscar860601 Same here! Looking for a piece of code! |
Your work is excellent, and I have seen the Grad-CAM visualization results you provided. Could you please share the code for the Grad-CAM visualization? I would be very grateful.
The text was updated successfully, but these errors were encountered: