-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Thanks for your reimplementation #3
Comments
Cool beans! Looking forward to seeing this improve as well as the @phymhan implementation! |
@phymhan hi first thank you for your awesome work! I enjoyed reading your paper. And I appreciate your comments. |
Hi @mkshing Thanks for your reply! I'm glad you enjoyed reading our paper and appreciated your comments. Thank you for bringing attention to the implementation of |
Apologies for the confusion earlier. Let me provide a quick clarification: "1-D kernels" refer to weight tensors that are 1-dimensional, e.g. |
@phymhan thank you for leaving detailed comments! I understood. I will fix it as soon as possible. I appreciate your feedback again. |
@phymhan Hi, for 1.), I think I have nothing to fix for current implementation. Can you specify where it is? |
Hi @mkshing , thanks for your response! I appreciate your clarification. Yes, you are correct, there is no need to make any modifications to your current implementation. What I did was for example, after |
Hello, Thanks for your amazing job, I would like to know if you will open the source code in recent days and what framework of the original code, pytorch? or others? Thanks again. |
Hi @mkshing, thanks for checking out our work! We're still waiting for legal team to review the code, and I've got some personal stuff going on too. It might take another 2 weeks unfortunately. Sorry for the wait, and I really appreciate your understanding. To answer your question, the original code for SD is implemented in PyTorch. Thanks! |
Hi @mkshing,
Thanks for taking the time to reimplement our paper! Your implementation looks fantastic! Although our code release is still under review, it should be available soon. Your results are already quite impressive. I appreciate your interest in our work! After taking a quick look at your implementation, I noticed a few differences that could impact performance: 1) To avoid large reconstruction error during SVD when using CPU, I converted the model to GPU and added the residual back when reassembling the weight matrix. 2) I also finetuned 1-D kernels, and 3) the text encoder. 4) For single image editing, I used a small learning rate (1e-6) for 1-D kernels and a larger learning rate (1e-3) for 2-D and 4-D kernels. 5) Finally, I utilized PyTorch's parametrization functionality to register hooks on loaded models, eliminating the need to define new classes for each model. Thank you again for your reimplementation! 🚀
The text was updated successfully, but these errors were encountered: