-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
replace searchsorted with in-built function #35
replace searchsorted with in-built function #35
Conversation
3d268ed
to
49e1093
Compare
Hello @WillBrennan , I have merged commits previously on this issue. However, I am not sure whether this or other commits end up hurting the performance so I rolled back the code. Can you let me know if you can get comparable results before I merge it? Much thanks! |
Hi @WillBrennan |
49e1093
to
434e781
Compare
Hi everyone; wasn't aware of that previous issue with rendering. I'll keep you posted on the training and rendering results. It looks like there's been lots of other changes since then. If it works okay on my branch I'll run a git-bisect and try and pin down the bug as well. |
Hey @WillBrennan you are right, I am not sure exactly what breaks the performance. Please keep me posted and huge thanks for the efforts. |
Sorry for the late reply! Model trained correctly on a single machine with a 2080Ti in just under ~8 hours using the minimum pytorch and torchvision versions in requirements txt on CUDA 10.2. I've uploaded the model, output videos and console logs to a folder on google drive for you to inspect before merging in this PR. I'll start git-bisecting through the dev branch to work out whats going on here. I'd love to be able to have multi-gpu support and train a lot quicker! |
Sorry! Just realised the drive folder with the results wasn't made public. Just updated that now. |
** nudge ** |
Hey sorry for the delay. The results look stunning, thanks so much for this! |
Do you mind if I spend some time training other models before merging? |
No worries; of course not. Let me know if I can help; I've got spare GPUs waiting to heat-up a room. |
Cool, how about we split the models, e.g., I do the Blender scenes and you
do the LLFF scenes?
…On Fri, May 14, 2021 at 5:10 AM Will Brennan ***@***.***> wrote:
No worries; of course not. Let me know if I can help; I've got spare GPUs
waiting to heat-up a room.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#35 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABV3DR2LUTXRBWGZLI62VRTTNTSITANCNFSM43LGMX4A>
.
|
ping @WillBrennan |
oops! sorry; forgot to say I'm training the LLFF scenes at the moment. Should be done by tomorrow. |
Just uploaded the output in the logs directory to the directory I linked above; https://drive.google.com/drive/folders/1uq0OSpyCuSIOBbT12L3pLiEnMoKIwMDo?usp=sharing Looks like its all working correctly. This has;
|
Hi @WillBrennan! I have just tried to render using your pretrained models and got the error
May I ask you, did you test the models with built-in torchsearchsorted and with torch 1.8? Did you use the main or dev branch? |
This wasn’t with the master or dev branch. It was with the branch for this PR to show it’s working as expected. It looks like you’ve tried to load a single GPU model into a DataParallel object so I’m guessing you’re running off of the broken Dev branch that added multi-gpu support? If you use the branch from this PR or master then it’ll load correctly |
Yo sorry @salykovaa you wanna try again? |
@WillBrennan huge huge thanks <3 |
Anytime! It’s always great to see a project like this on GitHub! |
@WillBrennan hmm interesting... I used master branch. but ok, I will try again tomorrow ;) may be I did something wrong |
Hi @WillBrennan! I have just finished testing your models. As I said, yesterday (and today) I used master branch and try to render, but got the same error posted yesterday. I don't know why the error appears. I tried both python 3.6, 3.7 and both pytorch 1.8, 1.9, but these models provided by you don't work. What I found interesting is that the rendering works perfectly with old models from 2020 provided by @yenchenlin here |
same issue @WillBrennan one solution, replace this line Line 231 in 1f06483
with
|
@salykovaa I've tested on my own machine and the issue is confirmed. I've changed the pre-trained model links back to my original google drive. @WillBrennan seems that your saved weights have @jason718 thank you! This solution removes all the |
…rchsorted replace searchsorted with in-built function
Recent versions of pytorch add searchsorted; I had issues compiling your project with the versions of GCC and NVCC I have on my system. Looks like you were thinking along similar lines with your todo statement.
Because I wasn't able to compile the original torchsearchsorted module I haven't been able to check if the results are identical. However training on the low-res modules is converging and PSNR is going up in a reasonable way.