replace searchsorted with in-built function #35

WillBrennan · 2021-04-21T21:54:27Z

Recent versions of pytorch add searchsorted; I had issues compiling your project with the versions of GCC and NVCC I have on my system. Looks like you were thinking along similar lines with your todo statement.

Because I wasn't able to compile the original torchsearchsorted module I haven't been able to check if the results are identical. However training on the low-res modules is converging and PSNR is going up in a reasonable way.
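One way to check the built-in call without compiling torchsearchsorted is to compare it against a pure-Python reference. The sketch below is not part of this PR: the helper names `reference_searchsorted` and `matches_torch` and the sample values are hypothetical. It assumes the sampling code searches a sorted CDF row by row with right-sided insertion, which is what `torch.searchsorted(..., right=True)` computes.

```python
from bisect import bisect_left, bisect_right


def reference_searchsorted(sorted_rows, value_rows, right=False):
    """Row-wise insertion indices computed with the standard library only."""
    find = bisect_right if right else bisect_left
    return [[find(row, v) for v in values]
            for row, values in zip(sorted_rows, value_rows)]


def matches_torch(sorted_rows, value_rows, right=False):
    """Compare torch.searchsorted with the reference (needs PyTorch installed)."""
    import torch  # imported here so the reference above works without PyTorch

    got = torch.searchsorted(torch.tensor(sorted_rows),
                             torch.tensor(value_rows), right=right)
    return got.tolist() == reference_searchsorted(sorted_rows, value_rows, right)


# Hypothetical example: two CDF rows and two uniform samples per row.
cdf = [[0.0, 0.2, 0.5, 1.0], [0.0, 0.7, 0.9, 1.0]]
u = [[0.2, 0.6], [0.1, 0.95]]
print(reference_searchsorted(cdf, u, right=True))  # [[2, 3], [1, 3]]
```

Calling `matches_torch(cdf, u, right=True)` on a machine with PyTorch should return `True` if the built-in behaves like the reference for these inputs. It does not prove the old extension gave the same indices.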

yenchenlin · 2021-04-21T21:58:10Z

Hello @WillBrennan ,

I have merged commits previously on this issue. However, I am not sure whether this or other commits end up hurting the performance so I rolled back the code.

Can you let me know if you can get comparable results before I merge it?

Much thanks!

salykova · 2021-04-21T22:02:43Z

Hi @WillBrennan
Please check the issue. That's why the code was rolled back, and that's why these requirements are used instead of torch>=1.8 and torchvision>=0.9.1. Summary: with the new versions of torch and torchvision, the provided pretrained models cannot be rendered.

WillBrennan · 2021-04-21T22:25:11Z

Hi everyone; wasn't aware of that previous issue with rendering. I'll keep you posted on the training and rendering results. It looks like there's been lots of other changes since then. If it works okay on my branch I'll run a git-bisect and try and pin down the bug as well.

yenchenlin · 2021-04-21T22:31:39Z

Hey @WillBrennan you are right, I am not sure exactly what breaks the performance. Please keep me posted and huge thanks for the efforts.

WillBrennan · 2021-04-23T21:30:11Z

Sorry for the late reply! The model trained correctly on a single machine with a 2080Ti in just under 8 hours, using the minimum pytorch and torchvision versions in requirements.txt on CUDA 10.2.

I've uploaded the model, output videos and console logs to a folder on google drive for you to inspect before merging in this PR.
https://drive.google.com/drive/folders/1lWQuF4ylr-vVFJAikRsle9-DofnFBCIE?usp=sharing

I'll start git-bisecting through the dev branch to work out what's going on here. I'd love to be able to have multi-GPU support and train a lot quicker!

WillBrennan · 2021-04-27T11:52:54Z

Sorry! Just realised the drive folder with the results wasn't made public. Just updated that now.

WillBrennan · 2021-05-11T17:38:54Z

** nudge **

yenchenlin · 2021-05-13T19:11:13Z

Hey sorry for the delay. The results look stunning, thanks so much for this!

yenchenlin · 2021-05-13T19:12:00Z

Do you mind if I spend some time training other models before merging?

WillBrennan · 2021-05-14T09:10:16Z

No worries; of course not. Let me know if I can help; I've got spare GPUs waiting to heat-up a room.

yenchenlin · 2021-05-14T11:53:20Z

Cool, how about we split the models, e.g., I do the Blender scenes and you do the LLFF scenes?

yenchenlin · 2021-05-18T20:17:33Z

ping @WillBrennan

WillBrennan · 2021-05-18T20:19:17Z

oops! sorry; forgot to say I'm training the LLFF scenes at the moment. Should be done by tomorrow.

WillBrennan · 2021-05-19T18:39:35Z

I've just uploaded the output from the logs directory to the Drive folder linked above:

https://drive.google.com/drive/folders/1uq0OSpyCuSIOBbT12L3pLiEnMoKIwMDo?usp=sharing

Looks like it's all working correctly. The upload covers these scenes:

fern
flowers
fortress
horns
leaves
orchids
room
trex

salykova · 2021-06-21T19:42:34Z

Hi @WillBrennan! I have just tried to render using your pretrained models and got the error

Found ckpts ['./logs/fern_test/200000.tar']
Reloading from ./logs/fern_test/200000.tar
Traceback (most recent call last):
  File "run_nerf.py", line 878, in <module>
    train()
  File "run_nerf.py", line 640, in train
    render_kwargs_train, render_kwargs_test, start, grad_vars, optimizer = create_nerf(args)
  File "run_nerf.py", line 231, in create_nerf
    model.load_state_dict(ckpt['network_fn_state_dict'])
  File "/home/mnsv/miniconda3/envs/nerf/lib/python3.7/site-packages/torch/nn/modules/module.py", line 1407, in load_state_dict
    self.__class__.__name__, "\n\t".join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for NeRF:
Missing key(s) in state_dict: "pts_linears.0.weight", "pts_linears.0.bias", "pts_linears.1.weight", "pts_linears.1.bias", "pts_linears.2.weight", "pts_linears.2.bias", "pts_linears.3.weight", "pts_linears.3.bias", "pts_linears.4.weight", "pts_linears.4.bias", "pts_linears.5.weight", "pts_linears.5.bias", "pts_linears.6.weight", "pts_linears.6.bias", "pts_linears.7.weight", "pts_linears.7.bias", "views_linears.0.weight", "views_linears.0.bias", "feature_linear.weight", "feature_linear.bias", "alpha_linear.weight", "alpha_linear.bias", "rgb_linear.weight", "rgb_linear.bias". Unexpected key(s) in state_dict: "module.pts_linears.0.weight", "module.pts_linears.0.bias", "module.pts_linears.1.weight", "module.pts_linears.1.bias", "module.pts_linears.2.weight", "module.pts_linears.2.bias", "module.pts_linears.3.weight", "module.pts_linears.3.bias", "module.pts_linears.4.weight", "module.pts_linears.4.bias", "module.pts_linears.5.weight", "module.pts_linears.5.bias", "module.pts_linears.6.weight", "module.pts_linears.6.bias", "module.pts_linears.7.weight", "module.pts_linears.7.bias", "module.views_linears.0.weight", "module.views_linears.0.bias", "module.feature_linear.weight", "module.feature_linear.bias", "module.alpha_linear.weight", "module.alpha_linear.bias", "module.rgb_linear.weight", "module.rgb_linear.bias".

May I ask: did you test the models with the built-in torch.searchsorted and with torch 1.8? Did you use the main or dev branch?

WillBrennan · 2021-06-21T21:09:58Z

This wasn’t with the master or dev branch. It was with the branch for this PR to show it’s working as expected.

It looks like you’ve tried to load a single GPU model into a DataParallel object so I’m guessing you’re running off of the broken Dev branch that added multi-gpu support?

If you use the branch from this PR or master then it’ll load correctly
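A quick way to tell which kind of checkpoint is on disk is to look at the key names, since weights saved from a `DataParallel`-wrapped model carry a `module.` prefix on every key. The helper below is a hypothetical sketch, not code from this repository; it only inspects the keys of a state dict that has already been loaded.

```python
def describe_checkpoint_keys(state_dict, prefix="module."):
    """Classify a state dict by whether its keys carry the DataParallel prefix."""
    keys = list(state_dict)
    prefixed = sum(k.startswith(prefix) for k in keys)
    if prefixed == 0:
        return "plain"    # saved from an unwrapped model
    if prefixed == len(keys):
        return "wrapped"  # saved from a DataParallel-wrapped model
    return "mixed"        # unusual; worth inspecting by hand


# Key names taken from the error message above; the values are placeholders.
unexpected = {"module.pts_linears.0.weight": None, "module.rgb_linear.bias": None}
missing = {"pts_linears.0.weight": None, "rgb_linear.bias": None}
print(describe_checkpoint_keys(unexpected))  # wrapped
print(describe_checkpoint_keys(missing))     # plain
```

In practice the argument would be something like `ckpt['network_fn_state_dict']` after loading the `.tar` file; a "wrapped" result means the checkpoint expects a wrapped model, or needs its prefix removed before loading into a plain one.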

yenchenlin · 2021-06-21T21:11:05Z

Yo sorry @salykovaa you wanna try again?

yenchenlin · 2021-06-21T21:11:15Z

@WillBrennan huge huge thanks <3

WillBrennan · 2021-06-21T21:14:31Z

Anytime! It’s always great to see a project like this on GitHub!

salykova · 2021-06-21T21:34:35Z

@WillBrennan hmm, interesting... I used the master branch. But OK, I will try again tomorrow ;) Maybe I did something wrong.

salykova · 2021-06-22T11:57:10Z

Hi @WillBrennan! I have just finished testing your models. As I said, yesterday (and today) I used the master branch and tried to render, but got the same error I posted yesterday. I don't know why the error appears. I tried Python 3.6 and 3.7 with both PyTorch 1.8 and 1.9, but the models you provided don't work. What I found interesting is that rendering works perfectly with the old models from 2020 provided by @yenchenlin here

jason718 · 2021-10-06T11:12:40Z

same issue @WillBrennan

One solution: replace this line

nerf-pytorch/run_nerf.py

Line 231 in 1f06483

model.load_state_dict(ckpt['network_fn_state_dict'])

with

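        from collections import OrderedDict

        # Checkpoints saved from a DataParallel-wrapped model prefix every key with 'module.'.
        prefix = 'module.'
        new_ckpt = OrderedDict()
        for k, v in ckpt['network_fn_state_dict'].items():
            if k.startswith(prefix):
                new_ckpt[k[len(prefix):]] = v  # remove the 'module.' prefix
            else:
                new_ckpt[k] = v  # key is already in the plain format
        model.load_state_dict(new_ckpt)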

yenchenlin · 2021-10-06T14:56:43Z

@salykovaa I've tested on my own machine and the issue is confirmed. I've changed the pre-trained model links back to my original google drive.

@WillBrennan it seems that your saved weights have "module." as a key prefix, and that makes the model loading fail.

@jason718 thank you! This solution removes every "module." prefix and lets pytorch successfully load the model provided by @WillBrennan

replace torchsearchsorted with inbuilt call

19a66af

WillBrennan force-pushed the feature/replace-searchsorted branch from 3d268ed to 49e1093 Compare April 21, 2021 21:55

update requirements.txt to pytorch version with searchsorted

434e781

WillBrennan force-pushed the feature/replace-searchsorted branch from 49e1093 to 434e781 Compare April 21, 2021 22:12

yenchenlin merged commit f1e5b3f into yenchenlin:master Jun 21, 2021

WillBrennan deleted the feature/replace-searchsorted branch June 21, 2021 21:11

alanspike mentioned this pull request Jul 12, 2022

Do we still need torchsearchsorted? MingSun-Tse/Efficient-NeRF#5

Closed

SRDewan pushed a commit to SRDewan/nerf-pytorch that referenced this pull request Jul 19, 2022

Merge pull request yenchenlin#35 from WillBrennan/feature/replace-sea…

405efc4
