Save test predictions on multiple GPUs #2926

nateraw · 2020-08-12T05:29:17Z

What does this PR do?

This PR lets you save test_step predictions across multiple GPUs via a new function EvalResult.write(name, values, filename='predictions.txt').

Here's what it looks like:

class MyModel(pl.LightningModule):
    ...
    def test_step(self, batch, batch_idx):
        x, y = batch
        logits, loss = self(x)
        preds = torch.argmax(logits, dim=1)
        result = pl.EvalResult()
        result.log('test_loss', loss)
        result.write('preds', preds, filename='./predictions.txt')
        return result

If you don't call .write() on your EvalResult objects, nothing should happen.
If you do, it'll save at the files you specified.
If on multiple GPUs, it should prepend your filename with the rank, leaving you with n_gpus number of prediction files.
- For example, if running on two GPUs, test dataset size of 10,000, and your prediction file is the default of predictions.txt, you'll end up with:
  - predictions_rank_0.txt --> 5000 predictions
  - predictions_rank_1.txt --> 5000 predictions

Its basically going to take whatever you send to .write() and torch.cat/list.extend it with existing data with matching filename/name keys. In the background, the dictionary we're creating looks like this right now:

_predictions = {
    # The default file that predictions are saved to when you call .write
    'predictions.txt': {
        'preds': ['cat', 'dog', 'horse'],
        'idxs': [0, 1, 2]
    },

    # Optionally, if the predictions you want to save have different lengths, you can add more files
    'others.txt': {
        'something': ['wow', 'so', 'cool', 'dont', 'you', 'think?'],
        'idxs': [0, 1, 2, 3, 4, 5]
    }
}

Fixes # (issue)

Before submitting

Was this discussed/approved via a Github issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure your PR does only one thing, instead of bundling different changes together? Otherwise, we ask you to create a separate PR for every change.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?
Did you verify new and existing tests pass locally with your changes?
If you made a notable change (that affects users), did you update the CHANGELOG?

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

pep8speaks · 2020-08-12T05:29:21Z

Hello @nateraw! Thanks for updating this PR.

In the file tests/base/model_test_steps.py:

Line 99:5: E303 too many blank lines (2)
Line 139:1: W293 blank line contains whitespace
Line 145:1: W293 blank line contains whitespace

In the file tests/core/test_results.py:

Line 74:120: E501 line too long (149 > 119 characters)

Comment last updated at 2020-08-14 20:42:10 UTC

pytorch_lightning/trainer/supporters.py

codecov · 2020-08-13T22:56:00Z

Codecov Report

Merging #2926 into master will increase coverage by 2%.
The diff coverage is 86%.

@@           Coverage Diff           @@
##           master   #2926    +/-   ##
=======================================
+ Coverage      86%     88%    +2%     
=======================================
  Files          81      81            
  Lines        7554    7836   +282     
=======================================
+ Hits         6476    6907   +431     
+ Misses       1078     929   -149

pytorch_lightning/trainer/supporters.py

williamFalcon · 2020-08-14T08:13:29Z

pytorch_lightning/trainer/evaluation_loop.py

+
+                    # Add step predictions to prediction collection to write later
+                    do_write_predictions = is_result_obj and test_mode
+                    if do_write_predictions:


this should just stay inside the prediction object

we already track prediction objects. No need to do anything explicit.

williamFalcon · 2020-08-14T08:13:59Z

pytorch_lightning/trainer/evaluation_loop.py

@@ -388,6 +395,9 @@ def _evaluate(
        # log callback metrics
        self.__update_callback_metrics(eval_results, using_eval_result)

+        # Write predictions to disk if they're available.
+        predictions.to_disk()


this should just be an internal function of the prediction object.

we can’t wait till the end of epoch to write predictions bc we will accumulate too much memory.

the write needs to happen at every batch.

how do we want to deal with writing to the cache file (w/ torch.save) if we want to append to it?

pytorch_lightning/core/step_result.py

pytorch_lightning/trainer/evaluation_loop.py

pytorch_lightning/trainer/supporters.py

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>

* Save test predictions on multiple GPUs

nateraw force-pushed the fix-preds branch from a5a5173 to 7f6a7b8 Compare August 12, 2020 15:18

justusschock reviewed Aug 13, 2020

View reviewed changes

pytorch_lightning/trainer/supporters.py Show resolved Hide resolved

nateraw force-pushed the fix-preds branch from 7f6a7b8 to 96eb070 Compare August 13, 2020 21:55

nateraw and others added 11 commits August 13, 2020 20:04

🚧 wip

b2eaf71

🚧 wip

0888391

🚧 wip

d590cab

track batch size

5d6633b

🐛 switch f-string back because it was syntax error

3b44a44

🚧 start to add edits from messy local branch

4b84912

🚧 use torch.save instead of csv

65a4c05

🚧 .

7cbcc46

🚧 .

1f535d6

🚧 wip

71a8c99

🚧 wip

bd45203

nateraw force-pushed the fix-preds branch from a4166bf to bd45203 Compare August 14, 2020 02:04

nateraw added 8 commits August 13, 2020 20:59

🚧 wip

0a9fe65

🚧 wip

6e82b20

🚧 wip

68b2b54

🚧 wip

8e974d2

🚧 wip

de984c4

🚧 wip

c09efa4

🚧 wip

bf6b696

🚧 wip

031729e

nateraw force-pushed the fix-preds branch from bd9d88a to 031729e Compare August 14, 2020 06:45

justusschock approved these changes Aug 14, 2020

View reviewed changes

pytorch_lightning/trainer/supporters.py Show resolved Hide resolved

pytorch_lightning/trainer/supporters.py Show resolved Hide resolved

🚧 wip

e8b35f1

nateraw force-pushed the fix-preds branch from 46fbaf1 to e8b35f1 Compare August 14, 2020 07:49

🚧 wip

7b9fd75

nateraw marked this pull request as ready for review August 14, 2020 08:02

mergify bot requested a review from a team August 14, 2020 08:02

williamFalcon reviewed Aug 14, 2020

View reviewed changes

mergify bot requested a review from a team August 14, 2020 08:13

williamFalcon reviewed Aug 14, 2020

View reviewed changes

mergify bot requested a review from a team August 14, 2020 08:14

Borda added the feature Is an improvement or enhancement label Aug 14, 2020

Borda reviewed Aug 14, 2020

View reviewed changes

mergify bot requested a review from a team August 14, 2020 09:36

williamFalcon and others added 5 commits August 14, 2020 13:03

Update pytorch_lightning/core/step_result.py

0d91462

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>

Update pytorch_lightning/core/step_result.py

ca6762f

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>

Update pytorch_lightning/core/step_result.py

9f23dc2

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>

Update step_result.py

373ab8e

Update step_result.py

0a1f9c2

williamFalcon merged commit b969523 into Lightning-AI:master Aug 14, 2020

ameliatqy pushed a commit to ameliatqy/pytorch-lightning that referenced this pull request Aug 17, 2020

Save test predictions on multiple GPUs (Lightning-AI#2926)

456eded

* Save test predictions on multiple GPUs

Borda added this to the 0.9.0 milestone Aug 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Save test predictions on multiple GPUs #2926

Save test predictions on multiple GPUs #2926

nateraw commented Aug 12, 2020 •

edited

Loading

pep8speaks commented Aug 12, 2020 •

edited

Loading

codecov bot commented Aug 13, 2020 •

edited

Loading

williamFalcon Aug 14, 2020

williamFalcon Aug 14, 2020

williamFalcon Aug 14, 2020

nateraw Aug 14, 2020

Save test predictions on multiple GPUs #2926

Save test predictions on multiple GPUs #2926

Conversation

nateraw commented Aug 12, 2020 • edited Loading

What does this PR do?

Before submitting

PR review

Did you have fun?

pep8speaks commented Aug 12, 2020 • edited Loading

Comment last updated at 2020-08-14 20:42:10 UTC

codecov bot commented Aug 13, 2020 • edited Loading

Codecov Report

williamFalcon Aug 14, 2020

Choose a reason for hiding this comment

williamFalcon Aug 14, 2020

Choose a reason for hiding this comment

williamFalcon Aug 14, 2020

Choose a reason for hiding this comment

nateraw Aug 14, 2020

Choose a reason for hiding this comment

nateraw commented Aug 12, 2020 •

edited

Loading

pep8speaks commented Aug 12, 2020 •

edited

Loading

codecov bot commented Aug 13, 2020 •

edited

Loading