[Example] Distributed Replay Buffer Prototype Example Implementation #615
Conversation
@vmoens I've made ReplayBufferNode subclass TensorDictReplayBuffer now, so it fits in with the rest of the object hierarchy and can be used exactly like any other replay buffer.
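For context, a minimal sketch of that design, assuming the current public torchrl API; the class body and the capacity default are illustrative, not the exact code from this PR:

```python
# Hedged sketch only: `ReplayBufferNode` and its capacity default are
# illustrative; the PR's actual class may differ.
from torchrl.data import LazyMemmapStorage, TensorDictReplayBuffer

class ReplayBufferNode(TensorDictReplayBuffer):
    """A replay buffer intended to live on a dedicated RPC worker.

    Because it subclasses TensorDictReplayBuffer, remote callers can use
    the ordinary `extend` / `sample` methods on it, so it behaves like
    any other replay buffer.
    """

    def __init__(self, capacity: int = 10_000):
        # LazyMemmapStorage keeps stored tensordicts on disk as
        # memory-mapped tensors, which is what lets samples cross
        # process boundaries cheaply.
        super().__init__(storage=LazyMemmapStorage(capacity))
```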
This is soooo cool! For instance, this receives memmap tensors from one process and those tensors are actually of memmap type. But …
Note for future contributions: it's best if you don't develop on your main branch but branch out on your forked repo instead :)
Sure thing - would you prefer I create a new branch for this PR as well?
Codecov Report
@@            Coverage Diff             @@
##             main     #615      +/-   ##
==========================================
- Coverage   87.82%   87.73%   -0.09%
==========================================
  Files         125      126       +1
  Lines       24280    24371      +91
==========================================
+ Hits        21324    21382      +58
- Misses       2956     2989      +33
Great work!
Should we comment the example, or write a separate markdown file in the example directory to explain what this is about?
Personally I'd be in favour of commenting the code using this syntax, which will allow us to port it to the docs later on.
Do you think we can test the distributed replay buffer in the CI? It would be nice to cover that there.
Ah yes, I'll add some comments in the example. I'll investigate how best to test the distributed buffer now.
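One conceivable way to exercise this in CI, sketched with torch.multiprocessing; the worker body and world size are assumptions, and a real test would also need free-port handling and timeouts for flaky CI machines:

```python
import os
import torch.distributed.rpc as rpc
import torch.multiprocessing as mp

def _worker(rank: int, world_size: int) -> None:
    # Assumed single-machine rendezvous settings.
    os.environ["MASTER_ADDR"] = "localhost"
    os.environ["MASTER_PORT"] = "29500"
    name = {0: "trainer", 1: "replay_buffer"}.get(rank, f"collector{rank}")
    rpc.init_rpc(name, rank=rank, world_size=world_size)
    # The trainer rank would drive a short extend/sample round-trip
    # against the buffer worker here and assert on the sampled batch.
    rpc.shutdown()

def test_distributed_replay_buffer():
    # Spawn trainer + buffer + one collector; mp.spawn passes the rank
    # as the first argument to `_worker`.
    mp.spawn(_worker, args=(3,), nprocs=3, join=True)
```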
Wonderful! Thanks a million for this!
Description
Prototype example of a distributed replay buffer implementation using LazyMemmapStorage and TensorDictReplayBuffer, with nodes communicating via torch.distributed.rpc. The implementation allows for 1 trainer node, 1 replay buffer node, and N >= 1 data collector nodes.
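A hedged sketch of how that topology might be wired with torch.distributed.rpc follows; the worker names, ranks, port, and the `ReplayBufferNode` class (sketched earlier in this thread) are assumptions for illustration, not the PR's exact code:

```python
import os
import torch.distributed.rpc as rpc

# Assumed rendezvous settings for a single-machine run.
os.environ.setdefault("MASTER_ADDR", "localhost")
os.environ.setdefault("MASTER_PORT", "29500")

NUM_COLLECTORS = 2                 # N >= 1 data collector nodes
WORLD_SIZE = 2 + NUM_COLLECTORS    # plus the trainer and buffer nodes

def run_trainer(rank: int) -> None:
    rpc.init_rpc("trainer", rank=rank, world_size=WORLD_SIZE)
    # Create the buffer on its dedicated worker; the resulting RRef can
    # be handed to collectors so they all extend the same remote buffer.
    # `ReplayBufferNode` is the (hypothetical) subclass sketched above.
    buffer = rpc.remote("replay_buffer", ReplayBufferNode)
    batch = buffer.rpc_sync().sample(32)  # synchronous remote sample
    rpc.shutdown()
```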
Motivation and Context
This investigative example illustrates some patterns with which we can implement distributed RL algorithms with replay buffers using the torchrl framework. It also helps us understand which abstractions and ideas may be missing from the framework, or may need adapting, to make writing distributed RL algorithms natural and performant.
Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an x in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!