Introduce in-memory resource abstraction #375

mweber15 · 2023-05-10T19:14:42Z

This follows from discussion in #366.

The goal of this change is to allow for weights to be loaded from a copy of rust_model.ot that is already present in memory. There are two ways in which that data might be present:

1. As a HashMap<String, Tensor> from previous interaction with tch
2. As a contiguous buffer of the file data

One or the other mechanism might be preferable depending on how user code is using the model data. In some sense, implementing a provider based on the second option is more of a convenience method for the user to avoid the tch::nn::VarStore::load_from_stream interaction.
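As a rough, standard-library-only illustration of the two shapes the data can take (this is not code from the PR): `FakeTensor`, `total_params`, `stream_len` and `buffer_as_stream` are hypothetical names, and `FakeTensor` merely stands in for `tch::Tensor`. The second half shows why a contiguous buffer is convenient: wrapping it in a `Cursor` yields a `Read + Seek` stream that a stream-based loader can consume.

```rust
use std::collections::HashMap;
use std::io::{Cursor, Read, Seek, SeekFrom};

/// Hypothetical stand-in for a tensor; the PR itself uses `tch::Tensor`.
type FakeTensor = Vec<f32>;

/// Option 1: weights already decoded into a map of named tensors.
fn total_params(named: &HashMap<String, FakeTensor>) -> usize {
    named.values().map(|t| t.len()).sum()
}

/// Option 2: raw file bytes, consumed through any `Read + Seek` stream.
/// This stand-in only measures the stream; a real loader would parse it.
fn stream_len<S: Read + Seek>(mut stream: S) -> std::io::Result<u64> {
    let len = stream.seek(SeekFrom::End(0))?;
    stream.seek(SeekFrom::Start(0))?;
    Ok(len)
}

/// Wrap a byte slice so it can be passed where a file-like stream is expected.
fn buffer_as_stream(bytes: &[u8]) -> Cursor<&[u8]> {
    Cursor::new(bytes)
}
```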

I've changed the definition of the ResourceProvider trait to require that it be both Send and Sync. There are currently certain contexts where dyn ResourceProvider + Send is required, but in theory before this change an implementation might not be Send (or Sync). The existing providers are both Send and Sync, and it seems reasonable (if technically incorrect) for user code to assume this to be true. I don't see a downside to making this explicit, but that part of this change might be better suited for separate discussion. I am not trying to sneak it in.
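To make the effect of the supertrait bounds concrete, here is a minimal sketch under stated assumptions: `Provider`, `InMemory` and `name_from_thread` are hypothetical names, not the crate's API. Once the trait itself requires `Send + Sync`, a plain `Arc<dyn Provider>` can cross a thread boundary without each call site having to add `+ Send`.

```rust
use std::sync::Arc;
use std::thread;

/// Hypothetical provider trait; the `Send + Sync` supertraits are the point.
trait Provider: Send + Sync {
    fn name(&self) -> String;
}

/// Hypothetical implementation with no thread-unsafe state.
struct InMemory;

impl Provider for InMemory {
    fn name(&self) -> String {
        "in-memory".to_string()
    }
}

/// Because every `Provider` is `Send + Sync`, the trait object can be moved
/// into another thread without writing `dyn Provider + Send + Sync`.
fn name_from_thread(provider: Arc<dyn Provider>) -> String {
    let shared = Arc::clone(&provider);
    thread::spawn(move || shared.name()).join().unwrap()
}
```

Without the supertraits, `thread::spawn` would reject the closure because `dyn Provider` alone gives no `Send` guarantee.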

The enum Resource data type is used here to abstract over the possible ways a ResourceProvider might represent an underlying resource. Without it, load_weights would have to either call different trait methods until one succeeded, or rely on an as_any implementation and downcast, to behave as it does now. Both options seemed worse than creating a wrapper.
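A simplified sketch of the wrapper idea, using only the standard library: the `Resource` enum below is a hypothetical reduction of the one described, and `read_bytes` is an illustrative stand-in for weight loading. The point is that a single `match` handles every representation, with no downcasting and no trial-and-error over trait methods.

```rust
use std::io::{Cursor, Read};
use std::path::PathBuf;

/// Hypothetical simplification of the wrapper enum described above.
enum Resource<'a> {
    /// The resource lives on disk.
    PathBuf(PathBuf),
    /// The resource is a contiguous in-memory buffer.
    Buffer(&'a [u8]),
}

/// Stand-in for loading weights: returns the raw bytes from either variant.
fn read_bytes(resource: Resource<'_>) -> std::io::Result<Vec<u8>> {
    match resource {
        Resource::PathBuf(path) => std::fs::read(path),
        Resource::Buffer(data) => {
            let mut out = Vec::new();
            Cursor::new(data).read_to_end(&mut out)?;
            Ok(out)
        }
    }
}
```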

While it would be possible to replace all calls to get_local_path with the get_resource API, removal of the existing function would be a very big breaking change. As such, this change also introduces RustBertError::UnsupportedError to allow for the different methods to coexist. An alternative would be for the new ResourceProviders to write their resources to a temporary disk location and return an appropriate path, but that is counter to the purpose of the new ResourceProviders and so I chose not to implement that.
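The coexistence of the two methods might look roughly like the following sketch. All names here (`ProviderError`, `Provider`, `LocalFile`, `InMemory`) are hypothetical; `ProviderError::Unsupported` only mirrors the role the text gives to `RustBertError::UnsupportedError`. A file-backed provider answers both calls, while a buffer-backed provider reports that a local path is unsupported instead of writing a temporary file.

```rust
use std::path::PathBuf;

#[derive(Debug, PartialEq)]
enum ProviderError {
    /// Plays the role of `RustBertError::UnsupportedError` in the text.
    Unsupported,
}

/// Hypothetical simplification of the resource wrapper.
enum Resource<'a> {
    PathBuf(PathBuf),
    Buffer(&'a [u8]),
}

/// Hypothetical trait keeping the old path-based method next to the new one.
trait Provider {
    fn get_local_path(&self) -> Result<PathBuf, ProviderError>;
    fn get_resource(&self) -> Result<Resource<'_>, ProviderError>;
}

struct LocalFile {
    path: PathBuf,
}

struct InMemory {
    data: Vec<u8>,
}

impl Provider for LocalFile {
    fn get_local_path(&self) -> Result<PathBuf, ProviderError> {
        Ok(self.path.clone())
    }
    fn get_resource(&self) -> Result<Resource<'_>, ProviderError> {
        Ok(Resource::PathBuf(self.path.clone()))
    }
}

impl Provider for InMemory {
    /// There is no file on disk, so the old API reports an error.
    fn get_local_path(&self) -> Result<PathBuf, ProviderError> {
        Err(ProviderError::Unsupported)
    }
    fn get_resource(&self) -> Result<Resource<'_>, ProviderError> {
        Ok(Resource::Buffer(&self.data))
    }
}
```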

mweber15 · 2023-05-11T13:44:03Z

After thinking more about the potential for tch::Tensors to be mutated despite the attempt here to make that difficult, I don't really think TensorResource is a safe abstraction. I know that for my usage, BufferResource would be sufficient, and that can be implemented without safety concerns. Very interested to hear other thoughts.

Dropping the requirement I've added here for ResourceProvider to be Send + Sync would of course be fine and would put the onus back on users of TensorResource to observe its limitations. I do think it would be beneficial for all ResourceProviders to be explicitly Send + Sync if possible though.

guillaume-be · 2023-05-25T17:35:43Z

Thank you @mweber15. Could you please add a small test that validates and illustrates that the approach works for the buffer resource? You could pick any of the small models (e.g. DistilBART, as done so far, or a DistilBERT model).

mweber15 · 2023-05-25T17:47:47Z

Hi @guillaume-be,

The recent build change has broken the build for me in a couple of ways that I'm working through. Once I have that sorted I will add an illustrative test and make sure the doc comments reflect the current state of the PR. Hopefully soon.

guillaume-be · 2023-05-25T18:00:21Z

examples/natural_language_inference_deberta.rs

@@ -20,14 +20,13 @@ fn main() -> anyhow::Result<()> {
    let merges_resource = Box::new(RemoteResource::from_pretrained(
        DebertaMergesResources::DEBERTA_BASE_MNLI,
    ));
-    let model_resource = Box::new(RemoteResource::from_pretrained(
+    let mut model_resource = Box::new(RemoteResource::from_pretrained(


Note that with the latest changes (RwLocks) the resource no longer needs to be mutable, so the changes should be essentially backward compatible (examples and tests can be left unchanged).

The signature of load_weights becomes:

    pub fn load_weights(
        rp: &(impl ResourceProvider + ?Sized),
        vs: &mut VarStore,
    ) -> Result<(), RustBertError> {
        match rp.get_resource()? {
            Resource::Buffer(mut data) => {
                vs.load_from_stream(std::io::Cursor::new(data.deref_mut()))?;
                Ok(())
            }
            Resource::PathBuf(path) => Ok(vs.load(path)?),
        }
    }
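The interplay between the lock and the `deref_mut()` call can be sketched with the standard library alone. This is an illustration under assumptions, not the crate's code: `LockedBuffer`, `write_guard` and `read_through_guard` are hypothetical names, and reading the bytes back stands in for `load_from_stream`. Because the buffer sits behind an `RwLock`, a shared `&self` reference is enough to obtain mutable access, which is why callers no longer need a `mut` binding.

```rust
use std::io::{Cursor, Read};
use std::ops::DerefMut;
use std::sync::{RwLock, RwLockWriteGuard};

/// Hypothetical buffer-backed provider: the lock lets `&self` hand out
/// mutable access, so callers do not need a `mut` binding.
struct LockedBuffer {
    data: RwLock<Vec<u8>>,
}

impl LockedBuffer {
    /// Takes the write lock through a shared reference.
    fn write_guard(&self) -> RwLockWriteGuard<'_, Vec<u8>> {
        self.data.write().unwrap()
    }
}

/// Stand-in for a stream-based loader: reads everything from the guarded buffer.
fn read_through_guard(provider: &LockedBuffer) -> Vec<u8> {
    let mut guard = provider.write_guard();
    let mut out = Vec::new();
    // `deref_mut` yields `&mut Vec<u8>`, which `Cursor` can read from.
    Cursor::new(guard.deref_mut())
        .read_to_end(&mut out)
        .unwrap();
    out
}
```

The guard is released when `read_through_guard` returns, so the same provider can be read again later.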

guillaume-be

Thank you for the PR!

mweber15 · 2023-05-27T16:57:07Z

Thank you so much for your help putting this together!

- Add impl<T: ResourceProvider + ?Sized> ResourceProvider for Box<T>

e6307b4

- Remove `Resource::NamedTensors`
- Change `BufferResource` to contain a `&[u8]` rather than `Vec<u8>`

guillaume-be mentioned this pull request May 12, 2023

Minor changes to in-memory resources implementation mweber15/rust-bert#1

Merged

guillaume-be and others added 4 commits May 22, 2023 19:18

Further rework proposal for resources

8d6b14b

Use mutable references and locks

0d1b63e

Merge pull request #1 from guillaume-be/updated_inmemory_resource

5aeec26

Minor changes to in-memory resources implementation

Merge branch 'master' into add-in-memory-resources

697608e

mweber15 added 2 commits May 25, 2023 13:54

Make model resources mutable in tests/examples

77e7c37

Merge branch 'master' into add-in-memory-resources

6e17895

guillaume-be reviewed May 25, 2023

mweber15 added 2 commits May 25, 2023 14:13

Remove unnecessary mutability and TensorResource references

2c280b1

Add BufferResource example

5beec4b

guillaume-be approved these changes May 26, 2023

guillaume-be merged commit ba57704 into guillaume-be:master May 26, 2023

mweber15 deleted the add-in-memory-resources branch May 27, 2023 16:57
