Typed messaging and validation

I *was* originally going to write a big post on [`pydantic`](https://pydantic-docs.helpmanual.io/) and how we could offer typed messages using that very nice project, despite a couple of holdups for integration with `msgpack`.

However, it turns out that just today an even faster, msgpack-specific project was released: [`msgspec`](https://github.com/jcrist/msgspec) 🏄🏼

It claims not only to be faster than [msgpack-python](https://github.com/msgpack/msgpack-python) but also to support [schema evolution](https://jcristharif.com/msgspec/#schema-evolution) and other [niceties](https://jcristharif.com/msgspec/#highlights).
It also has perf bumps when making multiple [repeated encode/decode calls](https://jcristharif.com/msgspec/#usage), which is exactly [how we're currently using `msgpack` inside our `Channel`](https://github.com/goodboy/tractor/blob/master/tractor/_ipc.py#L45).

Overall there looks to be no downside and we'll get typed message semantics fast and free 👍🏼 

For reference, I'll leave a bunch of links I'd previously gathered regarding making `pydantic` work with `msgpack`:
- https://github.com/samuelcolvin/pydantic/issues/951
- https://pydantic-docs.helpmanual.io/usage/dataclasses/
- https://github.com/samuelcolvin/pydantic/pull/595
- https://github.com/tiangolo/fastapi/issues/1285
- https://github.com/MolSSI/QCElemental/blob/master/qcelemental/models/basemodels.py#L121
  - this effectively just adds a `BaseModel.serialize()` which looks up a serialize method by name (e.g. json, msgpack) but doesn't really add any "native feeling" support or speed gains afaict.
  
##### TODO
- [ ] support for a `msgpack-python` [custom type](https://github.com/msgpack/msgpack-python#packingunpacking-of-custom-data-type) serializer for `pydantic.BaseModel` such that we just implicitly render with `.dict()` at pack time and load via `Model(**message)` at decode time?
- [ ] write ourselves a small length-prefixed framing protocol for `msgspec` as per the comments in #212
  - example from [a blog post on protobuf](https://eli.thegreenplace.net/2011/08/02/length-prefix-framing-for-protocol-buffers)
  - consider how we might wrap `trio.SocketStream` using something like [`tricycle.BufferedReceiveStream`](https://github.com/oremanj/tricycle/blob/master/tricycle/_streams.py#L12); @oremanj was nice enough to provide usage:
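  ```python
  import struct

  async def read_frames(stream):  # stream: a tricycle.BufferedReceiveStream
      # receive_all_or_none() returns None on a clean EOF, which ends the loop
      while header := await stream.receive_all_or_none(4):
          (size,) = struct.unpack("<I", header)  # 4-byte little-endian length
          # probably want to sanity-check size for not being unreasonably huge
          chunk = await stream.receive_exactly(size)
          # do something with chunk
  ```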
- [ ] consider offering `msgspec` as an optional dependency if we end up liking it?
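
For the framing item above, here is a rough, transport-agnostic sketch of what a length-prefixed protocol could look like. It uses only the stdlib `struct` module and keeps the same 4-byte little-endian header as the snippet from @oremanj. The names `frame()`, `FrameDecoder` and `MAX_FRAME` are hypothetical and exist only for illustration; the 16 MiB limit is an arbitrary assumption, and nothing here is tied to `msgspec` or `trio` yet (the payload is just opaque bytes).

```python
import struct
from typing import List

# 4-byte little-endian unsigned length header, as in the snippet above.
HEADER = struct.Struct("<I")
# Hypothetical sanity limit; pick whatever suits the real workload.
MAX_FRAME = 16 * 1024 * 1024


def frame(payload: bytes) -> bytes:
    """Prefix an already-encoded message with its byte length."""
    if len(payload) > MAX_FRAME:
        raise ValueError("payload exceeds MAX_FRAME")
    return HEADER.pack(len(payload)) + payload


class FrameDecoder:
    """Incrementally split a byte stream back into whole frames."""

    def __init__(self) -> None:
        self._buf = bytearray()

    def feed(self, data: bytes) -> List[bytes]:
        """Buffer `data` and return every frame completed so far."""
        self._buf += data
        frames = []
        while len(self._buf) >= HEADER.size:
            (size,) = HEADER.unpack_from(self._buf)
            if size > MAX_FRAME:
                raise ValueError("announced frame size exceeds MAX_FRAME")
            end = HEADER.size + size
            if len(self._buf) < end:
                break  # body not fully received yet; wait for more bytes
            frames.append(bytes(self._buf[HEADER.size:end]))
            del self._buf[:end]
        return frames
```

In a `Channel` the sender would call `frame()` on each encoded message, and the receive loop would pass whatever bytes come off the socket to `FrameDecoder.feed()` and decode each returned frame. Buffering on our side like this is one alternative to `receive_exactly()`; which one fits better probably depends on how we end up wrapping `trio.SocketStream`.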

