bevy_reflect: Add `ReflectDeserializerProcessor` #15482

aecsocket · 2024-09-27T21:51:31Z

NOTE: Also see #15548 for the serializer equivalent

Objective

The current ReflectDeserializer and TypedReflectDeserializer use the TypeRegistration and/or ReflectDeserialize of a given type in order to determine how to deserialize a value of that type. However, there is currently no way to statefully override deserialization of a given type when using these two deserializers - that is, to have some local data in the same scope as the ReflectDeserializer, and make use of that data when deserializing.

The motivating use case for this came up when working on bevy_animation_graph, when loading an animation graph asset. The AnimationGraph stores Vec<Box<dyn NodeLike>>s which we have to load in. Those Box<dyn NodeLike>s may store Handles to e.g. Handle<AnimationClip>. I want to trigger a load_context.load() for that handle when it's loaded.

#[derive(Reflect)]
struct Animation {
    clips: Vec<Handle<AnimationClip>>,
}

(
    clips: [
        "animation_clips/walk.animclip.ron",
        "animation_clips/run.animclip.ron",
        "animation_clips/jump.animclip.ron",
    ],
)

Currently, if this were deserialized from an asset loader, this would be deserialized as a vec of Handle::default()s, which isn't useful since we also need to load_context.load() those handles for them to be used. With this processor field, a processor can detect when Handle<T>s are being loaded, then actually load them in.

Solution

trait ReflectDeserializerProcessor {
    fn try_deserialize<'de, D>(
        &mut self,
        registration: &TypeRegistration,
        deserializer: D,
    ) -> Result<Result<Box<dyn PartialReflect>, D>, D::Error>
    where
        D: serde::Deserializer<'de>;
}

- pub struct ReflectDeserializer<'a> {
+ pub struct ReflectDeserializer<'a, P = ()> { // also for ReflectTypedDeserializer
      registry: &'a TypeRegistry,
+     processor: Option<&'a mut P>,
  }

impl<'a, P: ReflectDeserializerProcessor> ReflectDeserializer<'a, P> { // also for ReflectTypedDeserializer
    pub fn with_processor(registry: &'a TypeRegistry, processor: &'a mut P) -> Self {
        Self {
            registry,
            processor: Some(processor),
        }
    }
}

This does not touch the existing fn news.
This processor field is also added to all internal visitor structs.

When TypedReflectDeserializer runs, it will first try to deserialize a value of this type by passing the TypeRegistration and deserializer to the processor, and fallback to the default logic. This processor runs the earliest, and takes priority over all other deserialization logic.

Testing

Added unit tests to bevy_reflect::serde::de. Also using almost exactly the same implementation in my fork of bevy_animation_graph.

Migration Guide

(Since I added P = (), I don't think this is actually a breaking change anymore, but I'll leave this in)

bevy_reflect's ReflectDeserializer and TypedReflectDeserializer now take a ReflectDeserializerProcessor as the type parameter P, which allows you to customize deserialization for specific types when they are found. However, the rest of the API surface (new) remains the same.

Original implementation

Add ReflectDeserializerProcessor:

struct ReflectDeserializerProcessor {
    pub can_deserialize: Box<dyn FnMut(&TypeRegistration) -> bool + 'p>,
    pub deserialize: Box<
        dyn FnMut(
                &TypeRegistration,
                &mut dyn erased_serde::Deserializer,
            ) -> Result<Box<dyn PartialReflect>, erased_serde::Error>
            + 'p,
}

Along with ReflectDeserializer::new_with_processor and TypedReflectDeserializer::new_with_processor. This does not touch the public API of the existing new fns.

This is stored as an Option<&mut ReflectDeserializerProcessor> on the deserializer and any of the private -Visitor structs, and when we attempt to deserialize a value, we first pass it through this processor.

Also added a very comprehensive doc test to ReflectDeserializerProcessor, which is actually a scaled down version of the code for the bevy_animation_graph loader. This should give users a good motivating example for when and why to use this feature.

Why `Box<dyn ..>`?

When I originally implemented this, I added a type parameter to ReflectDeserializer to determine the processor used, with () being "no processor". However when using this, I kept running into rustc errors where it failed to validate certain type bounds and led to overflows. I then switched to a dynamic dispatch approach.

The dynamic dispatch should not be that expensive, nor should it be a performance regression, since it's only used if there is Some processor. (Note: I have not benchmarked this, I am just speculating.) Also, it means that we don't infect the rest of the code with an extra type parameter, which is nicer to maintain.

Why the `'p` on `ReflectDeserializerProcessor<'p>`?

Without a lifetime here, the Boxes would automatically become Box<dyn FnMut(..) + 'static>. This makes them practically useless, since any local data you would want to pass in must then be 'static. In the motivating example, you couldn't pass in that &mut LoadContext to the function.

This means that the 'p infects the rest of the Visitor types, but this is acceptable IMO. This PR also elides the lifetimes in the impl<'de> Visitor<'de> for -Visitor blocks where possible.

Future possibilities

I think it's technically possible to turn the processor into a trait, and make the deserializers generic over that trait. This would also open the door to an API like:

type Seed;

fn seed_deserialize(&mut self, r: &TypeRegistration) -> Option<Self::Seed>;

fn deserialize(&mut self, r: &TypeRegistration, d: &mut dyn erased_serde::Deserializer, s: Self::Seed) -> ...;

A similar processor system should also be added to the serialization side, but that's for another PR. Ideally, both PRs will be in the same release, since one isn't very useful without the other.

Testing

Added unit tests to bevy_reflect::serde::de. Also using almost exactly the same implementation in my fork of bevy_animation_graph.

Migration Guide

bevy_reflect's ReflectDeserializer and TypedReflectDeserializer now take a second lifetime parameter 'p for storing the ReflectDeserializerProcessor field lifetimes. However, the rest of the API surface (new) remains the same, so if you are not storing these deserializers or referring to them with lifetimes, you should not have to make any changes.

aecsocket · 2024-09-27T23:09:07Z

Thinking about the current functions that the processor stores, maybe having two separate can_deserialize and deserialize is not a good idea.

Originally I wrote it as one function, but had issues where I would have to pass ownership of the deserializer into the processor, which means I couldn't use it for default deserialization logic later. I then split this into two, and if the first passes, only then does ownership of the deserializer get moved to the processor (and at that point, we ourselves don't need it anymore for default deser logic).

It might be possible to change this to just use one function, which would help ergonomics. Maybe the processor can pass ownership of the deserializer back, if it refuses to deserialize this value? Although then we'd be getting back an erased deserializer, which likely causes problems.

MrGVSV

Overall looks really useful and is a pretty clean implementation! Also, amazing docs!!!

I think this would also pair nicely with #8611 in order to give users a lot more control over reflection deserialization.

Just a few comments.

crates/bevy_reflect/src/serde/de/deserializer.rs

MrGVSV · 2024-09-28T18:26:18Z

Marking this as C-Breaking-Change due to the added lifetimes in the public API.

github-actions · 2024-09-28T18:26:37Z

It looks like your PR is a breaking change, but you didn't provide a migration guide.

Could you add some context on what users should update when this change get released in a new version of Bevy?
It will be used to help writing the migration guide for the version. Putting it after a ## Migration Guide will help it get automatically picked up by our tooling.

aecsocket · 2024-09-28T23:23:45Z

Thinking even harder about the API, what if should_deserialize returned a Box<dyn FnOnce(..)> which performed the actual deserialization?

Benefits:

any state you compute in should_deserialize can be passed immediately to the deserialization fn
if you never make a deserializer fn, you never have to define what it does (I guess this isn't a big deal since you can just unimplemented!())
if you have a processor which does different things for different type registrations, it makes it much cleaner to implement the logic. just return two different fns instead of running the check logic again
- this is the most compelling reason to do it this way

Drawbacks:

you would have to allocate a new box for this fn - if your processor is valid for a lot of fields, this might be bad for performance
- although realistically this is reflect deserializer code which I intend to be used mainly in asset loading - dynamism and flexibility is more important imo

aecsocket · 2024-09-29T12:23:56Z

After a lot more thinking, I've decided to just use static dispatch for the processor, and add a P type parameter:

This is already a breaking change with the 'p lifetime
The 'p lifetime was already infecting the rest of the visitor types, so now it's swapped out for a P
This means we don't have to erase the deserializer when we pass it to the processor
The processor can compute state when figuring out if it should actually deserialize or not, and then use that state in the actual deserialization
A tiny performance improvement probably? Since we're not using dyn dispatch
The processor trait is implemented for () when you don't want any processor

aecsocket · 2024-09-30T10:58:06Z

Ok, I'm finally in a state where I'm a lot more happy with the public API of the processor. This took a lot of wrangling with the type system, since I kept running into infinite recursion like &mut &mut &mut &mut... P. In summary:

ReflectDeserializerProcessor only has a single fn try_deserialize which either returns a value, ownership of the D: Deserializer, or an error
- Cleans up the should_deserialize vs deserialize issues that I complained about before
The ReflectDeserializer, TypedReflectDeserializer and all internal visitor classes take a type parameter P
This is used in processor: Option<&'a mut P>, reusing the existing lifetime 'a already used for the &'a TypeRegistry (you'll see why we need an Option and can't own P...)
We have a no-op impl ReflectDeserializerProcessor for ()
- The actual logic of this doesn't matter, because whenever we have P = (), the processor field will be None. It could even panic!().
fn new(..) returns a deserializer with P = (), but crucially, processor: None
- So that when a user uses the standard new fn, they don't even have to think about the P type at all
fn with_processor(..) (renamed from new_with_processor) takes in a processor: &'a mut P

So why can't we use processor: P and take ownership? Let's assume we did...

The visitor structs may construct new TypedReflectDeserializers when traversing - e.g. a ListVisitor will need to construct multiple typed deserializers to deserialize every value under it.
This means we need some way to hand out the same value of P multiple times - that's a borrow, so let's just pass in processor: &mut self.processor
Then we impl<T: Processor + ?Sized> Processor for &mut T
But what if P is itself a &mut Q? Now we've made a &mut &mut Q
Now since the new TypedReflectDeserializer might instantiate more visitor types, which might instantiate more deserializers, rustc overflows when instantiating types - &mut &mut &mut &mut &mut ...
Therefore, if we store a &'a mut P, we can just reborrow when constructing the child deserializer, and pass processor: self.processor. We're still passing in a &mut P, so no rustc overflows.

What about that Option?

Since the deserializer stores a &mut P, what if we don't want a processor? I.e., we want to give it P = () which has a no-op impl?
Then fn new() wouldn't work:

pub fn new(registration: &'a TypeRegistration, registry: &'a TypeRegistry) -> Self {
    Self {
        registration,
        registry,
        processor: &mut (), // error: this temporary `()` will be dropped
    }
}

So, fn new just sets processor: None
Note that the reborrow processor: self.processor from before won't work with Option. Instead, we use processor: self.processor.as_deref_mut() to reborrow the Option<&mut P>.

Benefits:

try_deserialize is a single fn and much cleaner now
No erased_serde::Deserializer

Drawbacks:

More monomorphisations of the deserializer and visitor types, bigger binary size

crates/bevy_reflect/src/serde/de/deserializer.rs

MrGVSV · 2024-10-04T20:44:08Z

crates/bevy_reflect/src/serde/de/mod.rs

+        #[cfg(feature = "debug_stack")]
+        assert_eq!(
+            error,
+            ron::Error::Message("my custom deserialize error (stack: `i32`)".to_string())
+        );


How does the processor affect the debug_stack for deeper nestings?

For example:

Foo -> Bar (w/ Processor) -> Baz -> i32

Does it print the full stack:

Foo -> Bar -> Baz -> i32

Or does it skip the processed type:

Foo -> Bar -> i32

Or something else?

Because the processor takes ownership of the deserializer here, the processor would have to manually create a ReflectDeserializer::new, resetting the type stack. If the processor overrides deserialization of Bar, it would look like Baz -> i32.
Do we need some smarter mechanism for handling the type stack here?

This also made me realise that if you have a processor (or a DeserializeWithRegistry) impl that creates a new ReflectDeserializer, it won't have the type stack context or the processor context with it. So in the example where we perform extra logic when loading Handles, if one of the reflected types implements DeserializeWithRegistry and creates its own TypedReflectDeserializer, that deserializer won't have the processor, and any handles under that won't be properly loaded.

crates/bevy_reflect/src/serde/de/deserializer.rs

github-actions · 2024-10-05T11:01:21Z

The generated examples/README.md is out of sync with the example metadata in Cargo.toml or the example readme template. Please run cargo run -p build-templated-pages -- update examples to update it, and commit the file change.

aecsocket · 2024-10-05T18:04:13Z

Note about the landscape.png: I don't know if I need to add a license for this asset or something, I noticed that some assets have a License.txt or similar, but this is my own photo that I took, so I guess I'll license it under CC0 if that's suitable for Bevy?

Also Nikko is a really nice place :)

MrGVSV

Looking good! Just one comment about the example.

examples/asset/reflect_deserialized_asset.rs

crates/bevy_reflect/src/serde/de/mod.rs

crates/bevy_reflect/src/serde/de/processor.rs

MrGVSV

I think this is an overall helpful tool to have in one's belt for reflection. And it opens the door to future processor configuration which may be useful.

**NOTE: This is based on, and should be merged alongside, #15482 I'll leave this in draft until that PR is merged. # Objective Equivalent of #15482 but for serialization. See that issue for the motivation. Also part of this tracking issue: #15518 This PR is non-breaking, just like the deserializer PR (because the new type parameter `P` has a default `P = ()`). ## Solution Identical solution to the deserializer PR. ## Testing Added unit tests and a very comprehensive doc test outlining a clear example and use case.

**NOTE: Also see #15548 for the serializer equivalent** # Objective The current `ReflectDeserializer` and `TypedReflectDeserializer` use the `TypeRegistration` and/or `ReflectDeserialize` of a given type in order to determine how to deserialize a value of that type. However, there is currently no way to statefully override deserialization of a given type when using these two deserializers - that is, to have some local data in the same scope as the `ReflectDeserializer`, and make use of that data when deserializing. The motivating use case for this came up when working on [`bevy_animation_graph`](https://github.com/aecsocket/bevy_animation_graph/tree/feat/dynamic-nodes), when loading an animation graph asset. The `AnimationGraph` stores `Vec<Box<dyn NodeLike>>`s which we have to load in. Those `Box<dyn NodeLike>`s may store `Handle`s to e.g. `Handle<AnimationClip>`. I want to trigger a `load_context.load()` for that handle when it's loaded. ```rs #[derive(Reflect)] struct Animation { clips: Vec<Handle<AnimationClip>>, } ``` ```rs ( clips: [ "animation_clips/walk.animclip.ron", "animation_clips/run.animclip.ron", "animation_clips/jump.animclip.ron", ], ) ```` Currently, if this were deserialized from an asset loader, this would be deserialized as a vec of `Handle::default()`s, which isn't useful since we also need to `load_context.load()` those handles for them to be used. With this processor field, a processor can detect when `Handle<T>`s are being loaded, then actually load them in. ## Solution ```rs trait ReflectDeserializerProcessor { fn try_deserialize<'de, D>( &mut self, registration: &TypeRegistration, deserializer: D, ) -> Result<Result<Box<dyn PartialReflect>, D>, D::Error> where D: serde::Deserializer<'de>; } ``` ```diff - pub struct ReflectDeserializer<'a> { + pub struct ReflectDeserializer<'a, P = ()> { // also for ReflectTypedDeserializer registry: &'a TypeRegistry, + processor: Option<&'a mut P>, } ``` ```rs impl<'a, P: ReflectDeserializerProcessor> ReflectDeserializer<'a, P> { // also for ReflectTypedDeserializer pub fn with_processor(registry: &'a TypeRegistry, processor: &'a mut P) -> Self { Self { registry, processor: Some(processor), } } } ``` This does not touch the existing `fn new`s. This `processor` field is also added to all internal visitor structs. When `TypedReflectDeserializer` runs, it will first try to deserialize a value of this type by passing the `TypeRegistration` and deserializer to the processor, and fallback to the default logic. This processor runs the earliest, and takes priority over all other deserialization logic. ## Testing Added unit tests to `bevy_reflect::serde::de`. Also using almost exactly the same implementation in [my fork of `bevy_animation_graph`](https://github.com/aecsocket/bevy_animation_graph/tree/feat/dynamic-nodes). ## Migration Guide (Since I added `P = ()`, I don't think this is actually a breaking change anymore, but I'll leave this in) `bevy_reflect`'s `ReflectDeserializer` and `TypedReflectDeserializer` now take a `ReflectDeserializerProcessor` as the type parameter `P`, which allows you to customize deserialization for specific types when they are found. However, the rest of the API surface (`new`) remains the same. <details> <summary>Original implementation</summary> Add `ReflectDeserializerProcessor`: ```rs struct ReflectDeserializerProcessor { pub can_deserialize: Box<dyn FnMut(&TypeRegistration) -> bool + 'p>, pub deserialize: Box< dyn FnMut( &TypeRegistration, &mut dyn erased_serde::Deserializer, ) -> Result<Box<dyn PartialReflect>, erased_serde::Error> + 'p, } ``` Along with `ReflectDeserializer::new_with_processor` and `TypedReflectDeserializer::new_with_processor`. This does not touch the public API of the existing `new` fns. This is stored as an `Option<&mut ReflectDeserializerProcessor>` on the deserializer and any of the private `-Visitor` structs, and when we attempt to deserialize a value, we first pass it through this processor. Also added a very comprehensive doc test to `ReflectDeserializerProcessor`, which is actually a scaled down version of the code for the `bevy_animation_graph` loader. This should give users a good motivating example for when and why to use this feature. ### Why `Box<dyn ..>`? When I originally implemented this, I added a type parameter to `ReflectDeserializer` to determine the processor used, with `()` being "no processor". However when using this, I kept running into rustc errors where it failed to validate certain type bounds and led to overflows. I then switched to a dynamic dispatch approach. The dynamic dispatch should not be that expensive, nor should it be a performance regression, since it's only used if there is `Some` processor. (Note: I have not benchmarked this, I am just speculating.) Also, it means that we don't infect the rest of the code with an extra type parameter, which is nicer to maintain. ### Why the `'p` on `ReflectDeserializerProcessor<'p>`? Without a lifetime here, the `Box`es would automatically become `Box<dyn FnMut(..) + 'static>`. This makes them practically useless, since any local data you would want to pass in must then be `'static`. In the motivating example, you couldn't pass in that `&mut LoadContext` to the function. This means that the `'p` infects the rest of the Visitor types, but this is acceptable IMO. This PR also elides the lifetimes in the `impl<'de> Visitor<'de> for -Visitor` blocks where possible. ### Future possibilities I think it's technically possible to turn the processor into a trait, and make the deserializers generic over that trait. This would also open the door to an API like: ```rs type Seed; fn seed_deserialize(&mut self, r: &TypeRegistration) -> Option<Self::Seed>; fn deserialize(&mut self, r: &TypeRegistration, d: &mut dyn erased_serde::Deserializer, s: Self::Seed) -> ...; ``` A similar processor system should also be added to the serialization side, but that's for another PR. Ideally, both PRs will be in the same release, since one isn't very useful without the other. ## Testing Added unit tests to `bevy_reflect::serde::de`. Also using almost exactly the same implementation in [my fork of `bevy_animation_graph`](https://github.com/aecsocket/bevy_animation_graph/tree/feat/dynamic-nodes). ## Migration Guide `bevy_reflect`'s `ReflectDeserializer` and `TypedReflectDeserializer` now take a second lifetime parameter `'p` for storing the `ReflectDeserializerProcessor` field lifetimes. However, the rest of the API surface (`new`) remains the same, so if you are not storing these deserializers or referring to them with lifetimes, you should not have to make any changes. </details>

**NOTE: This is based on, and should be merged alongside, #15482 I'll leave this in draft until that PR is merged. # Objective Equivalent of #15482 but for serialization. See that issue for the motivation. Also part of this tracking issue: #15518 This PR is non-breaking, just like the deserializer PR (because the new type parameter `P` has a default `P = ()`). ## Solution Identical solution to the deserializer PR. ## Testing Added unit tests and a very comprehensive doc test outlining a clear example and use case.

**NOTE: Also see bevyengine#15548 for the serializer equivalent** # Objective The current `ReflectDeserializer` and `TypedReflectDeserializer` use the `TypeRegistration` and/or `ReflectDeserialize` of a given type in order to determine how to deserialize a value of that type. However, there is currently no way to statefully override deserialization of a given type when using these two deserializers - that is, to have some local data in the same scope as the `ReflectDeserializer`, and make use of that data when deserializing. The motivating use case for this came up when working on [`bevy_animation_graph`](https://github.com/aecsocket/bevy_animation_graph/tree/feat/dynamic-nodes), when loading an animation graph asset. The `AnimationGraph` stores `Vec<Box<dyn NodeLike>>`s which we have to load in. Those `Box<dyn NodeLike>`s may store `Handle`s to e.g. `Handle<AnimationClip>`. I want to trigger a `load_context.load()` for that handle when it's loaded. ```rs #[derive(Reflect)] struct Animation { clips: Vec<Handle<AnimationClip>>, } ``` ```rs ( clips: [ "animation_clips/walk.animclip.ron", "animation_clips/run.animclip.ron", "animation_clips/jump.animclip.ron", ], ) ```` Currently, if this were deserialized from an asset loader, this would be deserialized as a vec of `Handle::default()`s, which isn't useful since we also need to `load_context.load()` those handles for them to be used. With this processor field, a processor can detect when `Handle<T>`s are being loaded, then actually load them in. ## Solution ```rs trait ReflectDeserializerProcessor { fn try_deserialize<'de, D>( &mut self, registration: &TypeRegistration, deserializer: D, ) -> Result<Result<Box<dyn PartialReflect>, D>, D::Error> where D: serde::Deserializer<'de>; } ``` ```diff - pub struct ReflectDeserializer<'a> { + pub struct ReflectDeserializer<'a, P = ()> { // also for ReflectTypedDeserializer registry: &'a TypeRegistry, + processor: Option<&'a mut P>, } ``` ```rs impl<'a, P: ReflectDeserializerProcessor> ReflectDeserializer<'a, P> { // also for ReflectTypedDeserializer pub fn with_processor(registry: &'a TypeRegistry, processor: &'a mut P) -> Self { Self { registry, processor: Some(processor), } } } ``` This does not touch the existing `fn new`s. This `processor` field is also added to all internal visitor structs. When `TypedReflectDeserializer` runs, it will first try to deserialize a value of this type by passing the `TypeRegistration` and deserializer to the processor, and fallback to the default logic. This processor runs the earliest, and takes priority over all other deserialization logic. ## Testing Added unit tests to `bevy_reflect::serde::de`. Also using almost exactly the same implementation in [my fork of `bevy_animation_graph`](https://github.com/aecsocket/bevy_animation_graph/tree/feat/dynamic-nodes). ## Migration Guide (Since I added `P = ()`, I don't think this is actually a breaking change anymore, but I'll leave this in) `bevy_reflect`'s `ReflectDeserializer` and `TypedReflectDeserializer` now take a `ReflectDeserializerProcessor` as the type parameter `P`, which allows you to customize deserialization for specific types when they are found. However, the rest of the API surface (`new`) remains the same. <details> <summary>Original implementation</summary> Add `ReflectDeserializerProcessor`: ```rs struct ReflectDeserializerProcessor { pub can_deserialize: Box<dyn FnMut(&TypeRegistration) -> bool + 'p>, pub deserialize: Box< dyn FnMut( &TypeRegistration, &mut dyn erased_serde::Deserializer, ) -> Result<Box<dyn PartialReflect>, erased_serde::Error> + 'p, } ``` Along with `ReflectDeserializer::new_with_processor` and `TypedReflectDeserializer::new_with_processor`. This does not touch the public API of the existing `new` fns. This is stored as an `Option<&mut ReflectDeserializerProcessor>` on the deserializer and any of the private `-Visitor` structs, and when we attempt to deserialize a value, we first pass it through this processor. Also added a very comprehensive doc test to `ReflectDeserializerProcessor`, which is actually a scaled down version of the code for the `bevy_animation_graph` loader. This should give users a good motivating example for when and why to use this feature. ### Why `Box<dyn ..>`? When I originally implemented this, I added a type parameter to `ReflectDeserializer` to determine the processor used, with `()` being "no processor". However when using this, I kept running into rustc errors where it failed to validate certain type bounds and led to overflows. I then switched to a dynamic dispatch approach. The dynamic dispatch should not be that expensive, nor should it be a performance regression, since it's only used if there is `Some` processor. (Note: I have not benchmarked this, I am just speculating.) Also, it means that we don't infect the rest of the code with an extra type parameter, which is nicer to maintain. ### Why the `'p` on `ReflectDeserializerProcessor<'p>`? Without a lifetime here, the `Box`es would automatically become `Box<dyn FnMut(..) + 'static>`. This makes them practically useless, since any local data you would want to pass in must then be `'static`. In the motivating example, you couldn't pass in that `&mut LoadContext` to the function. This means that the `'p` infects the rest of the Visitor types, but this is acceptable IMO. This PR also elides the lifetimes in the `impl<'de> Visitor<'de> for -Visitor` blocks where possible. ### Future possibilities I think it's technically possible to turn the processor into a trait, and make the deserializers generic over that trait. This would also open the door to an API like: ```rs type Seed; fn seed_deserialize(&mut self, r: &TypeRegistration) -> Option<Self::Seed>; fn deserialize(&mut self, r: &TypeRegistration, d: &mut dyn erased_serde::Deserializer, s: Self::Seed) -> ...; ``` A similar processor system should also be added to the serialization side, but that's for another PR. Ideally, both PRs will be in the same release, since one isn't very useful without the other. ## Testing Added unit tests to `bevy_reflect::serde::de`. Also using almost exactly the same implementation in [my fork of `bevy_animation_graph`](https://github.com/aecsocket/bevy_animation_graph/tree/feat/dynamic-nodes). ## Migration Guide `bevy_reflect`'s `ReflectDeserializer` and `TypedReflectDeserializer` now take a second lifetime parameter `'p` for storing the `ReflectDeserializerProcessor` field lifetimes. However, the rest of the API surface (`new`) remains the same, so if you are not storing these deserializers or referring to them with lifetimes, you should not have to make any changes. </details>

**NOTE: This is based on, and should be merged alongside, bevyengine#15482 I'll leave this in draft until that PR is merged. # Objective Equivalent of bevyengine#15482 but for serialization. See that issue for the motivation. Also part of this tracking issue: bevyengine#15518 This PR is non-breaking, just like the deserializer PR (because the new type parameter `P` has a default `P = ()`). ## Solution Identical solution to the deserializer PR. ## Testing Added unit tests and a very comprehensive doc test outlining a clear example and use case.

**NOTE: Also see bevyengine#15548 for the serializer equivalent** # Objective The current `ReflectDeserializer` and `TypedReflectDeserializer` use the `TypeRegistration` and/or `ReflectDeserialize` of a given type in order to determine how to deserialize a value of that type. However, there is currently no way to statefully override deserialization of a given type when using these two deserializers - that is, to have some local data in the same scope as the `ReflectDeserializer`, and make use of that data when deserializing. The motivating use case for this came up when working on [`bevy_animation_graph`](https://github.com/aecsocket/bevy_animation_graph/tree/feat/dynamic-nodes), when loading an animation graph asset. The `AnimationGraph` stores `Vec<Box<dyn NodeLike>>`s which we have to load in. Those `Box<dyn NodeLike>`s may store `Handle`s to e.g. `Handle<AnimationClip>`. I want to trigger a `load_context.load()` for that handle when it's loaded. ```rs #[derive(Reflect)] struct Animation { clips: Vec<Handle<AnimationClip>>, } ``` ```rs ( clips: [ "animation_clips/walk.animclip.ron", "animation_clips/run.animclip.ron", "animation_clips/jump.animclip.ron", ], ) ```` Currently, if this were deserialized from an asset loader, this would be deserialized as a vec of `Handle::default()`s, which isn't useful since we also need to `load_context.load()` those handles for them to be used. With this processor field, a processor can detect when `Handle<T>`s are being loaded, then actually load them in. ## Solution ```rs trait ReflectDeserializerProcessor { fn try_deserialize<'de, D>( &mut self, registration: &TypeRegistration, deserializer: D, ) -> Result<Result<Box<dyn PartialReflect>, D>, D::Error> where D: serde::Deserializer<'de>; } ``` ```diff - pub struct ReflectDeserializer<'a> { + pub struct ReflectDeserializer<'a, P = ()> { // also for ReflectTypedDeserializer registry: &'a TypeRegistry, + processor: Option<&'a mut P>, } ``` ```rs impl<'a, P: ReflectDeserializerProcessor> ReflectDeserializer<'a, P> { // also for ReflectTypedDeserializer pub fn with_processor(registry: &'a TypeRegistry, processor: &'a mut P) -> Self { Self { registry, processor: Some(processor), } } } ``` This does not touch the existing `fn new`s. This `processor` field is also added to all internal visitor structs. When `TypedReflectDeserializer` runs, it will first try to deserialize a value of this type by passing the `TypeRegistration` and deserializer to the processor, and fallback to the default logic. This processor runs the earliest, and takes priority over all other deserialization logic. ## Testing Added unit tests to `bevy_reflect::serde::de`. Also using almost exactly the same implementation in [my fork of `bevy_animation_graph`](https://github.com/aecsocket/bevy_animation_graph/tree/feat/dynamic-nodes). ## Migration Guide (Since I added `P = ()`, I don't think this is actually a breaking change anymore, but I'll leave this in) `bevy_reflect`'s `ReflectDeserializer` and `TypedReflectDeserializer` now take a `ReflectDeserializerProcessor` as the type parameter `P`, which allows you to customize deserialization for specific types when they are found. However, the rest of the API surface (`new`) remains the same. <details> <summary>Original implementation</summary> Add `ReflectDeserializerProcessor`: ```rs struct ReflectDeserializerProcessor { pub can_deserialize: Box<dyn FnMut(&TypeRegistration) -> bool + 'p>, pub deserialize: Box< dyn FnMut( &TypeRegistration, &mut dyn erased_serde::Deserializer, ) -> Result<Box<dyn PartialReflect>, erased_serde::Error> + 'p, } ``` Along with `ReflectDeserializer::new_with_processor` and `TypedReflectDeserializer::new_with_processor`. This does not touch the public API of the existing `new` fns. This is stored as an `Option<&mut ReflectDeserializerProcessor>` on the deserializer and any of the private `-Visitor` structs, and when we attempt to deserialize a value, we first pass it through this processor. Also added a very comprehensive doc test to `ReflectDeserializerProcessor`, which is actually a scaled down version of the code for the `bevy_animation_graph` loader. This should give users a good motivating example for when and why to use this feature. ### Why `Box<dyn ..>`? When I originally implemented this, I added a type parameter to `ReflectDeserializer` to determine the processor used, with `()` being "no processor". However when using this, I kept running into rustc errors where it failed to validate certain type bounds and led to overflows. I then switched to a dynamic dispatch approach. The dynamic dispatch should not be that expensive, nor should it be a performance regression, since it's only used if there is `Some` processor. (Note: I have not benchmarked this, I am just speculating.) Also, it means that we don't infect the rest of the code with an extra type parameter, which is nicer to maintain. ### Why the `'p` on `ReflectDeserializerProcessor<'p>`? Without a lifetime here, the `Box`es would automatically become `Box<dyn FnMut(..) + 'static>`. This makes them practically useless, since any local data you would want to pass in must then be `'static`. In the motivating example, you couldn't pass in that `&mut LoadContext` to the function. This means that the `'p` infects the rest of the Visitor types, but this is acceptable IMO. This PR also elides the lifetimes in the `impl<'de> Visitor<'de> for -Visitor` blocks where possible. ### Future possibilities I think it's technically possible to turn the processor into a trait, and make the deserializers generic over that trait. This would also open the door to an API like: ```rs type Seed; fn seed_deserialize(&mut self, r: &TypeRegistration) -> Option<Self::Seed>; fn deserialize(&mut self, r: &TypeRegistration, d: &mut dyn erased_serde::Deserializer, s: Self::Seed) -> ...; ``` A similar processor system should also be added to the serialization side, but that's for another PR. Ideally, both PRs will be in the same release, since one isn't very useful without the other. ## Testing Added unit tests to `bevy_reflect::serde::de`. Also using almost exactly the same implementation in [my fork of `bevy_animation_graph`](https://github.com/aecsocket/bevy_animation_graph/tree/feat/dynamic-nodes). ## Migration Guide `bevy_reflect`'s `ReflectDeserializer` and `TypedReflectDeserializer` now take a second lifetime parameter `'p` for storing the `ReflectDeserializerProcessor` field lifetimes. However, the rest of the API surface (`new`) remains the same, so if you are not storing these deserializers or referring to them with lifetimes, you should not have to make any changes. </details>

**NOTE: This is based on, and should be merged alongside, bevyengine#15482 I'll leave this in draft until that PR is merged. # Objective Equivalent of bevyengine#15482 but for serialization. See that issue for the motivation. Also part of this tracking issue: bevyengine#15518 This PR is non-breaking, just like the deserializer PR (because the new type parameter `P` has a default `P = ()`). ## Solution Identical solution to the deserializer PR. ## Testing Added unit tests and a very comprehensive doc test outlining a clear example and use case.

aecsocket force-pushed the feat/reflect-serde branch from 847aa05 to 37151e6 Compare September 27, 2024 22:07

MrGVSV self-requested a review September 27, 2024 23:41

aecsocket force-pushed the feat/reflect-serde branch from 37151e6 to 4f8b666 Compare September 28, 2024 16:26

MrGVSV reviewed Sep 28, 2024

View reviewed changes

MrGVSV added the M-Migration-Guide A breaking change to Bevy's public API that needs to be noted in a migration guide label Sep 28, 2024

aecsocket mentioned this pull request Sep 29, 2024

Meta-issue for improving reflect-deserialized asset loading #15518

Open

aecsocket force-pushed the feat/reflect-serde branch from 0981e38 to c335283 Compare September 30, 2024 10:42

alice-i-cecile removed the M-Migration-Guide A breaking change to Bevy's public API that needs to be noted in a migration guide label Sep 30, 2024

aecsocket mentioned this pull request Sep 30, 2024

bevy_reflect: Add ReflectSerializerProcessor #15548

Merged

MrGVSV reviewed Oct 4, 2024

View reviewed changes

aecsocket force-pushed the feat/reflect-serde branch 2 times, most recently from 15f1a7d to 6a63ff5 Compare October 5, 2024 10:48

aecsocket force-pushed the feat/reflect-serde branch from 4c85135 to b6ba833 Compare October 12, 2024 14:58

MrGVSV reviewed Oct 15, 2024

View reviewed changes

examples/asset/reflect_deserialized_asset.rs Outdated Show resolved Hide resolved

crates/bevy_reflect/src/serde/de/mod.rs Outdated Show resolved Hide resolved

crates/bevy_reflect/src/serde/de/processor.rs Show resolved Hide resolved

aecsocket force-pushed the feat/reflect-serde branch from 38fbfd7 to a59cae9 Compare October 29, 2024 09:12

BenjaminBrienen approved these changes Oct 31, 2024

View reviewed changes

aecsocket added 2 commits October 31, 2024 17:57

Add ReflectDeserializerProcessor

9d59375

fix clippy

8f5cd7b

aecsocket added 11 commits October 31, 2024 17:57

fix docs

066d722

wip: image pipeline example

63dae42

working logic

56c7e17

Update examples/README.md

e76e63d

Add nicer image asset for image processor example

18069a9

fix up rebase

9e447be

further fix up rebase

27a3183

remove as_refs

b00d98d

add more processor docs

7a652af

remove example

aa0d89b

switch to incorrect spelling

7faf965

aecsocket force-pushed the feat/reflect-serde branch from da61336 to 7faf965 Compare October 31, 2024 17:57

mbrea-c added a commit to aecsocket/bevy_animation_graph that referenced this pull request Nov 1, 2024

WIP migration: Add reflect de/serializer code from bevyengine/bevy#15482

225c5ca

MrGVSV approved these changes Nov 7, 2024

View reviewed changes

MrGVSV added S-Ready-For-Final-Review This PR has been approved by the community. It's ready for a maintainer to consider merging it and removed S-Needs-Review Needs reviewer attention (from anyone!) to move forward labels Nov 7, 2024

mockersf added this pull request to the merge queue Nov 11, 2024

Merged via the queue into bevyengine:main with commit 57931ce Nov 11, 2024
30 checks passed

alice-i-cecile added this to the 0.15 milestone Nov 17, 2024

aecsocket mentioned this pull request Nov 18, 2024

0.15: ReflectDe/SerializerProcessor bevyengine/bevy-website#1818

Merged

rparrett mentioned this pull request Nov 29, 2024

0.15 migration: Fix other guides getting swallowed by broken formatting bevyengine/bevy-website#1881

Closed

ElliottjPierce mentioned this pull request Mar 26, 2025

Clean up incomplete migration guides tracking issue bevyengine/bevy-website#2014

Open

16 tasks

Uh oh!

bevy_reflect: Add ReflectDeserializerProcessor #15482

bevy_reflect: Add ReflectDeserializerProcessor #15482

Uh oh!

Conversation

aecsocket commented Sep 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Objective

Solution

Testing

Migration Guide

Why Box<dyn ..>?

Why the 'p on ReflectDeserializerProcessor<'p>?

Future possibilities

Testing

Migration Guide

Uh oh!

aecsocket commented Sep 27, 2024

Uh oh!

MrGVSV left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MrGVSV commented Sep 28, 2024

Uh oh!

github-actions bot commented Sep 28, 2024

Uh oh!

aecsocket commented Sep 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aecsocket commented Sep 29, 2024

Uh oh!

aecsocket commented Sep 30, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MrGVSV Oct 4, 2024

Choose a reason for hiding this comment

Uh oh!

aecsocket Oct 4, 2024

Choose a reason for hiding this comment

Uh oh!

aecsocket Oct 4, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Oct 5, 2024

Uh oh!

aecsocket commented Oct 5, 2024

Uh oh!

MrGVSV left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MrGVSV left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

bevy_reflect: Add `ReflectDeserializerProcessor` #15482

bevy_reflect: Add `ReflectDeserializerProcessor` #15482

aecsocket commented Sep 27, 2024 •

edited

Loading

Why `Box<dyn ..>`?

Why the `'p` on `ReflectDeserializerProcessor<'p>`?

MrGVSV left a comment •

edited

Loading

aecsocket commented Sep 28, 2024 •

edited

Loading