Refactor type readiness check to avoid `NullReferenceException` #3121

Leh2 · 2024-04-09T12:50:57Z

We encountered a issue where our service would throw a NullReferenceException during startup, necessitating a reboot for recovery. This problem arises particularly when multiple operations execute in parallel at startup, such as Events.AggregateStreamAsync<T> and Events.Append(...) + SaveChanges. This situation triggers the compilation/build of generated types through different execution paths but not in a synchronized manner.

Through debugging, we observed that the setters for our aggregate in the generated document were not set when the error occurred. This led us to scrutinize the compile/build code, particularly the assembleTypes method, where the essential processing happens. We identified a specific flaw where the lock code checks the _liveGeneratedType property, but the setters are assigned afterward. This discrepancy creates a window where the code incorrectly assumes the compilation/build process is complete, proceeding to operations that depend on setters that have not yet been initialized.

The fix involves changing the source of truth for checking if types are ready from _liveType and _liveGeneratedType (which are set mid-process and hence unreliable) to a more dependable flag: _hasGenerated. This change ensures a more accurate and consistent readiness check, preventing the premature usage of types.

Example from our debug where we can see that BuildLiveAggregator is called before the setters are ready ("Setters is now set").

8: Starting building inline projections
8: generateIfNecessary
8: got _assembleLocker
8: Generating
8: tryAttachTypes
8: BuildLiveAggregationType thread
8: _liveGeneratedType is now set
5: BuildLiveAggregator: Pre BuildLiveAggregator called
5: BuildLiveAggregator: _liveType is not null. _liveGeneratedType have value: True
5: BuildLiveAggregator: for Marten.Generated.SingleStreamProjectionLiveAggregation525137164: Setters count 0
8: Setters is now set

The exception:

System.NullReferenceException: Object reference not set to an instance of an object.
   at XxxAggregate Marten.Generated.EventStore.SingleStreamProjectionLiveAggregation792320531.Apply(IEvent event, XxxAggregate aggregate, IQuerySession session) in /source/xxx.Service/Internal/Generated/EventStore/SingleStreamProjectionRuntimeSupport792320531.cs:line 92
   at XxxAggregate Marten.Generated.EventStore.SingleStreamProjectionLiveAggregation792320531.Build(IReadOnlyList<IEvent> events, IQuerySession session, XxxAggregate snapshot) in /source/xxx.Service/Internal/Generated/EventStore/SingleStreamProjectionRuntimeSupport792320531.cs:line 52
   at ValueTask<T> Marten.Events.Aggregation.SyncLiveAggregatorBase<T>.BuildAsync(IReadOnlyList<IEvent> events, IQuerySession session, T snapshot, CancellationToken cancellation)
   at async ValueTask<T> Marten.Events.Aggregation.AggregateVersioning<T>.BuildAsync(IReadOnlyList<IEvent> events, IQuerySession session, T snapshot, CancellationToken cancellation)
   at async Task<T> Marten.Events.QueryEventStore.AggregateStreamAsync<T>(Guid streamId, long version, DateTimeOffset? timestamp, T state, long fromVersion, CancellationToken token)
   at async Task<TAggregate> xxx.RepositoryV2.Load<TAggregate>(Guid id, long version) in /_/Xxx.EventSourcing.MartenV2/RepositoryV2.cs:line 49

Test
I managed to create a reproducible test through a workaround that involved inserting a sleep immediately after setting _liveGeneratedType. This approach also necessitated having events pre-persisted before executing the test to simulate the real-world scenario accurately. While this test is not included, it is accessible for review at the following URL: Leh2@246c83c.

It's important to note that in our implementation, we leverage private apply methods, which exhibit somewhat different behavior in the generated document.

Previously, the readiness check relied on the null status of `_liveType` and `_liveGeneratedType`, which proved unreliable as these variables are set mid-process.

Leh2 added 5 commits April 9, 2024 12:53

Add defensive validations

2fa4086

Refactor type readiness check to use _hasGenerated

6164de1

Previously, the readiness check relied on the null status of `_liveType` and `_liveGeneratedType`, which proved unreliable as these variables are set mid-process.

Double lock on _assembleLocker has no effect

d51a627

Format

48d2cdc

Reuse _assembleLocker

f94e9b8

Leh2 changed the title ~~Refactor type readiness check to avoid `NullReferenceException~~ Refactor type readiness check to avoid NullReferenceException Apr 9, 2024

Leh2 marked this pull request as ready for review April 9, 2024 13:11

jeremydmiller merged commit 36afb2a into JasperFx:master Apr 10, 2024
11 checks passed

Baune8D mentioned this pull request Apr 22, 2024

Add .NET 8 compatibility with Marten 3.x #3152

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor type readiness check to avoid `NullReferenceException` #3121

Refactor type readiness check to avoid `NullReferenceException` #3121

Leh2 commented Apr 9, 2024

Refactor type readiness check to avoid NullReferenceException #3121

Refactor type readiness check to avoid NullReferenceException #3121

Conversation

Leh2 commented Apr 9, 2024

Refactor type readiness check to avoid `NullReferenceException` #3121

Refactor type readiness check to avoid `NullReferenceException` #3121