CSMC & Sparse Particle Storage with RB and GPU-acceleration #22

THargreaves · 2024-10-29T17:15:49Z

Very much a draft: this tightens up and tests the CSMC interface. I just wanted to check that it technically works.

Contributions and comments welcomed.

TODO:

On the last point, the callback is not run after initialisation, so x0 of the reference trajectory is not stored.

Use offset arrays to include initial state in reference trajectory. Implement Kalman smoother for use in unit tests.

THargreaves · 2024-11-04T16:41:29Z

Looks like the tests before were passing by chance. I've made a few changes so that we now have a valid unit test that passes:

Implemented RTS smoother and wrote unit test
Used OffsetVectors to include x0 in reference trajectory
Corrected ancestry code
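For readers unfamiliar with the RTS smoother used as the ground truth here, a minimal sketch follows. It is not the code from this PR: the function name `rts_smoother` and its argument names are illustrative, and it stores the initial state x0 in slot 1 of a plain vector (time t lives at index t + 1) rather than using an OffsetVector, purely to stay within the standard library.

```julia
using LinearAlgebra

# Linear Gaussian model (assumed for illustration):
#   x[t] = A * x[t-1] + N(0, Q),   y[t] = H * x[t] + N(0, R),   x[0] ~ N(μ0, Σ0)
function rts_smoother(μ0, Σ0, A, Q, H, R, ys)
    T = length(ys)
    # Index t + 1 holds time t, so slot 1 is the initial state x0.
    μf = Vector{Vector{Float64}}(undef, T + 1)   # filtered means
    Σf = Vector{Matrix{Float64}}(undef, T + 1)   # filtered covariances
    μp = Vector{Vector{Float64}}(undef, T + 1)   # predicted means (slot 1 unused)
    Σp = Vector{Matrix{Float64}}(undef, T + 1)   # predicted covariances (slot 1 unused)
    μf[1], Σf[1] = μ0, Σ0

    # Forward Kalman filter pass
    for t in 1:T
        μp[t + 1] = A * μf[t]
        Σp[t + 1] = A * Σf[t] * A' + Q
        S = H * Σp[t + 1] * H' + R
        K = Σp[t + 1] * H' / S
        μf[t + 1] = μp[t + 1] + K * (ys[t] - H * μp[t + 1])
        Σf[t + 1] = (I - K * H) * Σp[t + 1]
    end

    # Backward Rauch-Tung-Striebel pass, down to and including x0
    μs, Σs = copy(μf), copy(Σf)
    for t in T:-1:1
        G = Σf[t] * A' / Σp[t + 1]
        μs[t] = μf[t] + G * (μs[t + 1] - μp[t + 1])
        Σs[t] = Σf[t] + G * (Σs[t + 1] - Σp[t + 1]) * G'
    end
    return μs, Σs
end
```

The first entry of the returned vectors is the smoothed distribution of x0, which is the quantity the reference trajectory needs once x0 is included.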

The first change required modifying the GaussianContainer to have proposal/filtered states, which breaks the RBPF test (though the others still pass) as discussed in #14.

I think the way around this might be to have initialise/update/predict act on the state itself, not the container of proposal/filtered states. The step method can then map these to the container (if the container is even needed at all).

THargreaves · 2024-11-13T13:32:13Z

Code is very messy at this point but...it works 🙌

Have a unit test comparing GPU-accelerated, Rao-Blackwellised CSMC to RTS on a dummy Gaussian problem and the smoothing distributions match.
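As a point of reference for what such a test exercises, here is a rough CPU-only sketch of a single conditional SMC sweep on a scalar Gaussian model. It is not Rao-Blackwellised, not GPU-accelerated, and not the implementation in this PR: `csmc_sweep`, the model parameters `a`, `q`, `r`, and the convention of pinning the reference trajectory to particle slot `N` are all assumptions made for the example.

```julia
using Random

# Assumed scalar model: x0 ~ N(0, 1), x[t] = a*x[t-1] + N(0, q^2), y[t] = x[t] + N(0, r^2).
# `ref` has length T + 1, with ref[1] holding x0. Slot N always carries the reference.
function csmc_sweep(rng, ys, ref, N; a=0.9, q=1.0, r=1.0)
    T = length(ys)
    xs = Matrix{Float64}(undef, N, T + 1)   # column t + 1 holds the particles at time t
    anc = Matrix{Int}(undef, N, T)          # anc[i, t]: parent (at time t-1) of particle i at time t
    xs[1:(N - 1), 1] .= randn(rng, N - 1)
    xs[N, 1] = ref[1]
    logw = zeros(N)                         # no observation at time 0, so uniform weights

    for t in 1:T
        w = exp.(logw .- maximum(logw))
        cw = cumsum(w ./ sum(w))
        # Multinomial resampling and propagation for the N - 1 free particles
        for i in 1:(N - 1)
            anc[i, t] = min(searchsortedfirst(cw, rand(rng)), N)
            xs[i, t + 1] = a * xs[anc[i, t], t] + q * randn(rng)
        end
        # The reference particle keeps its own lineage
        anc[N, t] = N
        xs[N, t + 1] = ref[t + 1]
        logw .= -0.5 .* ((ys[t] .- xs[:, t + 1]) ./ r) .^ 2
    end

    # Draw a final particle and trace its ancestry back to x0
    w = exp.(logw .- maximum(logw))
    cw = cumsum(w ./ sum(w))
    k = min(searchsortedfirst(cw, rand(rng)), N)
    traj = Vector{Float64}(undef, T + 1)
    traj[T + 1] = xs[k, T + 1]
    for t in T:-1:1
        k = anc[k, t]
        traj[t] = xs[k, t]
    end
    return traj
end
```

Iterating this sweep, feeding each returned trajectory back in as `ref`, gives samples whose marginals can be compared against the RTS smoothing means and variances.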

I will probably clean up once we chat on Friday to discuss interface changes.

The biggest long-term issue is that I'm storing the reference trajectories essentially as a vector of (D x 1) CuArrays (i.e. particle containers of size 1). This is probably not very efficient.

Maybe it's best to fall back to the CPU for the reference trajectory handling, since we're no longer in a parallel setting. That said, I've been wondering whether a single reference trajectory is always optimal for CSMC/PG. The theory works the same with any number, and the GPU would be useful if we had several (the ancestries can be computed in parallel).

THargreaves · 2024-11-15T18:34:38Z

Here's the PDF detailing the changes involved in this PR.

GeneralisedFilters.jl Containers.pdf

THargreaves · 2024-11-19T15:27:14Z

The last commit starts the conversion of the interface to act on distributions rather than the combined intermediate storage.

With this change the Kalman smoother, particle filter and RBPF are all simultaneously passing.

I apologise that this has made some other parts of the code a bit clunkier and may get in the way of some of your ideas. I don't see this as a final version though so happy to make changes to make things more elegant provided it still allows for the RBPF to work nicely.

Some of the main changes that were required:

Added an AbstractParticleFilter type with a modified step method that performs the resampling itself, rather than inside the predict step (where ancestors is not available).
Although update_ref still takes place within initialise/predict, the ancestor indices are updated in this custom step, which is a bit clunky.
Added an instantiate method which creates the intermediate storage before it gets populated by initialise.
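To make the first change concrete, the control flow can be sketched roughly as below. This is only an illustration of resampling inside the step rather than inside predict; `ToyBootstrap`, `filter_step`, and the signatures of `resample`, `predict` and `update` are made up for the example and do not match the package's actual methods.

```julia
using Random

abstract type AbstractParticleFilter end

# Toy concrete filter (hypothetical), just enough to run the step below.
struct ToyBootstrap <: AbstractParticleFilter
    N::Int
end

# Multinomial resampling: returns one ancestor index per particle.
function resample(rng, logw)
    w = exp.(logw .- maximum(logw))
    cw = cumsum(w ./ sum(w))
    return [min(searchsortedfirst(cw, rand(rng)), length(cw)) for _ in eachindex(cw)]
end

# predict and update act on plain particle collections and know nothing about ancestry.
predict(rng, ::ToyBootstrap, xs) = 0.9 .* xs .+ randn(rng, length(xs))
update(::ToyBootstrap, xs, y) = -0.5 .* (y .- xs) .^ 2   # returns new log-weights

# The particle-filter step owns the resampling, so the ancestor indices are
# available here for storage or callbacks.
function filter_step(rng, alg::AbstractParticleFilter, xs, logw, y)
    ancestors = resample(rng, logw)
    proposed = predict(rng, alg, xs[ancestors])
    new_logw = update(alg, proposed, y)
    return proposed, new_logw, ancestors
end
```

Keeping `ancestors` as an explicit return value of the step is one way to hand it to an ancestry callback without `predict` needing to know about it.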

I'll spend the rest of the day making sure the other tests pass and introducing the other changes we discussed last Friday.

Thank you for your patience whilst I make these quite clunky changes. The fast GPU-RB-PGAS should make it all worth it though!

charlesknipp

I like what I see so far as long as unit tests are passing across the board. I also want to make sure this code is type stable since this could be a huge bottleneck in terms of performance.

charlesknipp · 2024-11-19T22:01:45Z

Project.toml

 StatsBase = "2913bbd2-ae8a-5f71-8c99-4fb6c76f3a91"

 [compat]
 DataStructures = "0.18.20"
 GaussianDistributions = "0.5.2"
+OffsetArrays = "1.14.1"
+Statistics = "1.11.1"


This is a little odd. I was able to remove this on my work computer (which is stuck on Julia 1.10.4).

charlesknipp · 2024-11-19T23:06:21Z

src/containers.jl

+mutable struct Intermediate
+    proposed::Any
+    filtered::Any
+    ancestors::Any
+    Intermediate() = new()
+end


I'm not too keen on the idea of sacrificing type stability for convenience. If we can ensure that the RBPF is type stable with Intermediate that would be ideal.

I agree. I think we can compute these types at instantiation time, it'll just be a bit of a faff. It's basically a generalisation of the rb_type used with the CPU.
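One possible shape for this, sketched under assumptions: make the struct parametric and fix the parameters when the storage is instantiated. `TypedIntermediate` and the helper `typed_intermediate` are hypothetical names, and using a prototype state to determine the element type is just one option.

```julia
# Parametric variant of Intermediate: every field has a concrete type once constructed.
mutable struct TypedIntermediate{PT,FT,AT}
    proposed::PT
    filtered::FT
    ancestors::AT
end

# Hypothetical helper: the type S of a prototype per-particle state decides the storage types.
function typed_intermediate(proto::S, N::Integer) where {S}
    proposed = Vector{S}(undef, N)
    filtered = Vector{S}(undef, N)
    ancestors = Vector{Int}(undef, N)
    return TypedIntermediate(proposed, filtered, ancestors)
end
```

With this layout the compiler can specialise on the field types, at the cost of losing the empty `Intermediate()` constructor: the storage has to be allocated up front instead of filled in lazily.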

charlesknipp · 2024-11-19T23:16:23Z

src/algorithms/bootstrap.jl

+function instantiate(
+    model::StateSpaceModel{T}, filter::BootstrapFilter; kwargs...
+) where {T}
+    N = filter.N
+    particle_state = ParticleState(Vector{Vector{T}}(undef, N), Vector{T}(undef, N))
+    return ParticleContainer(
+        particle_state, deepcopy(particle_state), Vector{Int}(undef, N)
+    )
 end


I think we can generalize this to any AbstractParticleFilter, except for the RBPF

Indeed. We can probably even extend it to the RBPF too: it's just a matter of replacing the Vector{T} with the Rao-Blackwellised particle type.
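A rough sketch of that generalisation, assuming the per-particle state type is passed in explicitly. The struct definitions here are stand-ins with guessed field names (the real ParticleState and ParticleContainer live in src/containers.jl), and `instantiate_particles` and `RBParticle` are hypothetical.

```julia
# Stand-in definitions; field names are assumptions, not the package's.
struct ParticleState{PT,WT<:Real}
    particles::Vector{PT}
    log_weights::Vector{WT}
end

struct ParticleContainer{PT,WT}
    filtered::ParticleState{PT,WT}
    proposed::ParticleState{PT,WT}
    ancestors::Vector{Int}
end

# Hypothetical Rao-Blackwellised particle: an outer state plus an inner conditional state.
struct RBParticle{XT,ZT}
    x::XT
    z::ZT
end

# Generic allocator: S is the per-particle state type, WT the weight type.
function instantiate_particles(::Type{S}, ::Type{WT}, N::Integer) where {S,WT<:Real}
    state = ParticleState(Vector{S}(undef, N), Vector{WT}(undef, N))
    return ParticleContainer(state, deepcopy(state), Vector{Int}(undef, N))
end
```

The bootstrap case would then be `instantiate_particles(Vector{T}, T, N)`, and the RBPF case would pass something like `RBParticle{Vector{T},ZT}` for a suitable inner state type `ZT`.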

THargreaves · 2024-11-20T00:07:04Z

I also want to make sure this code is type stable since this could be a huge bottleneck in terms of performance.

Yeah, I definitely think some things have slipped through in the process. The code, especially the GPU stuff, seems slower now than I expected.

THargreaves added 4 commits October 29, 2024 17:12

Add unit tests for CSMC and RBCSMC

97679c4

Add prototype Kalman smoother

10dfd4d

Merge branch 'main' into th/RBCSMC

ea91860

Correct CSMC algorithm and test against RTS

954d62e

Use offset arrays to include initial state in reference trajectory. Implement Kalman smoother for use in unit tests.

THargreaves added 8 commits November 5, 2024 11:19

Correct batch KF test to match container API

6ded93b

Add prototype for GPU sparse path storage

5cfb567

Update Statistics package for new Julia version

3e6b907

Add callback for sparse GPU ancestry

189e3db

Generalised GPU/RB sparse particle storage and rewrite GPU particle c…

8c40b86

…ontainers

Fix bug with sparse container expansion

3e0bd7b

Add GPU reference trajectory

4cacbb7

Add OffsetArrays as dependency

fc96394

THargreaves mentioned this pull request Nov 12, 2024

GPU Sparse Particle Storage #23

Closed

Merge branch 'th/gpu-sparse-storage' into th/RBCSMC

047a5b2

THargreaves changed the title ~~CSMC with RB and GPU-acceleration~~ CSMC & Sparse Particle Storage with RB and GPU-acceleration Nov 12, 2024

Added unit test for GPU-RB-CSMC

4b1a6da

THargreaves added 3 commits November 18, 2024 15:53

Remove redundant code from unit tests

e599f86

Move constructor to outer for ancestor callback

69cb40d

Modify CPU interface to act on distributions rather than intermediates

1e512ea

Adapt remaining algorithms to distribution-based inference interface

c5c01b1

charlesknipp reviewed Nov 19, 2024

THargreaves added 2 commits November 19, 2024 23:58

Increase particle count for GPU-RB-CSMC test

d5cd0fa

Avoid redundant division

7aaa90b

THargreaves added 2 commits November 20, 2024 00:09

Remove weight reset from update_ref

d4e15bd

Remove arbitrary weights from reference trajectory

9e72cd5
