Updated Likelihood Interface #29

Draft · wants to merge 17 commits into master
Conversation

@charlesknipp (Collaborator) commented Sep 9, 2024

This is a WIP PR which adds a suite of estimation tools for SSJ. While it doesn't implement anything at the block level (as of writing), it does introduce a number of conventions that may be useful for likelihood evaluation regardless of how it's ultimately structured. These features are listed below:

  • Probability distributions, each with a sampling method rand, a log-likelihood method logpdf, and a method in_support to check whether a given draw lies within the distribution's support (a minimal sketch follows this list). This design may change; any feedback is appreciated (especially for the Prior class).
  • Shock objects (limited to AR, MA, and ARMA for now) and containers that build on the dictionary design paradigm used throughout the module.
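
As a rough illustration of the distribution interface above, here is a minimal sketch. Only the Prior class and the rand/logpdf/in_support method names come from this PR; the Normal subclass and its parameterization are made up for the example:

```python
import numpy as np

class Prior:
    """Base class for prior distributions (interface only)."""
    def rand(self, size=None):
        raise NotImplementedError
    def logpdf(self, x):
        raise NotImplementedError
    def in_support(self, x):
        raise NotImplementedError

class Normal(Prior):
    """Hypothetical concrete distribution, for illustration only."""
    def __init__(self, mu=0.0, sigma=1.0):
        self.mu, self.sigma = mu, sigma
    def rand(self, size=None):
        return np.random.default_rng().normal(self.mu, self.sigma, size)
    def logpdf(self, x):
        z = (x - self.mu) / self.sigma
        return -0.5 * np.log(2 * np.pi * self.sigma**2) - 0.5 * z**2
    def in_support(self, x):
        return np.isfinite(x)  # the normal has unbounded support
```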

Lastly, I added a notebook which demonstrates these methods in action. Some of them are rather clunky (particularly around parameterization), but they are completely generalizable and will at least get the ball rolling for future iterations.

@charlesknipp (Collaborator, Author) commented Sep 9, 2024

Credit to @bbardoczy for suggesting optimagic instead of scipy.optimize. This saved a ton of time by allowing me to pass a dictionary as the argument to the objective function.

I still need to figure out whether the user should be allowed to pass a numpy array within a nested dictionary. For ARMA(p, q) shocks, the parameters phi and theta are arrays, so I'm not sure how I want to handle this case. Any suggestions are welcome.
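
For reference, the pattern looks roughly like this (the objective and parameter names are made up for illustration; the array nested inside the dict is exactly the case I'm unsure about):

```python
import numpy as np
import optimagic as om

def neg_log_likelihood(params):
    # params arrives with the same nested structure as the start values
    # below; this stand-in objective just penalizes distance from a target.
    return (params["sigma"] - 1.0) ** 2 + np.sum((params["phi"] - 0.5) ** 2)

start = {"sigma": 0.5, "phi": np.array([0.9, -0.2])}  # array nested in a dict
res = om.minimize(fun=neg_log_likelihood, params=start, algorithm="scipy_neldermead")
print(res.params)
```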

@charlesknipp (Collaborator, Author) commented

I cleaned up the estimation notebook considerably. For now, I limit estimation to the shock parameters to avoid recomputing the Jacobian (that is a much bigger issue).

I also added a random-walk Metropolis-Hastings (MH) algorithm at the bottom of the notebook. It seems finicky since the current log-likelihood function references globals; this is definitely something I plan on addressing later this week.
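
A self-contained version would look something like the sketch below, where the log posterior is passed in explicitly instead of being read from globals (all names are illustrative, not the notebook's actual implementation):

```python
import numpy as np

def rwmh(log_post, theta0, n_draws=10_000, step=0.1, seed=0):
    """Random-walk Metropolis-Hastings with a spherical Gaussian proposal."""
    rng = np.random.default_rng(seed)
    theta = np.atleast_1d(np.asarray(theta0, dtype=float))
    lp = log_post(theta)
    draws = np.empty((n_draws, theta.size))
    accepted = 0
    for i in range(n_draws):
        proposal = theta + step * rng.standard_normal(theta.size)
        lp_prop = log_post(proposal)
        # accept with probability min(1, exp(lp_prop - lp))
        if np.log(rng.uniform()) < lp_prop - lp:
            theta, lp = proposal, lp_prop
            accepted += 1
        draws[i] = theta
    return draws, accepted / n_draws
```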

@charlesknipp (Collaborator, Author) commented Sep 11, 2024

PyMC Integration

I added another notebook to try interfacing with pymc. It's not a very intuitive process, but it can be disguised under the hood. There is also a ton of room for improvement in terms of optimization, but that requires integrating existing code with pytensor, so I'm not convinced this is the best path forward.
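
The "disguising" amounts to wrapping the external log likelihood in a custom pytensor Op, roughly as in this sketch (the log_likelihood stand-in and shapes are made up; the real one would call the SSJ machinery):

```python
import numpy as np
import pymc as pm
import pytensor.tensor as pt
from pytensor.graph import Apply, Op

def log_likelihood(theta):
    # stand-in for the SSJ likelihood evaluation; hypothetical
    return -0.5 * np.sum(theta**2)

class LogLike(Op):
    """Black-box log likelihood wrapped as a pytensor Op (no gradient)."""
    def make_node(self, theta):
        theta = pt.as_tensor_variable(theta)
        return Apply(self, [theta], [pt.dscalar()])
    def perform(self, node, inputs, outputs):
        (theta,) = inputs
        outputs[0][0] = np.asarray(log_likelihood(theta))

loglike_op = LogLike()
with pm.Model():
    theta = pm.Normal("theta", 0.0, 1.0, shape=2)
    pm.Potential("loglike", loglike_op(theta))
    # gradient-free step method, since the Op defines no grad
    idata = pm.sample(draws=500, step=pm.Metropolis())
```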

In my demo here, the sampler takes about 8 minutes, significantly slower than the simple/stupid sampler in my other notebook. This is likely due to the "black box" inference of the modeling language.

Why Use a PPL?

The advantage of using a PPL (probabilistic programming language) like pymc is clear when applying the same model to a number of different solvers. It also paves the way for more robust and efficient inference algorithms like HMC.

I'm not too familiar with the Bayesian landscape in Python, so definitely let me know if there is anything more robust.

@charlesknipp (Collaborator, Author) commented

@bbardoczy I figured out the problem with parameter inference in PyMC. Apparently each draw gets allocated as a zero-dimensional numpy array rather than a scalar. This causes AccumulatedDerivative to error when multiplying, even though we are still technically multiplying by a scalar.

I can't change PyMC's behavior, since it's technically not doing anything wrong, and in most cases this is helpful for sampling efficiency. The problem is, I don't want to change simple_displacement.py since that may override expected behavior. I may need a little help understanding exactly what motivated this design choice.
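
To make the symptom concrete, this is just numpy's 0-d array behavior:

```python
import numpy as np

x = np.array(1.5)                   # what PyMC hands over: a 0-d array
print(x.ndim, x.shape)              # 0 ()
print(np.isscalar(x))               # False, despite behaving like a scalar
print(isinstance(x, (int, float)))  # False, so scalar-only type checks fail
print(float(x))                     # 1.5, the underlying value is recoverable
```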

@bbardoczy (Collaborator) commented

I see. My guess is that we could safely extend all the scalar operations of the AccumulatedDerivative class to also work with zero-dimensional numpy arrays. You could try that and see if it breaks any of the tests; there are some direct tests of AccumulatedDerivative in test_displacement_handlers.py. You should also check that the notebooks still run properly, since they all use SimpleBlocks and SolvedBlocks.
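
One way to do that (a sketch, not the actual source) would be to coerce 0-d arrays to Python scalars before the existing scalar logic runs, e.g. with a helper like:

```python
import numpy as np

def _as_scalar(x):
    """Coerce 0-d numpy arrays and numpy scalar types to plain Python scalars."""
    if isinstance(x, np.ndarray) and x.ndim == 0:
        return x.item()
    if isinstance(x, np.generic):
        return x.item()
    return x

# hypothetical use inside AccumulatedDerivative's arithmetic methods:
# def __mul__(self, other):
#     other = _as_scalar(other)
#     ...existing scalar logic unchanged...
```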
