Stan model as input to pigeons

Note

We use the package BridgeStan.jl as a package extension which will attempt to automatically install Stan. For BridgeStan.jl to work, a C++ compiler and make are needed, see the BridgeStan requirements.

To target the posterior distribution specified by a Stan model, use a StanLogPotential.

Here we show how this is done using our familiar unidentifiable toy example ported to the Stan language.

using BridgeStan
using Pigeons
using Random

# We will use this type to make sure our iid sampler (next section) will
# be used only for this model
struct StanUnidentifiableExample end

function stan_unid(n_trials, n_successes)
    # path to a .stan file (compiled files will be cached in the same directory)
    stan_file = dirname(dirname(pathof(Pigeons))) * "/examples/stan/unid.stan"

    # data can be specified either using...
    #   - a path to a json file with suffix .json containing the data to condition on
    #   - the JSON string itself (here via the utility Pigeons.json())
    stan_data = Pigeons.json(; n_trials, n_successes)

    return StanLogPotential(stan_file, stan_data, StanUnidentifiableExample())
end

pt = pigeons(target = stan_unid(100, 50), reference = stan_unid(0, 0))

BridgeStan not found at location specified by $BRIDGESTAN environment variable, downloading version 2.6.2 to /home/runner/.bridgestan/bridgestan-2.6.2
Done!
┌ Warning: Loading a shared object '/home/runner/work/Pigeons.jl/Pigeons.jl/examples/stan/unid_model.so' which is already loaded.
│ If the file has changed since the last time it was loaded, this load may not update the library!
└ @ BridgeStan ~/.julia/packages/BridgeStan/Qc2Lj/src/model.jl:60
┌ Info: Neither traces, disk, nor online recorders included.
│    You may not have access to your samples (unless you are using a custom recorder, or maybe you just want log(Z)).
└    To add recorders, use e.g. pigeons(target = ..., record = [traces; record_default()])
┌ Warning: It looks like sample_iid!() is not implemented for a
│ reference_log_potential of type StanLogPotential{...}.
│ Instead, using step!().
└ @ Pigeons ~/work/Pigeons.jl/Pigeons.jl/src/targets/target.jl:51
──────────────────────────────────────────────────────────────────────────────────────────────────
  scans        Λ        time(s)    allc(B)  log(Z₁/Z₀)   min(α)     mean(α)    min(αₑ)   mean(αₑ)
────────── ────────── ────────── ────────── ────────── ────────── ────────── ────────── ──────────
        2        1.3      0.548   1.92e+07      -5.89     0.0027      0.856      0.351      0.569
        4       1.85      0.113    6.9e+06      -4.38      0.592      0.794      0.369      0.585
        8       1.36    0.00763   1.05e+05      -4.83      0.282      0.849      0.519      0.618
       16       1.56     0.0156   2.01e+05      -4.88      0.595      0.827      0.503      0.586
       32       1.29      0.028    3.4e+05      -4.49      0.733      0.857      0.574      0.647
       64       1.37     0.0564   5.41e+05      -4.67      0.725      0.848      0.565      0.632
      128       1.53      0.112    9.4e+05      -5.04      0.787       0.83       0.58      0.626
      256       1.38      0.226   1.75e+06      -4.78      0.805      0.847      0.554      0.632
      512       1.57      0.451   3.37e+06      -4.95       0.78      0.826      0.546      0.628
 1.02e+03        1.5      0.904   6.62e+06      -4.97      0.798      0.834      0.551      0.623
──────────────────────────────────────────────────────────────────────────────────────────────────

Notice that we have specified a reference distribution, in this case the same model but with no observations (hence the prior). This needs to be done with Stan targets because it is not possible to automatically extract a prior from a .stan file.

For a StanLogPotential, the default_explorer() is AutoMALA^[1].

Sampling from the reference distribution

Ability to sample from the reference distribution can be beneficial, e.g. to jump modes in multi-modal distribution. For stan targets, this is done as follows:

using BridgeStan

function Pigeons.sample_iid!(
        log_potential::StanLogPotential{M, S, D, StanUnidentifiableExample}, replica, shared) where {M, S, D}
    # sample in constrained space
    constrained = rand(replica.rng, 2)
    # transform to unconstrained space
    replica.state.unconstrained_parameters .= BridgeStan.param_unconstrain(log_potential.model, constrained)
end

pt = pigeons(target = stan_unid(100, 50), reference = stan_unid(0, 0))

┌ Warning: Loading a shared object '/home/runner/work/Pigeons.jl/Pigeons.jl/examples/stan/unid_model.so' which is already loaded.
│ If the file has changed since the last time it was loaded, this load may not update the library!
└ @ BridgeStan ~/.julia/packages/BridgeStan/Qc2Lj/src/model.jl:60
┌ Warning: Loading a shared object '/home/runner/work/Pigeons.jl/Pigeons.jl/examples/stan/unid_model.so' which is already loaded.
│ If the file has changed since the last time it was loaded, this load may not update the library!
└ @ BridgeStan ~/.julia/packages/BridgeStan/Qc2Lj/src/model.jl:60
┌ Info: Neither traces, disk, nor online recorders included.
│    You may not have access to your samples (unless you are using a custom recorder, or maybe you just want log(Z)).
└    To add recorders, use e.g. pigeons(target = ..., record = [traces; record_default()])
──────────────────────────────────────────────────────────────────────────────────────────────────
  scans        Λ        time(s)    allc(B)  log(Z₁/Z₀)   min(α)     mean(α)    min(αₑ)   mean(αₑ)
────────── ────────── ────────── ────────── ────────── ────────── ────────── ────────── ──────────
        2       1.24    0.00138   4.86e+04      -4.29     0.0658      0.863      0.383      0.593
        4       1.73    0.00369   5.91e+04       -4.2      0.429      0.808      0.476      0.607
        8        1.2    0.00631    1.1e+05      -4.52      0.706      0.867      0.481       0.59
       16       1.12     0.0127   1.88e+05      -4.64      0.702      0.875      0.598       0.65
       32       1.77      0.025      3e+05         -5      0.593      0.803      0.506      0.622
       64       1.45     0.0501   5.19e+05      -4.84      0.735      0.839      0.534       0.63
      128        1.5      0.102   8.87e+05      -4.81      0.733      0.834      0.555       0.63
      256       1.53      0.202   1.64e+06      -4.93      0.784       0.83      0.602      0.634
      512       1.51      0.403   3.16e+06      -4.92      0.804      0.833      0.568      0.635
 1.02e+03       1.56      0.812    6.2e+06      -5.03       0.81      0.827      0.531      0.629
──────────────────────────────────────────────────────────────────────────────────────────────────

Manipulating the output

Internally, Stan target's states are stored in an unconstrained parameterization provided by Stan (for example, bounded support variables are mapped to the full real line). However, sample post-processing functions such as sample_array() and process_sample() convert back to the original ("constrained") parameterization via extract_sample().

As a result parameterization issues can be essentially ignored when post-processing, for example some common post-processing are shown below, see the section on output processing for more information.

using MCMCChains
using StatsPlots
plotlyjs()

pt = pigeons(
        target = stan_unid(100, 50),
        reference = stan_unid(0, 0),
        record = [traces])
samples = Chains(pt)
my_plot = StatsPlots.plot(samples)
StatsPlots.savefig(my_plot, "stan_posterior_densities_and_traces.html");

samples

Chains MCMC chain (1024×3×1 Array{Float64, 3}):

Iterations        = 1:1:1024
Number of chains  = 1
Samples per chain = 1024
parameters        = p1, p2
internals         = log_density

Summary Statistics
  parameters      mean       std      mcse   ess_bulk   ess_tail      rhat   e ⋯
      Symbol   Float64   Float64   Float64    Float64    Float64   Float64     ⋯

          p1    0.7107    0.1471    0.0070   417.8783   281.3432    1.0030     ⋯
          p2    0.7207    0.1470    0.0073   398.8000   360.5433    1.0000     ⋯
                                                                1 column omitted

Quantiles
  parameters      2.5%     25.0%     50.0%     75.0%     97.5% 
      Symbol   Float64   Float64   Float64   Float64   Float64 

          p1    0.4597    0.5916    0.6991    0.8300    0.9830
          p2    0.4801    0.5970    0.7077    0.8452    0.9769

1Biron-Lattes, M., Surjanovic, N., Syed, S., Campbell, T., & Bouchard-Côté, A.. (2024). autoMALA: Locally adaptive Metropolis-adjusted Langevin algorithm. Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 238:4600-4608.