Extended output (i.e., for all chains)

So far, when recording samples (either in memory via traces or on disk via disk), we have stored only the samples from the target distribution. This is the most common scenario and the default. Here we show how to store the samples from all chains instead.

This can be useful in scenarios where all distributions $\pi_i$ are of interest, e.g. in certain statistical mechanics applications and for Bayesian inference under model mis-specification.
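For orientation, one common way to define such a family of distributions in parallel tempering is a geometric path between a reference $\pi_0$ and the target $\pi_1$. This page does not state which path is used, so the formula below is an assumption for illustration, with $N$ denoting the number of chains and $\beta_i$ the annealing parameter of the $i$-th distribution:

```latex
\pi_{\beta_i}(x) \;\propto\; \pi_0(x)^{\,1-\beta_i}\,\pi_1(x)^{\,\beta_i},
\qquad 0 = \beta_1 < \beta_2 < \dots < \beta_N = 1 .
```

Under this assumption, $\beta = 0$ recovers the reference, $\beta = 1$ recovers the target, and intermediate values give distributions that sit between the two.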

The key argument to add is extended_traces = true, which we demonstrate for various common scenarios below.

Posterior densities and trace plots for all chains

Make sure the third-party DynamicPPL, MCMCChains, and StatsPlots packages are installed:

using Pkg; Pkg.add(["DynamicPPL", "MCMCChains", "StatsPlots"])

Then use the following:

using DynamicPPL
using Pigeons
using MCMCChains
using StatsPlots
plotlyjs()

# example target: Binomial likelihood with parameter p = p1 * p2
an_unidentifiable_model = Pigeons.toy_turing_unid_target(100, 50)

pt = pigeons(target = an_unidentifiable_model,
                n_rounds = 12,
                extended_traces = true,
                # make sure to record the trace:
                record = [traces; round_trip; record_default()])

# collect the statistics and convert to MCMCChains' Chains
# so that axis labels match the variable names used in Turing and Stan
samples = Chains(pt)

# create the trace plots
my_plot = StatsPlots.plot(samples)
StatsPlots.savefig(my_plot, "posterior_densities_and_traces_extended.html");

─────────────────────────────────────────────────────────────────────────────────────────────────────────────
  scans     restarts      Λ        time(s)    allc(B)  log(Z₁/Z₀)   min(α)     mean(α)    min(αₑ)   mean(αₑ)
────────── ────────── ────────── ────────── ────────── ────────── ────────── ────────── ────────── ──────────
        2          0       3.24    0.00887   1.33e+06      -8.14    0.00178       0.64          1          1
        4          0       1.64    0.00269   2.46e+06      -5.04     0.0352      0.818          1          1
        8          0       1.17    0.00502   4.82e+06      -4.42      0.708      0.871          1          1
       16          1        1.2     0.0109   1.01e+07      -4.03      0.549      0.867          1          1
       32          6       1.11     0.0228   1.98e+07      -4.77      0.754      0.877          1          1
       64         11       1.35     0.0451   3.99e+07      -4.79      0.698       0.85          1          1
      128         25        1.6     0.0815   7.89e+07      -4.97      0.725      0.823          1          1
      256         43       1.51      0.159   1.57e+08      -4.92      0.758      0.832          1          1
      512         95       1.48      0.397   3.17e+08      -4.94      0.806      0.836          1          1
 1.02e+03        170       1.53      0.796   6.31e+08      -5.08      0.808       0.83          1          1
 2.05e+03        377        1.5       1.49   1.26e+09      -5.03      0.819      0.833          1          1
  4.1e+03        729       1.51       2.75   2.52e+09      -4.96      0.816      0.832          1          1
─────────────────────────────────────────────────────────────────────────────────────────────────────────────

In the resulting plot, the 10 different colours correspond to the 10 chains, which interpolate between the prior (here a uniform distribution) and the posterior.

Off-memory processing for all chains

The same option, extended_traces = true, can be used in the same way to save samples from all chains to disk:

using Pigeons

# example target: a 1000-dimensional target
high_d_target = Pigeons.toy_mvn_target(1000)

pt = pigeons(target = high_d_target,
                checkpoint = true,
                extended_traces = true,
                record = [disk])

first_dim_of_each = zeros(10, 1024)
process_sample(pt) do chain, scan, sample # ordered as if we had an inner loop over scans
    # each sample here is a Vector{Float64} of length 1000
    # in general, it is produced by extract_sample()
    first_dim_of_each[chain, scan] = sample[1]
end

──────────────────────────────────────────────────────
  scans        Λ      log(Z₁/Z₀)   min(α)     mean(α)
────────── ────────── ────────── ────────── ──────────
        2          9  -1.18e+03  7.04e-107   0.000359
        4       8.97  -1.17e+03  1.31e-102     0.0035
        8       8.38   -1.2e+03  1.15e-107     0.0694
       16       8.99  -1.18e+03   1.94e-93    0.00096
       32       8.95  -1.17e+03   4.84e-72    0.00518
       64       8.89  -1.17e+03   1.44e-83     0.0126
      128       8.78  -1.16e+03   1.49e-69     0.0249
      256       8.92  -1.15e+03   2.16e-62    0.00875
      512       8.93  -1.16e+03   5.97e-67    0.00753
 1.02e+03       8.95  -1.16e+03   2.95e-66    0.00603
──────────────────────────────────────────────────────
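Once a matrix such as first_dim_of_each has been filled, per-chain summaries can be computed with the Statistics standard library. The following is a minimal sketch: so that it runs on its own, it uses a stand-in matrix of placeholder random values with the same shape (rows are chains, columns are scans) rather than actual Pigeons output, and the names chain_means and chain_stds are introduced here for illustration only.

```julia
using Statistics  # standard library

# Stand-in for the matrix filled by process_sample:
# rows index chains, columns index scans.
# Placeholder random values are used so that this sketch is self-contained.
first_dim_of_each = randn(10, 1024)

# one summary value per chain, computed across scans
chain_means = vec(mean(first_dim_of_each; dims = 2))
chain_stds  = vec(std(first_dim_of_each; dims = 2))

for chain in axes(first_dim_of_each, 1)
    println("chain ", chain,
            ": mean = ", round(chain_means[chain]; digits = 3),
            ", std = ", round(chain_stds[chain]; digits = 3))
end
```

With real output, replacing the stand-in matrix by the one filled in the process_sample loop gives one mean and one standard deviation of the first coordinate for each chain.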

Accessing the annealing parameters

To obtain the annealing parameter used to define each intermediate distribution, use:

using Pigeons

an_unidentifiable_model = Pigeons.toy_turing_unid_target(100, 50)

pt = pigeons(target = an_unidentifiable_model)

pt.shared.tempering.schedule

Pigeons.Schedule([0.0, 0.007721326484095335, 0.01835214114177213, 0.03664880881340905, 0.06793721365019918, 0.11670085893172734, 0.20042097455606725, 0.34327402177825367, 0.5714152185862837, 1.0])
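To see how the annealing parameters are spread over [0, 1], the gaps between neighbouring values can be inspected. The sketch below copies the values by hand from the schedule printed above, because the field that holds the vector inside the Schedule object is not shown on this page; the names betas and gaps are introduced here for illustration.

```julia
# Annealing parameters copied from the printed schedule
betas = [0.0, 0.007721326484095335, 0.01835214114177213, 0.03664880881340905,
         0.06793721365019918, 0.11670085893172734, 0.20042097455606725,
         0.34327402177825367, 0.5714152185862837, 1.0]

# sanity checks: increasing grid with end points at 0 and 1
@assert issorted(betas)
@assert first(betas) == 0.0 && last(betas) == 1.0

# gaps between neighbouring annealing parameters
gaps = diff(betas)

for i in eachindex(gaps)
    println("position ", i, " to ", i + 1, ": gap = ", round(gaps[i]; digits = 4))
end
```

For this particular run, the gaps grow from one pair to the next, so the grid is much finer near 0 than near 1. The vector has 10 entries, one per chain.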