Walkthrough

This example demonstrates the inversion workflow using an example system with a symbolic array state and symbolic array parameters.

Inversion setup

The inversion setup is specified by adding methods to specific functions, whose first argument is a subtype of AbstractCrossInversionCase that corresponds to the inversion problem.

Here, we define the singleton type DocuVecCase and provide the inversion setup by defining methods for this type. Take care to add the methods to the functions in module CrossInverts rather than defining new functions in module Main.
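The difference between extending a package function and defining a new function of the same name can be sketched with a small stand-in module. All names here (DemoInverts, get_demo_ids, count_ids, GoodCase, LostCase) are hypothetical and only illustrate the dispatch behaviour; they are not part of CrossInverts.

```julia
# Hypothetical stand-in for a package such as CrossInverts.
module DemoInverts
    abstract type AbstractDemoCase end
    # Fallback method used when a case does not provide its own setup.
    get_demo_ids(::AbstractDemoCase) = ()
    # Package-internal code only ever calls DemoInverts.get_demo_ids.
    count_ids(case::AbstractDemoCase) = length(get_demo_ids(case))
end

struct GoodCase <: DemoInverts.AbstractDemoCase end
struct LostCase <: DemoInverts.AbstractDemoCase end

# Qualified name: adds a method to the function owned by the package.
DemoInverts.get_demo_ids(::GoodCase) = (:A, :B, :C)

# Unqualified name: defines a new, unrelated function in the current module.
get_demo_ids(::LostCase) = (:A, :B, :C)

n_good = DemoInverts.count_ids(GoodCase())  # the package sees the new method
n_lost = DemoInverts.count_ids(LostCase())  # the package falls back to the default
```

In this sketch the package finds three identifiers for GoodCase but none for LostCase, because the unqualified definition never reached the package's function.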

Example system

First, let's set up the system to be inverted, using function get_case_inverted_system.

using ModelingToolkit, OrdinaryDiffEq
using ModelingToolkit: t_nounits as t, D_nounits as D
using ComponentArrays: ComponentArrays as CA
using MTKHelpers
using CrossInverts
using DistributionFits
using PDMats: PDiagMat
using Turing

function samplesystem_vec(; name, τ = 3.0, i = 0.1, p = [1.1, 1.2, 1.3])
    n_comp = 2
    @parameters t
    D = Differential(t)
    @variables x(..)[1:n_comp] dec2(..)
    ps = @parameters τ=τ i=i i2 p[1:3]=p
    sts = vcat([x(t)[i] for i in 1:n_comp], dec2(t))
    eq = [
        D(x(t)[1]) ~ i - p[1] * x(t)[1] + (p[2] - x(t)[1]^2) / τ,
        D(x(t)[2]) ~ i - dec2(t) + i2,
        dec2(t) ~ p[3] * x(t)[2], # observable
    ]
    sys = ODESystem(eq, t, sts, vcat(ps...); name)
end

struct DocuVecCase <: AbstractCrossInversionCase end

function CrossInverts.get_case_inverted_system(::DocuVecCase; scenario)
    @named sv = samplesystem_vec()
    @named system = embed_system(sv)
    u0_default = CA.ComponentVector()
    p_default = CA.ComponentVector(sv₊i2 = 0.1)
    (;system, u0_default, p_default)
end

inv_case = DocuVecCase()
scenario = NTuple{0, Symbol}()
(;system, u0_default, p_default) = get_case_inverted_system(inv_case; scenario)
system

\[ \begin{align} \frac{\mathrm{d} sv_{+}x\left( t \right)_{1}}{\mathrm{d}t} &= sv_{+}i + \frac{sv_{+}p_{2} - sv_{+}x\left( t \right)_{1}^{2}}{sv_{+}\tau} - sv_{+}p_{1} sv_{+}x\left( t \right)_{1} \\ \frac{\mathrm{d} sv_{+}x\left( t \right)_{2}}{\mathrm{d}t} &= sv_{+}i + sv_{+}i2 - sv_{+}dec2\left( t \right) \end{align} \]

Here, some parameters have default values; others, such as sv₊i2, need to be specified with the returned ComponentVector p_default.

Optimized parameters and individuals

First, we define which parameters should be calibrated as fixed, ranadd, ranmul, or individual parameters using function get_case_mixed_keys. Next, we define which individuals take part in the inversion scenario using function get_case_indiv_ids.

function CrossInverts.get_case_mixed_keys(::DocuVecCase; scenario)
    (;
        fixed = (:sv₊p,),
        ranadd = (),
        ranmul = (:sv₊x, :sv₊τ),
        indiv = (:sv₊i,))
end

CrossInverts.get_case_indiv_ids(::DocuVecCase; scenario) = (:A, :B, :C)

Priors, Observations, and Observation uncertainty

We need to provide additional information to the inversion, such as observations, observation uncertainties, and prior distribution.

We provide priors with function get_case_priors_dict. For simplicity, we return the same priors independent of the individual or the scenario. For the SymbolicArray parameters, we need to provide a multivariate distribution. Here, we provide a product distribution of uncorrelated LogNormal distributions, each of which is specified by its mode and upper quantile using df_from_paramsModeUpperRows.
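To make the mode and upper-quantile specification concrete, here is a minimal sketch of how a LogNormal can be derived from these two numbers. The helper lognormal_from_mode_upper is hypothetical and is not the DistributionFits implementation. It assumes that the upper quantile is the 0.975 quantile, which is consistent with the values this walkthrough reports for mode 1.0 and upper 2.0 (μ ≈ 0.0936, σ ≈ 0.3059).

```julia
using Distributions

# Hypothetical helper, not the DistributionFits implementation.
# Mode of LogNormal(μ, σ): exp(μ - σ^2); p-quantile: exp(μ + σ * z_p).
# Eliminating μ yields σ^2 + z_p * σ - (log(upper) - log(m)) = 0.
function lognormal_from_mode_upper(m, upper; p = 0.975)
    z = quantile(Normal(), p)
    Δ = log(upper) - log(m)
    σ = (-z + sqrt(z^2 + 4Δ)) / 2   # positive root of the quadratic
    μ = log(m) + σ^2
    LogNormal(μ, σ)
end

d = lognormal_from_mode_upper(1.0, 2.0)
# The fitted distribution reproduces the requested mode and upper quantile.
(mode(d), quantile(d, 0.975))
```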

function CrossInverts.get_case_priors_dict(::DocuVecCase, indiv_id; scenario = NTuple{0, Symbol}())
    #using DataFrames, Tables, DistributionFits, Chain
    paramsModeUpperRows = [
        # τ = 3.0, i = 0.1, p = [1.1, 1.2, 1.3])
        (:sv₊i, LogNormal, 1.0, 6.0),
        (:sv₊τ, LogNormal, 1.0, 5.0),
        (:sv₊x_1, LogNormal, 1.0, 2.0),
        (:sv₊x_2, LogNormal, 1.0, 2.0),
    ]
    df_scalars = df_from_paramsModeUpperRows(paramsModeUpperRows)
    dd = Dict{Symbol, Distribution}(df_scalars.par .=> df_scalars.dist)
    dist_p0 = fit(LogNormal, @qp_m(1.0), @qp_uu(3.0))
    # dd[:sv₊p] = product_distribution(fill(dist_p0, 3))
    # dd[:sv₊x] = product_distribution(dd[:sv₊x_1], dd[:sv₊x_2])
    dd[:sv₊p] = product_MvLogNormal(fill(dist_p0, 3)...)
    dd[:sv₊x] = product_MvLogNormal(dd[:sv₊x_1], dd[:sv₊x_2])
    dd
end
function product_MvLogNormal(comp...)
    μ = collect(getproperty.(comp, :μ))
    σ = collect(getproperty.(comp, :σ))
    Σ = PDiagMat(σ .^ 2) # covariance of the log-variables: variances on the diagonal
    MvLogNormal(μ, Σ)
end

get_case_priors_dict(inv_case, :A; scenario)

Dict{Symbol, Distributions.Distribution} with 6 entries:
  :sv₊x   => Distributions.MvLogNormal{Float64, PDMats.PDiagMat{Float64, Vector…
  :sv₊i   => Distributions.LogNormal{Float64}(μ=0.461001, σ=0.678971)
  :sv₊x_2 => Distributions.LogNormal{Float64}(μ=0.0935794, σ=0.305908)
  :sv₊x_1 => Distributions.LogNormal{Float64}(μ=0.0935794, σ=0.305908)
  :sv₊p   => Distributions.MvLogNormal{Float64, PDMats.PDiagMat{Float64, Vector…
  :sv₊τ   => Distributions.LogNormal{Float64}(μ=0.388227, σ=0.623078)
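Distributions.jl parameterizes MvLogNormal(μ, Σ) by the mean vector and covariance matrix of the log-transformed variable. Independent LogNormal(μ_i, σ_i) components therefore correspond to a diagonal Σ with entries σ_i². The following check, using illustrative values, compares the two log-densities.

```julia
using Distributions
using PDMats: PDiagMat

# Illustrative parameters and evaluation point.
μ = [0.1, 0.4]
σ = [0.3, 0.6]
x = [1.2, 0.8]

# Joint log-density of independent LogNormal components.
lp_indep = sum(logpdf.(LogNormal.(μ, σ), x))
# Log-density of a multivariate LogNormal with diagonal covariance σ.^2.
lp_mv = logpdf(MvLogNormal(μ, PDiagMat(σ .^ 2)), x)

lp_indep ≈ lp_mv
```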

Similarly, we provide prior distributions for the uncertainty of the random effects with function get_case_priors_random_dict.

function CrossInverts.get_case_priors_random_dict(::DocuVecCase; scenario = NTuple{0, Symbol}())
    # prior in σ rather than σstar
    d_exp = Exponential(log(1.05))
    dd = Dict{Symbol, Distribution}([:sv₊τ, :sv₊i] .=> d_exp)
    dd[:sv₊x] = Distributions.Product(fill(d_exp, 2))
    dd
end

get_case_priors_random_dict(inv_case; scenario)

Dict{Symbol, Distributions.Distribution} with 3 entries:
  :sv₊x => Distributions.Product{Distributions.Continuous, Distributions.Expone…
  :sv₊i => Distributions.Exponential{Float64}(θ=0.0487902)
  :sv₊τ => Distributions.Exponential{Float64}(θ=0.0487902)

Further, we specify the type of distribution of the observation uncertainty for each data stream with function get_case_obs_uncertainty_dist_type.

function CrossInverts.get_case_obs_uncertainty_dist_type(::DocuVecCase, stream;
        scenario = NTuple{0, Symbol}())
    dtypes = Dict{Symbol, Type}(:sv₊dec2 => LogNormal,
        :sv₊x => MvLogNormal)
    dtypes[stream]
end

get_case_obs_uncertainty_dist_type(inv_case, :sv₊dec2; scenario)

Distributions.LogNormal

Finally, for each individual and for each stream, we provide vectors of

t: time
obs: observations (vectors for multivariate variables)
obs_unc: observation uncertainty parameters (can be matrices for multivariate variables)
obs_true (optionally): values of the true model to be rediscovered in synthetic experiments

This is done by implementing function get_case_indivdata. Usually, this information would be read from a file or database. Here, we provide the numbers as text.

function CrossInverts.get_case_indivdata(::DocuVecCase, indiv_id; scenario = NTuple{0, Symbol}())
    data = (A = (sv₊x = (t = [0.2, 0.4, 1.0, 2.0],
                obs = [
                    [2.3696993004601956, 2.673733320916141],
                    [1.8642844249865063, 2.0994355527637607],
                    [1.9744553950945931, 2.049494086682751],
                    [1.806115091024414, 1.4088107777562726],
                ],
                obs_unc = [
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                ],
                obs_true = [
                    [1.4528506430586314, 1.502300054146255],
                    [1.2174085538439976, 1.1706665606844529],
                    [1.0483430119731987, 0.7600115428483291],
                    [1.0309694961068738, 0.6441417808271487],
                ]),
            sv₊dec2 = (t = [0.2, 0.4, 1.0, 2.0],
                obs = [
                    3.7951565919532038,
                    2.932295276687423,
                    2.0064853619502925,
                    1.6522510350996853,
                ],
                obs_unc = [1.1, 1.1, 1.1, 1.1],
                obs_true = [
                    3.606705597390664,
                    2.810523520548073,
                    1.8246274291924653,
                    1.546448567322152,
                ])),
        B = (sv₊x = (t = [0.2, 0.4, 1.0, 2.0],
                obs = [
                    [2.0681893973690264, 2.76555266499398],
                    [3.002213659926257, 2.738988031384357],
                    [2.2024778579768736, 1.8863521088263966],
                    [1.8970493973645883, 1.4592874111525584],
                ],
                obs_unc = [
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                ],
                obs_true = [
                    [1.4319499386364825, 1.4846599446224278],
                    [1.2097697867481565, 1.1597529395039063],
                    [1.0512489486634184, 0.7574273823278419],
                    [1.035264629162679, 0.6439076211840167],
                ]),
            sv₊dec2 = (t = [0.2, 0.4, 1.0, 2.0],
                obs = [
                    5.286801850397016,
                    2.9649984441621826,
                    2.1180756620394585,
                    2.6749483017364,
                ],
                obs_unc = [1.1, 1.1, 1.1, 1.1],
                obs_true = [
                    3.5643554146940866,
                    2.784322217758367,
                    1.8184234047779861,
                    1.5458863994028762,
                ])),
        C = (sv₊x = (t = [0.2, 0.4, 1.0, 2.0],
                obs = [
                    [2.2350643301157382, 2.3130035358019856],
                    [2.0736166580761624, 1.9436035468232888],
                    [2.0472448291872816, 1.529804596360485],
                    [1.8267544248914431, 1.2760177129115113],
                ],
                obs_unc = [
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                    [0.09531017980432493 0.0; 0.0 0.09531017980432493],
                ],
                obs_true = [
                    [1.4810168420659708, 1.502512426277095],
                    [1.226148237932659, 1.1707979724544357],
                    [1.0387515337959667, 0.7600427779041109],
                    [1.0183823891718273, 0.6441445598911335],
                ]),
            sv₊dec2 = (t = [0.2, 0.4, 1.0, 2.0],
                obs = [
                    4.026668907719985,
                    3.1937462073315097,
                    6.2700505882164785,
                    3.4322758342125548,
                ],
                obs_unc = [1.1, 1.1, 1.1, 1.1],
                obs_true = [
                    3.607215458087877,
                    2.8108390124932754,
                    1.8247024179739757,
                    1.5464552392686794,
                ])))
    data[indiv_id]
end

get_case_indivdata(inv_case, :A; scenario)

(sv₊x = (t = [0.2, 0.4, 1.0, 2.0], obs = [[2.3696993004601956, 2.673733320916141], [1.8642844249865063, 2.0994355527637607], [1.9744553950945931, 2.049494086682751], [1.806115091024414, 1.4088107777562726]], obs_unc = [[0.09531017980432493 0.0; 0.0 0.09531017980432493], [0.09531017980432493 0.0; 0.0 0.09531017980432493], [0.09531017980432493 0.0; 0.0 0.09531017980432493], [0.09531017980432493 0.0; 0.0 0.09531017980432493]], obs_true = [[1.4528506430586314, 1.502300054146255], [1.2174085538439976, 1.1706665606844529], [1.0483430119731987, 0.7600115428483291], [1.0309694961068738, 0.6441417808271487]]), sv₊dec2 = (t = [0.2, 0.4, 1.0, 2.0], obs = [3.7951565919532038, 2.932295276687423, 2.0064853619502925, 1.6522510350996853], obs_unc = [1.1, 1.1, 1.1, 1.1], obs_true = [3.606705597390664, 2.810523520548073, 1.8246274291924653, 1.546448567322152]))
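When the data come from a file or database, it can help to check that every stream has one observation and one uncertainty entry per time point. The helper check_stream below is a hypothetical sketch and not part of CrossInverts; the numbers are shortened stand-ins for the data shown above.

```julia
# Hypothetical validation helper, not part of CrossInverts.
function check_stream(stream)
    n = length(stream.t)
    # obs and obs_unc need one entry per time point.
    ok = length(stream.obs) == n && length(stream.obs_unc) == n
    # obs_true is optional, but must have matching length when present.
    if haskey(stream, :obs_true)
        ok = ok && length(stream.obs_true) == n
    end
    ok
end

# Shortened stand-in data for one individual with a scalar and a vector stream.
indiv = (
    sv₊dec2 = (t = [0.2, 0.4], obs = [3.8, 2.9], obs_unc = [1.1, 1.1]),
    sv₊x = (t = [0.2, 0.4],
        obs = [[2.4, 2.7], [1.9, 2.1]],
        obs_unc = [[0.1 0.0; 0.0 0.1], [0.1 0.0; 0.0 0.1]],
        obs_true = [[1.5, 1.5], [1.2, 1.2]]),
)

all_ok = all(check_stream, values(indiv))
```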

Often, adjusting one parameter has consequences for other non-optimized parameters. Function get_case_problemupdater allows providing a ProblemUpdater that takes care of such dependencies. In this example, when parameter i is optimized, parameter i2 is set to the same value.

function CrossInverts.get_case_problemupdater(::DocuVecCase; system, scenario = NTuple{0, Symbol}())
    mapping = (:sv₊i => :sv₊i2,)
    pset = ODEProblemParSetter(system, Symbol[]) # parsetter to get state symbols
    get_ode_problemupdater(KeysProblemParGetter(mapping, keys(axis_state(pset))), system)
end

get_case_problemupdater(inv_case; system, scenario)

MTKHelpers.ProblemUpdater(MTKHelpers.KeysProblemParGetter{1}((:sv₊i,), (:sv₊i2,), Bool[0]), MTKHelpers.ODEProblemParSetter(Axis(state = 1:0, par = ViewAxis(1:1, Axis(sv₊i2 = 1,))), Axis(sv₊x = 1:2,), Axis(var"(sv₊x(t))[1]" = 1, var"(sv₊x(t))[2]" = 2), Axis(sv₊τ = 1, sv₊i = 2, sv₊i2 = 3, sv₊p = 4:6), Axis(state = 1:0, par = ViewAxis(1:1, Axis(sv₊i2 = 1,))), Axis(sv₊i2 = 1,), SymbolicUtils.BasicSymbolic{Real}[], SymbolicUtils.BasicSymbolic{Real}[sv₊i2], ModelingToolkit.ParameterIndex{SciMLStructures.Tunable, Int64}[ModelingToolkit.ParameterIndex{SciMLStructures.Tunable, Int64}(SciMLStructures.Tunable(), 1, false), ModelingToolkit.ParameterIndex{SciMLStructures.Tunable, Int64}(SciMLStructures.Tunable(), 4, false), ModelingToolkit.ParameterIndex{SciMLStructures.Tunable, Int64}(SciMLStructures.Tunable(), 5, false), ModelingToolkit.ParameterIndex{SciMLStructures.Tunable, Int64}(SciMLStructures.Tunable(), 6, false), ModelingToolkit.ParameterIndex{SciMLStructures.Tunable, Int64}(SciMLStructures.Tunable(), 2, false), ModelingToolkit.ParameterIndex{SciMLStructures.Tunable, Int64}(SciMLStructures.Tunable(), 3, false)], Any[], ModelingToolkit.ParameterIndex{SciMLStructures.Tunable, Int64}[ModelingToolkit.ParameterIndex{SciMLStructures.Tunable, Int64}(SciMLStructures.Tunable(), 5, false)]))
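Conceptually, the updater copies the value of a source parameter to a target parameter each time the problem is updated. The following plain-Julia sketch mimics this on a NamedTuple; apply_mapping is a hypothetical helper and does not use the MTKHelpers machinery, and the parameter values are illustrative.

```julia
# Hypothetical helper: copy the value of each source key to its target key.
function apply_mapping(p::NamedTuple, mapping)
    for (source, target) in mapping
        p = merge(p, NamedTuple{(target,)}((p[source],)))
    end
    p
end

p = (sv₊τ = 3.0, sv₊i = 0.25, sv₊i2 = 0.1)   # illustrative parameter values
p_updated = apply_mapping(p, (:sv₊i => :sv₊i2,))
p_updated.sv₊i2 == p_updated.sv₊i
```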

Compiling the setup

With all the functions of the setup defined, we can call function setup_inversion to compile the entire setup. It returns the system object, information at the population level (pop_info), and information at the individual level (indiv_info) as a DataFrame.

(;system, indiv_info, pop_info) = setup_inversion(inv_case; scenario)
keys(pop_info.sample0)

(:fixed, :ranadd, :ranmul, :ranadd_σ, :ranmul_σ, :indiv, :indiv_ranadd, :indiv_ranmul)

A single sample is a ComponentVector with components

fixed: fixed effects
ranadd: mean additive random effects
ranmul: mean multiplicative random effects
ranadd_σ: uncertainty parameter of the additive random effects
ranmul_σ: uncertainty parameter of the multiplicative random effects
indiv: Component vector of each individual with individual effects
indiv_ranadd: Difference between individual and mean additive random effect
indiv_ranmul: Ratio between individual and mean multiplicative random effect
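Read literally, these definitions imply that an individual's parameter value is the population mean times the individual ratio for multiplicative effects, and the population mean plus the individual difference for additive effects. A small sketch with illustrative numbers (not taken from the inversion) follows.

```julia
# Multiplicative random effect: individual value = mean * ratio.
ranmul_τ = 1.79                                # illustrative population mean
indiv_ranmul_τ = (A = 0.9, B = 1.0, C = 1.2)   # illustrative ratios
τ_indiv = map(r -> ranmul_τ * r, indiv_ranmul_τ)

# Additive random effect: individual value = mean + difference.
ranadd_mean = 2.0                              # illustrative population mean
indiv_ranadd = (A = -0.5, B = 0.0, C = 0.5)    # illustrative differences
add_indiv = map(δ -> ranadd_mean + δ, indiv_ranadd)
```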

A reminder of the effects:

pop_info.mixed_keys

(fixed = (:sv₊p,), ranadd = (), ranmul = (:sv₊x, :sv₊τ), indiv = (:sv₊i,))

Single components can be accessed by name:

pop_info.sample0[:ranmul]

ComponentVector{Float64}(sv₊x = [2.1651893082724047, 2.1651893082724047], sv₊τ = 1.7902227199892276)

pop_info.sample0[:indiv][:A]

ComponentVector{Float64}(sv₊i = 1.9967121496392446)

Forward simulation

Although not necessary for the inversion, it can be helpful for analysis to run a single forward simulation for all individuals for a given estimate of the effects.

First, a function is created using gen_sim_sols_probs that requires an estimate of the effects, and returns the solution and the updated problem for each individual. Then this function is called with initial estimates.

solver = AutoTsit5(Rodas5P())
sim_sols_probs = gen_sim_sols_probs(;
    tools = indiv_info.tools, psets = pop_info.psets,
    problemupdater = pop_info.problemupdater, solver)
(;fixed, ranadd, ranmul, indiv, indiv_ranadd, indiv_ranmul) = pop_info.mixed
sols_probs = sim_sols_probs(fixed, ranadd, ranmul, indiv, indiv_ranadd, indiv_ranmul)
(sol, problem_opt) = sols_probs[1]
sol[:sv₊x]

Below we just check that the ProblemUpdater really updated the non-optimized parameter i2 to the value of the optimized parameter i.

pset = pop_info.psets.fixed
get_par_labeled(pset, problem_opt)[:sv₊i2] == get_par_labeled(pset, problem_opt)[:sv₊i]

true

Model Inversion

First, a Turing model is created using gen_model_cross. Next, a few samples are drawn from this model using the NUTS sampler.

model_cross = gen_model_cross(;
    inv_case, tools = indiv_info.tools,
    priors_pop = pop_info.priors_pop, psets = pop_info.psets,
    sim_sols_probs, scenario, solver);

n_burnin = 0
n_sample = 10
chn = Turing.sample(model_cross, Turing.NUTS(n_burnin, 0.65, init_ϵ = 0.2), n_sample,
    init_params = collect(pop_info.sample0))

names(chn, :parameters)

21-element Vector{Symbol}:
 Symbol("fixed[:sv₊p][1]")
 Symbol("fixed[:sv₊p][2]")
 Symbol("fixed[:sv₊p][3]")
 Symbol("ranmul[:sv₊x][1]")
 Symbol("ranmul[:sv₊x][2]")
 Symbol("ranmul[:sv₊τ]")
 Symbol("pranmul_σ[:sv₊x][1]")
 Symbol("pranmul_σ[:sv₊x][2]")
 Symbol("pranmul_σ[:sv₊τ]")
 Symbol("indiv[:sv₊i, 1]")
 ⋮
 Symbol("indiv_ranmul[:sv₊x, 1][1]")
 Symbol("indiv_ranmul[:sv₊x, 1][2]")
 Symbol("indiv_ranmul[:sv₊τ, 1]")
 Symbol("indiv_ranmul[:sv₊x, 2][1]")
 Symbol("indiv_ranmul[:sv₊x, 2][2]")
 Symbol("indiv_ranmul[:sv₊τ, 2]")
 Symbol("indiv_ranmul[:sv₊x, 3][1]")
 Symbol("indiv_ranmul[:sv₊x, 3][2]")
 Symbol("indiv_ranmul[:sv₊τ, 3]")

For each scalarized value of the effects there is a series of samples:

a group estimate for each fixed effect. For multivariate variables the index is appended last, e.g. Symbol("fixed[:sv₊p][1]").
a group mean additive random effect (none in the example case).
a group mean multiplicative random effect, e.g. Symbol("ranmul[:sv₊τ]").
an uncertainty parameter of the ranmul effect, e.g. Symbol("pranmul_σ[:sv₊τ]").
an individual effect for each individual, e.g. Symbol("indiv[:sv₊i, 3]") for the third individual.
the individual offset for the ranadd effect for each individual (none in the example case).
the individual multiplier for the ranmul effect for each individual, e.g. Symbol("indiv_ranmul[:sv₊τ, 3]").
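The naming pattern above can be reproduced with string interpolation, which is handy for selecting columns of the chain programmatically. The helpers name_indiv, name_fixed, and is_group are hypothetical and only follow the pattern of the names listed above; they are not CrossInverts functions.

```julia
# Hypothetical helpers that follow the naming pattern of the chain columns.
name_indiv(par, i_indiv) = Symbol("indiv[:$par, $i_indiv]")
name_fixed(par, k) = Symbol("fixed[:$par][$k]")

# Test whether a column name belongs to a given group of effects.
# Including the opening bracket keeps "indiv" from matching "indiv_ranmul".
is_group(name, group) = startswith(string(name), string(group, "["))

all_names = [
    Symbol("fixed[:sv₊p][1]"),
    Symbol("ranmul[:sv₊τ]"),
    Symbol("indiv[:sv₊i, 3]"),
    Symbol("indiv_ranmul[:sv₊τ, 3]"),
]
indiv_names = filter(n -> is_group(n, :indiv), all_names)
```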

Extracting individual effects

Each row of a multivariate chain can be extracted as a ComponentVector as described in [Extracting effects from sampled object].

chn2 = chn[:,vcat(pop_info.effect_pos[:indiv_ranmul][:B]...),:]
chn3 = extract_group(chn2, :indiv_ranmul, pop_info.indiv_ids)
names(chn3)

3-element Vector{Symbol}:
 Symbol(":sv₊x[:B][1]")
 Symbol(":sv₊x[:B][2]")
 Symbol(":sv₊τ[:B]")