.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_examples/sampling/tempering/parallel_tempering.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_examples_sampling_tempering_parallel_tempering.py>`
        to download the full example code or to run this example in your browser via Binder

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_examples_sampling_tempering_parallel_tempering.py:


Parallel Tempering for Bayesian Inference and Reliability analyses
====================================================================

.. GENERATED FROM PYTHON SOURCE LINES 9-51

The general framework: one wants to sample from a distribution of the form

.. math:: p_{1}(x)=\dfrac{q_1(x)p_{0}(x)}{Z_{1}}

where :math:`q_{1}(x)` and :math:`p_{0}(x)` can be evaluated; and potentially estimate the constant
:math:`Z_{1}=\int{q_{1}(x) p_{0}(x)dx}`. Parallel tempering introduces a sequence of intermediate distributions:

.. math:: p_{\beta}(x) \propto q(x, \beta) p_{0}(x)

for values of :math:`\beta` in [0, 1] (note: :math:`\beta` is :math:`1/T` where :math:`T` is often referred as the
temperature). Setting :math:`\beta=1` equates sampling from the target, while :math:`\beta \rightarrow 0` samples from
the reference distribution :math:`p_{0}`. Periodically during the run, the different temperatures swap members of
their ensemble in a way that preserves detailed balance. The chains closer to the reference chain (hot chains) can
sample from regions that have low probability under the target and thus allow a better exploration of the parameter
space, while the cold chains can better explore the regions of high likelihood.

The normalizing constant :math:`Z_{1}` is estimated via thermodynamic integration:

.. math:: \ln{Z_{\beta=1}} = \ln{Z_{\beta=0}} + \int_{0}^{1} E_{p_{\beta}} \left[ \frac{\partial \ln{q_{\beta}(x)}}{\partial \beta} \right] d\beta = \ln{Z_{\beta=0}} + \int_{0}^{1} E_{p_{\beta}} \left[ U_{\beta}(x) \right] d\beta

where :math:`\ln{Z_{\beta=0}}=\int{q_{\beta=0}(x) p_{0}(x)dx}` can be determined by simple MC sampling since
:math:`q_{\beta=0}(x)` is close to the reference distribution :math:`p_{0}`. The function
:math:`U_{\beta}(x)=\frac{\partial \ln{q_{\beta}(x)}}{\partial \beta}` is called the potential, and can be evaluated
using posterior samples from :math:`p_{\beta}(x)`.

In the code, the user must define:
- a function to evaluate the reference distribution :math:`p_{0}(x)`,
- a function to evaluate the intermediate factor :math:`q(x, \beta)` (function that takes in two inputs: x and
:math:`\beta`),
- if evaluation of :math:`Z_{1}` is of interest, a function that evaluates the potential :math:`U_{\beta}(x)`, from
evaluations of :math:`\ln{(x, \beta)}` which are saved during the MCMC run for the various chains (different
:math:`\beta` values).

Bayesian inference

In the Bayesian setting, :math:`p_{0}` is the prior and, given a likelihood :math:`L(data; x)`:

.. math:: q_{T}(x) = L(data; x) ^{\beta}

Then for the model evidence:

.. math:: U_{\beta}(x) = \ln{L(data; x)}

.. GENERATED FROM PYTHON SOURCE LINES 55-64

.. code-block:: default


    import numpy as np
    import matplotlib.pyplot as plt

    from UQpy.run_model import RunModel, PythonModel
    from UQpy.distributions import MultivariateNormal, JointIndependent, Normal, Uniform


.. GENERATED FROM PYTHON SOURCE LINES 65-66

%%

.. GENERATED FROM PYTHON SOURCE LINES 66-91

.. code-block:: default


    from scipy.stats import multivariate_normal, norm, uniform

    # bimodal posterior
    mu1 = np.array([1., 1.])
    mu2 = -0.8 * np.ones(2)
    w1 = 0.5
    # Width of 0.1 in each dimension
    sigma1 = np.diag([0.02, 0.05])
    sigma2 = np.diag([0.05, 0.02])

    # define prior, likelihood and target (posterior)
    prior_distribution = JointIndependent(marginals=[Uniform(loc=-2, scale=4), Uniform(loc=-2, scale=4)])


    def log_likelihood(x):
        # Posterior is a mixture of two gaussians
        return np.logaddexp(np.log(w1) + multivariate_normal.logpdf(x=x, mean=mu1, cov=sigma1),
                            np.log(1. - w1) + multivariate_normal.logpdf(x=x, mean=mu2, cov=sigma2))


    def log_target(x):
        return log_likelihood(x) + prior_distribution.log_pdf(x)


.. GENERATED FROM PYTHON SOURCE LINES 93-122

.. code-block:: default


    # estimate evidence
    def estimate_evidence_from_prior_samples(size):
        samples = -2. + 4 * np.random.uniform(size=size * 2).reshape((size, 2))
        return np.mean(np.exp(log_likelihood(samples)))


    def func_integration(x1, x2):
        x = np.array([x1, x2]).reshape((1, 2))
        return np.exp(log_likelihood(x)) * (1. / 4) ** 2


    def estimate_evidence_from_quadrature():
        from scipy.integrate import dblquad
        ev = dblquad(func=func_integration, a=-2, b=2, gfun=lambda x: -2, hfun=lambda x: 2)
        return ev


    x = np.arange(-2, 2, 0.02)
    y = np.arange(-2, 2, 0.02)
    xx, yy = np.meshgrid(x, y)
    z = np.exp(log_likelihood(np.concatenate([xx.reshape((-1, 1)), yy.reshape((-1, 1))], axis=-1)))
    h = plt.contourf(x, y, z.reshape(xx.shape))
    plt.title('Likelihood')
    plt.axis('equal')
    plt.show()

    print('Evidence computed analytically = {}'.format(estimate_evidence_from_quadrature()[0]))


.. GENERATED FROM PYTHON SOURCE LINES 124-151

.. code-block:: default


    from UQpy.sampling.mcmc import MetropolisHastings

    seed = -2. + 4. * np.random.rand(100, 2)
    mcmc0 = MetropolisHastings(log_pdf_target=log_target, burn_length=100, jump=3, seed=list(seed), dimension=2,
                               random_state=123, save_log_pdf=True)
    mcmc0.run(nsamples_per_chain=200)

    print(mcmc0.samples.shape)
    fig, ax = plt.subplots(ncols=1, figsize=(6, 4))
    ax.scatter(mcmc0.samples[:, 0], mcmc0.samples[:, 1], alpha=0.5)
    ax.set_xlim([-2, 2])
    ax.set_ylim([-2, 2])
    plt.show()


    def estimate_evidence_from_posterior_samples(log_posterior_values, posterior_samples):
        log_like = log_likelihood(posterior_samples)  # log_posterior_values - log_prior(posterior_samples)
        ev = 1. / np.mean(1. / np.exp(log_like))
        return ev


    evidence = estimate_evidence_from_posterior_samples(
        log_posterior_values=mcmc0.log_pdf_values, posterior_samples=mcmc0.samples)
    print('Estimated evidence by HM={}'.format(evidence))


.. GENERATED FROM PYTHON SOURCE LINES 155-160

.. code-block:: default


    def log_intermediate(x, temper_param):
        return temper_param * log_likelihood(x)  # + (1. - 1. / temperature) * log_prior(x)


.. GENERATED FROM PYTHON SOURCE LINES 164-184

.. code-block:: default


    from UQpy.sampling import MetropolisHastings, ParallelTemperingMCMC

    seed = -2. + 4. * np.random.rand(5, 2)
    betas = [1. / np.sqrt(2.) ** i for i in range(20 - 1, -1, -1)]
    print(len(betas))
    print(betas)

    samplers = [MetropolisHastings(burn_length=100, jump=3, seed=list(seed), dimension=2) for _ in range(len(betas))]
    mcmc = ParallelTemperingMCMC(log_pdf_intermediate=log_intermediate,
                                 distribution_reference=prior_distribution,
                                 n_iterations_between_sweeps=10,
                                 tempering_parameters=betas,
                                 random_state=123,
                                 save_log_pdf=True, samplers=samplers)

    mcmc.run(nsamples_per_chain=200)
    print(mcmc.samples.shape)
    print(mcmc.mcmc_samplers[-1].samples.shape)


.. GENERATED FROM PYTHON SOURCE LINES 188-206

.. code-block:: default


    # the intermediate samples can be accessed via the mcmc_samplers.samples attributes or
    # directly via the intermediate_samples attribute
    fig, ax = plt.subplots(ncols=3, figsize=(12, 3.5))
    for j, ind in enumerate([0, -6, -1]):
        ax[j].scatter(mcmc.mcmc_samplers[ind].samples[:, 0], mcmc.mcmc_samplers[ind].samples[:, 1], alpha=0.5,
                      color='orange')
        # ax[j].scatter(mcmc.intermediate_samples[ind][:, 0], mcmc.intermediate_samples[ind][:, 1], alpha=0.5,
        #              color='orange')
        ax[j].set_xlim([-2, 2])
        ax[j].set_ylim([-2, 2])
        ax[j].set_title(r'$\beta$ = {:.3f}'.format(mcmc.tempering_parameters[ind]), fontsize=16)
        ax[j].set_xlabel(r'$\theta_{1}$', fontsize=14)
        ax[j].set_ylabel(r'$\theta_{2}$', fontsize=14)
    plt.tight_layout()
    plt.show()


.. GENERATED FROM PYTHON SOURCE LINES 210-216

.. code-block:: default


    def compute_potential(x, temper_param, log_intermediate_values):
        """  """
        return log_intermediate_values / temper_param


.. GENERATED FROM PYTHON SOURCE LINES 220-227

.. code-block:: default


    ev = mcmc.evaluate_normalization_constant(compute_potential=compute_potential, log_Z0=0.)
    print('Estimate of evidence by thermodynamic integration = {:.4f}'.format(ev))

    ev = mcmc.evaluate_normalization_constant(compute_potential=compute_potential, nsamples_from_p0=5000)
    print('Estimate of evidence by thermodynamic integration = {:.4f}'.format(ev))


.. GENERATED FROM PYTHON SOURCE LINES 228-238

Reliability
------------
In a reliability context, :math:`p_{0}` is the pdf of the parameters and we have:

.. math:: q_{\beta}(x) = I_{\beta}(x) = \frac{1}{1 + \exp{ \left( \frac{G(x)}{1/\beta-1}\right)}}

where :math:`G(x)` is the performance function, negative if the system fails, and :math:`I_{\beta}(x)` are smoothed
versions of the indicator function. Then to compute the probability of failure, the potential can be computed as:

.. math:: U_{\beta}(x) = \frac{- \frac{G(x)}{(1-\beta)^2}}{1 + \exp{ \left( -\frac{G(x)}{1/\beta-1} \right) }} = - \frac{1 - I_{\beta}(x)}{\beta (1 - \beta)} \ln{ \left[ \frac{1 - I_{\beta}(x)}{I_{\beta}(x)} \right] }

.. GENERATED FROM PYTHON SOURCE LINES 240-258

.. code-block:: default


    from scipy.stats import norm


    def indic_sigmoid(y, beta):
        return 1. / (1. + np.exp(y / (1. / beta - 1.)))


    fig, ax = plt.subplots(figsize=(4, 3.5))
    ys = np.linspace(-5, 5, 100)
    for i, s in enumerate(1. / np.array([1.01, 1.25, 2., 4., 70.])):
        ax.plot(ys, indic_sigmoid(y=ys, beta=s), label=r'$\beta={:.2f}$'.format(s), color='blue', alpha=1. - i / 6)
    ax.set_xlabel(r'$y=g(\theta)$', fontsize=13)
    ax.set_ylabel(r'$q_{\beta}(\theta)=I_{\beta}(y)$', fontsize=13)
    # ax.set_title(r'Smooth versions of the indicator function', fontsize=14)
    ax.legend(fontsize=8.5)
    plt.show()


.. GENERATED FROM PYTHON SOURCE LINES 262-282

.. code-block:: default


    beta = 2  # Specified Reliability Index
    rho = 0.7  # Specified Correlation
    dim = 2  # Dimension

    # Define the correlation matrix
    C = np.ones((dim, dim)) * rho
    np.fill_diagonal(C, 1)
    print(C)

    # Print information related to the true probability of failure
    e, v = np.linalg.eig(np.asarray(C))
    beff = np.sqrt(np.max(e)) * beta
    print(beff)
    from scipy.stats import norm

    pf_true = norm.cdf(-beta)
    print('True pf={}'.format(pf_true))


.. GENERATED FROM PYTHON SOURCE LINES 286-292

.. code-block:: default


    def estimate_Pf_0(samples, model_values):
        mask = model_values <= 0
        return np.sum(mask) / len(mask)


.. GENERATED FROM PYTHON SOURCE LINES 296-317

.. code-block:: default


    # Sample from the prior
    model = RunModel(model=PythonModel(model_script='local_reliability_funcs.py', model_object_name="correlated_gaussian",
                                       b_eff=beff, d=dim))
    samples = MultivariateNormal(mean=np.zeros((2,)), cov=np.array([[1, 0.7], [0.7, 1]])).rvs(nsamples=20000)
    model.run(samples=samples, append_samples=False)
    model_values = np.array(model.qoi_list)

    print('Prob. failure (MC) = {}'.format(estimate_Pf_0(samples, model_values)))

    fig, ax = plt.subplots(figsize=(4, 3.5))
    mask = model_values <= 0
    ax.scatter(samples[np.squeeze(mask), 0], samples[np.squeeze(mask), 1], color='red', label='fail', alpha=0.5, marker='d')
    ax.scatter(samples[~np.squeeze(mask), 0], samples[~np.squeeze(mask), 1], color='blue', label='safe', alpha=0.5)
    plt.axis('equal')
    plt.xlabel(r'$\theta_{1}$', fontsize=13)
    plt.ylabel(r'$\theta_{2}$', fontsize=13)
    ax.legend(fontsize=13)
    fig.tight_layout()
    plt.show()


.. GENERATED FROM PYTHON SOURCE LINES 321-331

.. code-block:: default


    distribution_reference = MultivariateNormal(mean=np.zeros((2,)), cov=np.array([[1, 0.7], [0.7, 1]]))


    def log_factor_temp(x, temper_param):
        model.run(samples=x, append_samples=False)
        G_values = np.array(model.qoi_list)
        return np.squeeze(np.log(indic_sigmoid(G_values, temper_param)))


.. GENERATED FROM PYTHON SOURCE LINES 335-354

.. code-block:: default


    betas = (1. / np.array([1.01, 1.02, 1.05, 1.1, 1.2, 1.5, 2., 3., 5., 10., 25., 70.]))[::-1]

    print(len(betas))
    print(betas)

    fig, ax = plt.subplots(figsize=(5, 4))
    ys = np.linspace(-5, 5, 100)
    for i, s in enumerate(betas):
        ax.plot(ys, indic_sigmoid(y=ys, beta=s), label=r'$\beta={:.2f}$'.format(s), color='blue', alpha=1. - i / 15)
    ax.set_xlabel(r'$y=g(\theta)$', fontsize=13)
    ax.set_ylabel(r'$I_{\beta}(y)$', fontsize=13)
    ax.set_title(r'Smooth versions of the indicator function', fontsize=14)
    ax.legend()
    plt.show()

    scales = [0.1 / np.sqrt(beta) for beta in betas]
    print(scales)


.. GENERATED FROM PYTHON SOURCE LINES 358-378

.. code-block:: default


    from UQpy.sampling import MetropolisHastings, ParallelTemperingMCMC

    seed = -2. + 4. * np.random.rand(5, 2)

    print(betas)
    samplers = [MetropolisHastings(burn_length=5000, jump=5, seed=list(seed), dimension=2,
                                   proposal_is_symmetric=True,
                                   proposal=JointIndependent([Normal(scale=scale)] * 2)) for scale in scales]
    mcmc = ParallelTemperingMCMC(log_pdf_intermediate=log_factor_temp,
                                 distribution_reference=distribution_reference,
                                 n_iterations_between_sweeps=10,
                                 tempering_parameters=list(betas),
                                 random_state=123,
                                 save_log_pdf=True, samplers=samplers)

    mcmc.run(nsamples_per_chain=250)
    print(mcmc.samples.shape)
    print(mcmc.mcmc_samplers[0].samples.shape)


.. GENERATED FROM PYTHON SOURCE LINES 382-395

.. code-block:: default


    fig, ax = plt.subplots(ncols=3, figsize=(12, 3.5))
    for j, ind in enumerate([0, 6, -1]):
        ax[j].scatter(mcmc.mcmc_samplers[ind].samples[:, 0], mcmc.mcmc_samplers[ind].samples[:, 1], alpha=0.25)
        ax[j].set_xlim([-4, 4])
        ax[j].set_ylim([-4, 4])
        ax[j].set_title(r'$\beta$ = {:.3f}'.format(mcmc.tempering_parameters[ind]), fontsize=15)
        ax[j].set_xlabel(r'$\theta_{1}$', fontsize=13)
        ax[j].set_ylabel(r'$\theta_{2}$', fontsize=13)
    fig.tight_layout()
    plt.show()


.. GENERATED FROM PYTHON SOURCE LINES 399-408

.. code-block:: default


    def compute_potential(x, temper_param, log_intermediate_values):
        indic_beta = np.exp(log_intermediate_values)
        indic_beta = np.where(indic_beta > 1. - 1e-16, 1. - 1e-16, indic_beta)
        indic_beta = np.where(indic_beta < 1e-16, 1e-16, indic_beta)
        tmp_log = np.log((1. - indic_beta) / indic_beta)
        return - (1. - indic_beta) / (temper_param * (1. - temper_param)) * tmp_log


.. GENERATED FROM PYTHON SOURCE LINES 412-416

.. code-block:: default


    ev = mcmc.evaluate_normalization_constant(compute_potential=compute_potential, log_Z0=np.log(0.5))
    print('Estimate of evidence by thermodynamic integration = {}'.format(ev))


.. GENERATED FROM PYTHON SOURCE LINES 420-425

.. code-block:: default


    plt.plot(mcmc.thermodynamic_integration_results['temper_param_list'],
             mcmc.thermodynamic_integration_results['expect_potentials'], marker='x')
    plt.grid(True)


.. GENERATED FROM PYTHON SOURCE LINES 429-453

.. code-block:: default


    seed = -2. + 4. * np.random.rand(5, 2)

    samplers = [MetropolisHastings(burn_length=5000, jump=5, seed=list(seed), dimension=2,
                                   proposal_is_symmetric=True,
                                   proposal=JointIndependent([Normal(scale=scale)] * 2)) for scale in scales]
    mcmc = ParallelTemperingMCMC(log_pdf_intermediate=log_factor_temp,
                                 distribution_reference=distribution_reference,
                                 n_iterations_between_sweeps=10,
                                 tempering_parameters=list(betas),
                                 random_state=123,
                                 save_log_pdf=True, samplers=samplers)

    list_ev_0, list_ev_1 = [], []
    nsamples_per_chain = 0
    for i in range(50):
        nsamples_per_chain += 50
        mcmc.run(nsamples_per_chain=nsamples_per_chain)
        ev = mcmc.evaluate_normalization_constant(compute_potential=compute_potential, log_Z0=np.log(0.5))
        # print(np.exp(log_ev))
        list_ev_0.append(ev)
        ev = mcmc.evaluate_normalization_constant(compute_potential=compute_potential, nsamples_from_p0=100000)
        list_ev_1.append(ev)


.. GENERATED FROM PYTHON SOURCE LINES 457-465

.. code-block:: default


    fig, ax = plt.subplots(figsize=(5, 3.5))
    list_samples = [5 * i * 50 for i in range(1, 51)]
    ax.plot(list_samples, list_ev_0)
    ax.grid(True)
    ax.set_ylabel(r'$Z_{1}$ = proba. failure', fontsize=14)
    ax.set_xlabel(r'nb. saved samples per chain', fontsize=14)
    plt.show()


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** ( 0 minutes  0.000 seconds)


.. _sphx_glr_download_auto_examples_sampling_tempering_parallel_tempering.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example


    .. container:: binder-badge

      .. image:: images/binder_badge_logo.svg
        :target: https://mybinder.org/v2/gh/SURGroup/UQpy/master?urlpath=lab/tree/notebooks/auto_examples/sampling/tempering/parallel_tempering.ipynb
        :alt: Launch binder
        :width: 150 px


    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: parallel_tempering.py <parallel_tempering.py>`

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: parallel_tempering.ipynb <parallel_tempering.ipynb>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_