.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "examples_regression/2-advanced-analysis/plot_main-tutorial-regression.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_examples_regression_2-advanced-analysis_plot_main-tutorial-regression.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_examples_regression_2-advanced-analysis_plot_main-tutorial-regression.py:


# Comparison between conformalized quantile regressor and cross methods


In this tutorial, we compare the prediction intervals estimated by MAPIE on a
simple, one-dimensional, ground truth function `f(x) = x * sin(x)`.
Throughout this tutorial, we will answer the following questions:

- How well do the MAPIE strategies capture the aleatoric uncertainty
  existing in the data?

- How do the prediction intervals estimated by the resampling strategies
  evolve for new *out-of-distribution* data ?

- How do the prediction intervals vary between regressor models ?

Throughout this tutorial, we estimate the prediction intervals first using
a polynomial function, and then using a boosting model, and a simple neural
network.
**For practical problems, we advise using the faster CV+ or
Jackknife+-after-Bootstrap strategies.
For conservative prediction interval estimates, you can alternatively
use the CV-minmax strategies.**

.. GENERATED FROM PYTHON SOURCE LINES 26-51

.. code-block:: Python


    # sphinx_gallery_thumbnail_number = 2

    import os
    import warnings

    import matplotlib.pyplot as plt
    import numpy as np
    import pandas as pd
    from sklearn.linear_model import LinearRegression, QuantileRegressor
    from sklearn.model_selection import train_test_split
    from sklearn.pipeline import Pipeline
    from sklearn.preprocessing import PolynomialFeatures

    from mapie.metrics.regression import regression_coverage_score
    from mapie.regression import (
        ConformalizedQuantileRegressor,
        CrossConformalRegressor,
        JackknifeAfterBootstrapRegressor,
    )

    os.environ["TF_CPP_MIN_LOG_LEVEL"] = "3"
    warnings.filterwarnings("ignore")


.. GENERATED FROM PYTHON SOURCE LINES 52-58

1. Estimating the aleatoric uncertainty of homoscedastic noisy data
-------------------------------------------------------------------

Let's start by defining the `x * sin(x)` function and another
simple function that generates one-dimensional data with normal noise
uniformely in a given interval.

.. GENERATED FROM PYTHON SOURCE LINES 58-80

.. code-block:: Python


    def x_sinx(x):
        """One-dimensional x*sin(x) function."""
        return x * np.sin(x)


    def get_1d_data_with_constant_noise(funct, min_x, max_x, n_samples, noise):
        """
        Generate 1D noisy data uniformely from the given function
        and standard deviation for the noise.
        """
        rng = np.random.default_rng(59)
        X_train = np.linspace(min_x, max_x, n_samples)
        rng.shuffle(X_train)
        X_test = np.linspace(min_x, max_x, n_samples)
        y_train, y_mesh, y_test = funct(X_train), funct(X_test), funct(X_test)
        y_train += rng.normal(0, noise, y_train.shape[0])
        y_test += rng.normal(0, noise, y_test.shape[0])
        return (X_train.reshape(-1, 1), y_train, X_test.reshape(-1, 1), y_test, y_mesh)


.. GENERATED FROM PYTHON SOURCE LINES 81-84

We first generate noisy one-dimensional data uniformely on an interval.
Here, the noise is considered as *homoscedastic*, since it remains constant
over `x`.

.. GENERATED FROM PYTHON SOURCE LINES 84-90

.. code-block:: Python


    min_x, max_x, n_samples, noise = -5, 5, 600, 0.5
    X_train_conformalize, y_train_conformalize, X_test, y_test, y_mesh = (
        get_1d_data_with_constant_noise(x_sinx, min_x, max_x, n_samples, noise)
    )


.. GENERATED FROM PYTHON SOURCE LINES 91-92

Let's visualize our noisy function.

.. GENERATED FROM PYTHON SOURCE LINES 92-99

.. code-block:: Python


    plt.xlabel("x")
    plt.ylabel("y")
    plt.scatter(X_train_conformalize, y_train_conformalize, color="C0")
    _ = plt.plot(X_test, y_mesh, color="C1")
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_001.png
   :alt: plot main tutorial regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_001.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 100-103

As mentioned previously, we fit our training data with a simple
polynomial function. Here, we choose a degree equal to 10 so the function
is able to perfectly fit `x * sin(x)`.

.. GENERATED FROM PYTHON SOURCE LINES 103-121

.. code-block:: Python


    DEGREE_POLYN = 10
    polyn_model = Pipeline(
        [("poly", PolynomialFeatures(degree=DEGREE_POLYN)), ("linear", LinearRegression())]
    )
    polyn_model_quant = Pipeline(
        [
            ("poly", PolynomialFeatures(degree=DEGREE_POLYN)),
            (
                "linear",
                QuantileRegressor(
                    solver="highs",
                    alpha=0,
                ),
            ),
        ]
    )


.. GENERATED FROM PYTHON SOURCE LINES 122-127

We then estimate the prediction intervals for all the strategies very easily
with a
`fit` and `predict` process. The prediction interval's lower and upper bounds
are then saved in a DataFrame. Here, we set confidence_level=0.95
in order to obtain a 95% confidence for our prediction intervals.

.. GENERATED FROM PYTHON SOURCE LINES 127-191

.. code-block:: Python

    RANDOM_STATE = 1
    STRATEGIES = {
        "cv": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="base", cv=10),
        },
        "cv_plus": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="plus", cv=10),
        },
        "cv_minmax": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="minmax", cv=10),
        },
        "jackknife": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="base", cv=-1),
        },
        "jackknife_plus": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="plus", cv=-1),
        },
        "jackknife_minmax": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="minmax", cv=-1),
        },
        "jackknife_plus_ab": {
            "class": JackknifeAfterBootstrapRegressor,
            "init_params": dict(method="plus", resampling=50),
        },
        "jackknife_minmax_ab": {
            "class": JackknifeAfterBootstrapRegressor,
            "init_params": dict(method="minmax", resampling=50),
        },
        "conformalized_quantile_regression": {
            "class": ConformalizedQuantileRegressor,
            "init_params": dict(),
        },
    }


    y_pred, y_pis = {}, {}
    for strategy_name, strategy_params in STRATEGIES.items():
        init_params = strategy_params["init_params"]
        class_ = strategy_params["class"]
        if strategy_name == "conformalized_quantile_regression":
            X_train, X_conformalize, y_train, y_conformalize = train_test_split(
                X_train_conformalize,
                y_train_conformalize,
                test_size=0.3,
                random_state=RANDOM_STATE,
            )
            mapie = class_(polyn_model_quant, confidence_level=0.95, **init_params)
            mapie.fit(X_train, y_train)
            mapie.conformalize(X_conformalize, y_conformalize)
            y_pred[strategy_name], y_pis[strategy_name] = mapie.predict_interval(X_test)
        else:
            mapie = class_(
                polyn_model, confidence_level=0.95, random_state=RANDOM_STATE, **init_params
            )
            mapie.fit_conformalize(X_train_conformalize, y_train_conformalize)
            y_pred[strategy_name], y_pis[strategy_name] = mapie.predict_interval(X_test)


.. GENERATED FROM PYTHON SOURCE LINES 192-198

Let’s now compare the target confidence intervals with the predicted
intervals obtained with the Jackknife+, Jackknife-minmax, CV+, CV-minmax,
Jackknife+-after-Boostrap, and conformalized quantile regression (CQR)
strategies. Note that when the CQR method is called thanks to
`ConformalizedQuantileRegressor` with prefit=False,
it will use a "split" strategy.

.. GENERATED FROM PYTHON SOURCE LINES 198-250

.. code-block:: Python


    def plot_1d_data(
        X_test, y_test, y_mesh, y_sigma, y_pred, y_pred_low, y_pred_up, ax=None, title=None
    ):
        ax.set_xlabel("x")
        ax.set_ylabel("y")
        ax.fill_between(
            X_test, y_pred_low, y_pred_up, alpha=0.3, label="Prediction intervals"
        )
        ax.scatter(X_test, y_test, color="red", alpha=0.3, label="Test data")
        ax.plot(X_test, y_mesh, color="gray")
        ax.plot(
            X_test,
            y_mesh - y_sigma,
            color="gray",
            ls="--",
            label="True confidence intervals",
        )
        ax.plot(X_test, y_mesh + y_sigma, color="gray", ls="--")
        ax.plot(X_test, y_pred, color="blue", alpha=0.5, label="y_pred")
        if title is not None:
            ax.set_title(title)
        ax.legend()


    strategies = [
        "jackknife_plus",
        "jackknife_minmax",
        "cv_plus",
        "cv_minmax",
        "jackknife_plus_ab",
        "conformalized_quantile_regression",
    ]

    n_figs = len(strategies)
    fig, axs = plt.subplots(3, 2, figsize=(9, 13))
    coords = [axs[0, 0], axs[0, 1], axs[1, 0], axs[1, 1], axs[2, 0], axs[2, 1]]
    for strategy, coord in zip(strategies, coords):
        plot_1d_data(
            X_test.ravel(),
            y_test.ravel(),
            y_mesh.ravel(),
            np.full((X_test.shape[0]), 1.96 * noise).ravel(),
            y_pred[strategy].ravel(),
            y_pis[strategy][:, 0, 0].ravel(),
            y_pis[strategy][:, 1, 0].ravel(),
            ax=coord,
            title=strategy,
        )
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_002.png
   :alt: jackknife_plus, jackknife_minmax, cv_plus, cv_minmax, jackknife_plus_ab, conformalized_quantile_regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_002.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 251-255

At first glance, the strategies give similar results and the
prediction intervals are very close to the true confidence intervals.
Let’s confirm this by comparing the prediction interval widths over
`x` between all strategies.

.. GENERATED FROM PYTHON SOURCE LINES 255-267

.. code-block:: Python


    fig, ax = plt.subplots(1, 1, figsize=(9, 5))
    ax.axhline(1.96 * 2 * noise, ls="--", color="k", label="True width")
    for strategy in STRATEGIES:
        ax.plot(X_test, y_pis[strategy][:, 1, 0] - y_pis[strategy][:, 0, 0], label=strategy)
    ax.set_xlabel("x")
    ax.set_ylabel("Prediction Interval Width")
    ax.legend(fontsize=8)
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_003.png
   :alt: plot main tutorial regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_003.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 268-285

The Jackknife, Jackknife+, CV, CV+, and J+aB
give
similar widths that are very close to the true width. On the other hand,
the width estimated by Jackknife-minmax, Jackknife-minmax-after-Boostrap
and CV-minmax are slightly too
wide. Note that the widths given by Jackknife and CV strategies
are constant because there is a single model used for prediction,
perturbed models are ignored at prediction time.

It's interesting to observe that CQR strategy offers more varying width,
often giving much higher but also lower interval width than other methods,
therefore,
with homoscedastic noise, CQR would not be the preferred method.

Let’s now compare the *effective* coverage, namely the fraction of test
points whose true values lie within the prediction intervals, given by
the different strategies.

.. GENERATED FROM PYTHON SOURCE LINES 285-300

.. code-block:: Python


    pd.DataFrame(
        [
            [
                regression_coverage_score(y_test, y_pis[strategy])[0],
                (y_pis[strategy][:, 1, 0] - y_pis[strategy][:, 0, 0]).mean(),
            ]
            for strategy in STRATEGIES
        ],
        index=STRATEGIES,
        columns=["Coverage", "Width average"],
    ).round(2)


.. raw:: html

    <div class="output_subarea output_html rendered_html output_result">
    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }

        .dataframe tbody tr th {
            vertical-align: top;
        }

        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>Coverage</th>
          <th>Width average</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>cv</th>
          <td>0.96</td>
          <td>1.94</td>
        </tr>
        <tr>
          <th>cv_plus</th>
          <td>0.96</td>
          <td>1.94</td>
        </tr>
        <tr>
          <th>cv_minmax</th>
          <td>0.96</td>
          <td>2.02</td>
        </tr>
        <tr>
          <th>jackknife</th>
          <td>0.96</td>
          <td>1.95</td>
        </tr>
        <tr>
          <th>jackknife_plus</th>
          <td>0.96</td>
          <td>1.95</td>
        </tr>
        <tr>
          <th>jackknife_minmax</th>
          <td>0.96</td>
          <td>1.99</td>
        </tr>
        <tr>
          <th>jackknife_plus_ab</th>
          <td>0.96</td>
          <td>1.93</td>
        </tr>
        <tr>
          <th>jackknife_minmax_ab</th>
          <td>0.96</td>
          <td>1.98</td>
        </tr>
        <tr>
          <th>conformalized_quantile_regression</th>
          <td>0.98</td>
          <td>2.54</td>
        </tr>
      </tbody>
    </table>
    </div>
    </div>
    <br />
    <br />

.. GENERATED FROM PYTHON SOURCE LINES 301-304

All strategies give effective coverage close to the
expected 0.95 value (recall that alpha = 0.05), confirming the theoretical
garantees.

.. GENERATED FROM PYTHON SOURCE LINES 307-313

2. Estimating the aleatoric uncertainty of heteroscedastic noisy data
---------------------------------------------------------------------

Let's define again the `x * sin(x)` function and another simple
function that generates one-dimensional data with normal noise uniformely
in a given interval.

.. GENERATED FROM PYTHON SOURCE LINES 313-330

.. code-block:: Python


    def get_1d_data_with_heteroscedastic_noise(funct, min_x, max_x, n_samples, noise):
        """
        Generate 1D noisy data uniformely from the given function
        and standard deviation for the noise.
        """
        rng = np.random.default_rng(59)
        X_train = np.linspace(min_x, max_x, n_samples)
        rng.shuffle(X_train)
        X_test = np.linspace(min_x, max_x, n_samples)
        y_train = funct(X_train) + (rng.normal(0, noise, len(X_train)) * X_train)
        y_test = funct(X_test) + (rng.normal(0, noise, len(X_test)) * X_test)
        y_mesh = funct(X_test)
        return (X_train.reshape(-1, 1), y_train, X_test.reshape(-1, 1), y_test, y_mesh)


.. GENERATED FROM PYTHON SOURCE LINES 331-334

We first generate noisy one-dimensional data uniformely on an interval.
Here, the noise is considered as *heteroscedastic*, since it will increase
linearly with `x`.

.. GENERATED FROM PYTHON SOURCE LINES 334-342

.. code-block:: Python


    min_x, max_x, n_samples, noise = 0, 5, 300, 0.5
    (X_train_conformalize, y_train_conformalize, X_test, y_test, y_mesh) = (
        get_1d_data_with_heteroscedastic_noise(x_sinx, min_x, max_x, n_samples, noise)
    )


.. GENERATED FROM PYTHON SOURCE LINES 343-345

Let's visualize our noisy function. As x increases, the data becomes more
noisy.

.. GENERATED FROM PYTHON SOURCE LINES 345-352

.. code-block:: Python


    plt.xlabel("x")
    plt.ylabel("y")
    plt.scatter(X_train_conformalize, y_train_conformalize, color="C0")
    plt.plot(X_test, y_mesh, color="C1")
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_004.png
   :alt: plot main tutorial regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_004.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 353-356

As mentioned previously, we fit our training data with a simple
polynomial function. Here, we choose a degree equal to 10 so the function
is able to perfectly fit `x * sin(x)`.

.. GENERATED FROM PYTHON SOURCE LINES 356-374

.. code-block:: Python


    DEGREE_POLYN = 10
    polyn_model = Pipeline(
        [("poly", PolynomialFeatures(degree=DEGREE_POLYN)), ("linear", LinearRegression())]
    )
    polyn_model_quant = Pipeline(
        [
            ("poly", PolynomialFeatures(degree=DEGREE_POLYN)),
            (
                "linear",
                QuantileRegressor(
                    solver="highs",
                    alpha=0,
                ),
            ),
        ]
    )


.. GENERATED FROM PYTHON SOURCE LINES 375-380

We then estimate the prediction intervals for all the strategies very easily
with a
`fit` and `predict` process. The prediction interval's lower and upper bounds
are then saved in a DataFrame. Here, we set confidence_level=0.95
in order to obtain a 95% confidence for our prediction intervals.

.. GENERATED FROM PYTHON SOURCE LINES 380-442

.. code-block:: Python


    STRATEGIES = {
        "cv": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="base", cv=10),
        },
        "cv_plus": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="plus", cv=10),
        },
        "cv_minmax": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="minmax", cv=10),
        },
        "jackknife": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="base", cv=-1),
        },
        "jackknife_plus": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="plus", cv=-1),
        },
        "jackknife_minmax": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="minmax", cv=-1),
        },
        "jackknife_plus_ab": {
            "class": JackknifeAfterBootstrapRegressor,
            "init_params": dict(method="plus", resampling=50),
        },
        "jackknife_minmax_ab": {
            "class": JackknifeAfterBootstrapRegressor,
            "init_params": dict(method="minmax", resampling=50),
        },
        "conformalized_quantile_regression": {
            "class": ConformalizedQuantileRegressor,
            "init_params": dict(),
        },
    }

    y_pred, y_pis = {}, {}
    for strategy_name, strategy_params in STRATEGIES.items():
        init_params = strategy_params["init_params"]
        class_ = strategy_params["class"]
        if strategy_name == "conformalized_quantile_regression":
            X_train, X_conformalize, y_train, y_conformalize = train_test_split(
                X_train_conformalize,
                y_train_conformalize,
                test_size=0.3,
                random_state=RANDOM_STATE,
            )
            mapie = class_(polyn_model_quant, confidence_level=0.95, **init_params)
            mapie.fit(X_train, y_train)
            mapie.conformalize(X_conformalize, y_conformalize)
            y_pred[strategy_name], y_pis[strategy_name] = mapie.predict_interval(X_test)
        else:
            mapie = class_(
                polyn_model, confidence_level=0.95, random_state=RANDOM_STATE, **init_params
            )
            mapie.fit_conformalize(X_train_conformalize, y_train_conformalize)
            y_pred[strategy_name], y_pis[strategy_name] = mapie.predict_interval(X_test)


.. GENERATED FROM PYTHON SOURCE LINES 443-446

Once again, let’s compare the target confidence intervals with prediction
intervals obtained with the Jackknife+, Jackknife-minmax, CV+, CV-minmax,
Jackknife+-after-Boostrap, and CQR strategies.

.. GENERATED FROM PYTHON SOURCE LINES 446-472

.. code-block:: Python


    strategies = [
        "jackknife_plus",
        "jackknife_minmax",
        "cv_plus",
        "cv_minmax",
        "jackknife_plus_ab",
        "conformalized_quantile_regression",
    ]
    n_figs = len(strategies)
    fig, axs = plt.subplots(3, 2, figsize=(9, 13))
    coords = [axs[0, 0], axs[0, 1], axs[1, 0], axs[1, 1], axs[2, 0], axs[2, 1]]
    for strategy, coord in zip(strategies, coords):
        plot_1d_data(
            X_test.ravel(),
            y_test.ravel(),
            y_mesh.ravel(),
            (1.96 * noise * X_test).ravel(),
            y_pred[strategy].ravel(),
            y_pis[strategy][:, 0, 0].ravel(),
            y_pis[strategy][:, 1, 0].ravel(),
            ax=coord,
            title=strategy,
        )
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_005.png
   :alt: jackknife_plus, jackknife_minmax, cv_plus, cv_minmax, jackknife_plus_ab, conformalized_quantile_regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_005.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 473-477

We can observe that all of the strategies except CQR seem to have similar
constant prediction intervals.
On the other hand, the CQR strategy offers a solution that adapts the
prediction intervals to the local noise.

.. GENERATED FROM PYTHON SOURCE LINES 477-488

.. code-block:: Python


    fig, ax = plt.subplots(1, 1, figsize=(7, 5))
    ax.plot(X_test, 1.96 * 2 * noise * X_test, ls="--", color="k", label="True width")
    for strategy in STRATEGIES:
        ax.plot(X_test, y_pis[strategy][:, 1, 0] - y_pis[strategy][:, 0, 0], label=strategy)
    ax.set_xlabel("x")
    ax.set_ylabel("Prediction Interval Width")
    ax.legend(fontsize=8)
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_006.png
   :alt: plot main tutorial regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_006.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 489-500

One can observe that all the strategies behave in a similar way as in the
first example shown previously. One exception is the CQR method which takes
into account the heteroscedasticity of the data. In this method we observe
very low interval widths at low values of `x`.
This is the only method that
even slightly follows the true width, and therefore is the preferred method
for heteroscedastic data. Notice also that the true width is greater (lower)
than the predicted width from the other methods at `x ≳ 3`
(`x ≤ 3`). This means that while the marginal coverage is correct for
these methods, the conditional coverage is likely not guaranteed as we will
observe in the next figure.

.. GENERATED FROM PYTHON SOURCE LINES 500-534

.. code-block:: Python


    def get_heteroscedastic_coverage(y_test, y_pis, STRATEGIES, bins):
        recap = {}
        for i in range(len(bins) - 1):
            bin1, bin2 = bins[i], bins[i + 1]
            name = f"[{bin1}, {bin2}]"
            recap[name] = []
            for strategy in STRATEGIES:
                indices = np.where((X_test >= bins[i]) * (X_test <= bins[i + 1]))
                y_test_trunc = np.take(y_test, indices)
                y_low_ = np.take(y_pis[strategy][:, 0, 0], indices)
                y_high_ = np.take(y_pis[strategy][:, 1, 0], indices)
                score_coverage = regression_coverage_score(
                    y_test_trunc[0], np.stack((y_low_[0], y_high_[0]), axis=-1)
                )[0]
                recap[name].append(score_coverage)
        recap_df = pd.DataFrame(recap, index=STRATEGIES)
        return recap_df


    bins = [0, 1, 2, 3, 4, 5]
    heteroscedastic_coverage = get_heteroscedastic_coverage(y_test, y_pis, STRATEGIES, bins)

    # fig = plt.figure()
    heteroscedastic_coverage.T.plot.bar(figsize=(12, 5), alpha=0.7)
    plt.axhline(0.95, ls="--", color="k")
    plt.ylabel("Conditional coverage")
    plt.xlabel("x bins")
    plt.xticks(rotation=0)
    plt.ylim(0.8, 1.0)
    plt.legend(fontsize=8, loc=[0, 0])
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_007.png
   :alt: plot main tutorial regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_007.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 535-539

Let’s now conclude by summarizing the *effective* coverage, namely the
fraction of test
points whose true values lie within the prediction intervals, given by
the different strategies.

.. GENERATED FROM PYTHON SOURCE LINES 539-553

.. code-block:: Python


    pd.DataFrame(
        [
            [
                regression_coverage_score(y_test, y_pis[strategy])[0],
                (y_pis[strategy][:, 1, 0] - y_pis[strategy][:, 0, 0]).mean(),
            ]
            for strategy in STRATEGIES
        ],
        index=STRATEGIES,
        columns=["Coverage", "Width average"],
    ).round(2)


.. raw:: html

    <div class="output_subarea output_html rendered_html output_result">
    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }

        .dataframe tbody tr th {
            vertical-align: top;
        }

        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>Coverage</th>
          <th>Width average</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>cv</th>
          <td>0.96</td>
          <td>6.53</td>
        </tr>
        <tr>
          <th>cv_plus</th>
          <td>0.96</td>
          <td>6.55</td>
        </tr>
        <tr>
          <th>cv_minmax</th>
          <td>0.98</td>
          <td>6.80</td>
        </tr>
        <tr>
          <th>jackknife</th>
          <td>0.96</td>
          <td>6.50</td>
        </tr>
        <tr>
          <th>jackknife_plus</th>
          <td>0.96</td>
          <td>6.51</td>
        </tr>
        <tr>
          <th>jackknife_minmax</th>
          <td>0.97</td>
          <td>6.66</td>
        </tr>
        <tr>
          <th>jackknife_plus_ab</th>
          <td>0.96</td>
          <td>6.34</td>
        </tr>
        <tr>
          <th>jackknife_minmax_ab</th>
          <td>0.96</td>
          <td>6.39</td>
        </tr>
        <tr>
          <th>conformalized_quantile_regression</th>
          <td>0.97</td>
          <td>5.13</td>
        </tr>
      </tbody>
    </table>
    </div>
    </div>
    <br />
    <br />

.. GENERATED FROM PYTHON SOURCE LINES 554-557

All the strategies have the wanted coverage, however, we notice that the CQR
strategy has much lower interval width than all the other methods, therefore,
with heteroscedastic noise, CQR would be the preferred method.

.. GENERATED FROM PYTHON SOURCE LINES 560-573

3. Estimating the epistemic uncertainty of out-of-distribution data
-------------------------------------------------------------------

Let’s now consider one-dimensional data without noise, but normally
distributed.
The goal is to explore how the prediction intervals evolve for new data
that lie outside the distribution of the training data in order to see how
the strategies can capture the *epistemic* uncertainty.
For a comparison of the epistemic and aleatoric uncertainties, please have
a look at this source:
https://en.wikipedia.org/wiki/Uncertainty_quantification.

Let's start by generating and showing the data.

.. GENERATED FROM PYTHON SOURCE LINES 573-599

.. code-block:: Python


    def get_1d_data_with_normal_distrib(funct, mu, sigma, n_samples, noise):
        """
        Generate noisy 1D data with normal distribution from given function
        and noise standard deviation.
        """
        rng = np.random.default_rng(59)
        X_train = rng.normal(mu, sigma, n_samples)
        X_test = np.arange(mu - 4 * sigma, mu + 4 * sigma, sigma / 10.0)
        y_train, y_mesh, y_test = funct(X_train), funct(X_test), funct(X_test)
        y_train += rng.normal(0, noise, y_train.shape[0])
        y_test += rng.normal(0, noise, y_test.shape[0])
        return (X_train.reshape(-1, 1), y_train, X_test.reshape(-1, 1), y_test, y_mesh)


    mu, sigma, n_samples, noise = 0, 2, 1000, 0.0
    X_train_conformalize, y_train_conformalize, X_test, y_test, y_mesh = (
        get_1d_data_with_normal_distrib(x_sinx, mu, sigma, n_samples, noise)
    )
    plt.xlabel("x")
    plt.ylabel("y")
    plt.scatter(X_train_conformalize, y_train_conformalize, color="C0")
    _ = plt.plot(X_test, y_test, color="C1")
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_008.png
   :alt: plot main tutorial regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_008.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 600-603

As before, we estimate the prediction intervals using a polynomial
function of degree 10 and show the results for some of the
strategies.

.. GENERATED FROM PYTHON SOURCE LINES 603-705

.. code-block:: Python


    polyn_model_quant = Pipeline(
        [
            ("poly", PolynomialFeatures(degree=DEGREE_POLYN)),
            (
                "linear",
                QuantileRegressor(
                    solver="highs-ds",
                    alpha=0,
                ),
            ),
        ]
    )

    STRATEGIES = {
        "cv": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="base", cv=10),
        },
        "cv_plus": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="plus", cv=10),
        },
        "cv_minmax": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="minmax", cv=10),
        },
        "jackknife": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="base", cv=-1),
        },
        "jackknife_plus": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="plus", cv=-1),
        },
        "jackknife_minmax": {
            "class": CrossConformalRegressor,
            "init_params": dict(method="minmax", cv=-1),
        },
        "jackknife_plus_ab": {
            "class": JackknifeAfterBootstrapRegressor,
            "init_params": dict(method="plus", resampling=50),
        },
        "jackknife_minmax_ab": {
            "class": JackknifeAfterBootstrapRegressor,
            "init_params": dict(method="minmax", resampling=50),
        },
        "conformalized_quantile_regression": {
            "class": ConformalizedQuantileRegressor,
            "init_params": dict(),
        },
    }


    y_pred, y_pis = {}, {}
    for strategy_name, strategy_params in STRATEGIES.items():
        init_params = strategy_params["init_params"]
        class_ = strategy_params["class"]
        if strategy_name == "conformalized_quantile_regression":
            X_train, X_conformalize, y_train, y_conformalize = train_test_split(
                X_train_conformalize,
                y_train_conformalize,
                test_size=0.3,
                random_state=RANDOM_STATE,
            )
            mapie = class_(polyn_model_quant, confidence_level=0.95, **init_params)
            mapie.fit(X_train, y_train)
            mapie.conformalize(X_conformalize, y_conformalize)
            y_pred[strategy_name], y_pis[strategy_name] = mapie.predict_interval(X_test)
        else:
            mapie = class_(
                polyn_model, confidence_level=0.95, random_state=RANDOM_STATE, **init_params
            )
            mapie.fit_conformalize(X_train_conformalize, y_train_conformalize)
            y_pred[strategy_name], y_pis[strategy_name] = mapie.predict_interval(X_test)

    strategies = [
        "jackknife_plus",
        "jackknife_minmax",
        "cv_plus",
        "cv_minmax",
        "jackknife_plus_ab",
        "conformalized_quantile_regression",
    ]
    n_figs = len(strategies)
    fig, axs = plt.subplots(3, 2, figsize=(9, 13))
    coords = [axs[0, 0], axs[0, 1], axs[1, 0], axs[1, 1], axs[2, 0], axs[2, 1]]
    for strategy, coord in zip(strategies, coords):
        plot_1d_data(
            X_test.ravel(),
            y_test.ravel(),
            y_mesh.ravel(),
            1.96 * noise,
            y_pred[strategy].ravel(),
            y_pis[strategy][:, 0, :].ravel(),
            y_pis[strategy][:, 1, :].ravel(),
            ax=coord,
            title=strategy,
        )
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_009.png
   :alt: jackknife_plus, jackknife_minmax, cv_plus, cv_minmax, jackknife_plus_ab, conformalized_quantile_regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_009.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 706-713

At first glance, our polynomial function does not give accurate
predictions with respect to the true function when `|x| > 6`.
The prediction intervals estimated with the Jackknife+ do not seem to
increase. On the other hand, the CV and other related methods seem to capture
some uncertainty when `x > 6`.

Let's now compare the prediction interval widths between all strategies.

.. GENERATED FROM PYTHON SOURCE LINES 713-722

.. code-block:: Python


    fig, ax = plt.subplots(1, 1, figsize=(7, 5))
    for strategy in STRATEGIES:
        ax.plot(X_test, y_pis[strategy][:, 1, 0] - y_pis[strategy][:, 0, 0], label=strategy)
    ax.set_xlabel("x")
    ax.set_ylabel("Prediction Interval Width")
    ax.legend(fontsize=8)
    plt.show()


.. image-sg:: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_010.png
   :alt: plot main tutorial regression
   :srcset: /examples_regression/2-advanced-analysis/images/sphx_glr_plot_main-tutorial-regression_010.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 723-735

The prediction interval widths start to increase exponentially
for `|x| > 4` for the CV+, CV-minmax, Jackknife-minmax, and JackknifeAB
strategies. On the other hand, the prediction intervals estimated by
Jackknife+ remain roughly constant until `|x| ≈ 6` before
increasing.
The CQR strategy seems to perform well, however, on the extreme values
of the data the quantile regression fails to give reliable results as it
outputs
negative value for the prediction intervals. This occurs because the quantile
regressor with quantile `1 - α/2` gives higher values than the
quantile regressor with quantile `α/2`. Note that a warning will
be issued when this occurs.

.. GENERATED FROM PYTHON SOURCE LINES 735-748

.. code-block:: Python


    pd.DataFrame(
        [
            [
                regression_coverage_score(y_test, y_pis[strategy])[0],
                (y_pis[strategy][:, 1, 0] - y_pis[strategy][:, 0, 0]).mean(),
            ]
            for strategy in STRATEGIES
        ],
        index=STRATEGIES,
        columns=["Coverage", "Width average"],
    ).round(3)


.. raw:: html

    <div class="output_subarea output_html rendered_html output_result">
    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }

        .dataframe tbody tr th {
            vertical-align: top;
        }

        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>Coverage</th>
          <th>Width average</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>cv</th>
          <td>0.525</td>
          <td>0.021</td>
        </tr>
        <tr>
          <th>cv_plus</th>
          <td>0.600</td>
          <td>1.444</td>
        </tr>
        <tr>
          <th>cv_minmax</th>
          <td>0.712</td>
          <td>1.451</td>
        </tr>
        <tr>
          <th>jackknife</th>
          <td>0.525</td>
          <td>0.022</td>
        </tr>
        <tr>
          <th>jackknife_plus</th>
          <td>0.525</td>
          <td>0.022</td>
        </tr>
        <tr>
          <th>jackknife_minmax</th>
          <td>0.712</td>
          <td>1.873</td>
        </tr>
        <tr>
          <th>jackknife_plus_ab</th>
          <td>0.600</td>
          <td>1.251</td>
        </tr>
        <tr>
          <th>jackknife_minmax_ab</th>
          <td>0.750</td>
          <td>2.513</td>
        </tr>
        <tr>
          <th>conformalized_quantile_regression</th>
          <td>0.675</td>
          <td>-0.569</td>
        </tr>
      </tbody>
    </table>
    </div>
    </div>
    <br />
    <br />

.. GENERATED FROM PYTHON SOURCE LINES 749-760

In conclusion, the Jackknife-minmax, CV+, CV-minmax, or JackknifeAB
strategies are more
conservative than the Jackknife+ strategy, and tend to result in more
reliable coverages for *out-of-distribution* data. It is therefore
advised to use the former strategies for predictions with new
out-of-distribution data.
Note however that there are no theoretical guarantees on the coverage level
for out-of-distribution data.
Here it's important to note that the CQR strategy should not be taken into
account for width prediction, and it is abundantly clear from the negative
width coverage that is observed in these results.

.. GENERATED FROM PYTHON SOURCE LINES 763-769

4. More Jupyter notebooks for regression
----------------------------------------

If you would like to run a series of notebooks hosted on the MAPIE Github
repository that can be run on Google Colab, please visit this documentation
link: https://mapie.readthedocs.io/en/stable/notebooks_regression.html.


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (0 minutes 12.394 seconds)


.. _sphx_glr_download_examples_regression_2-advanced-analysis_plot_main-tutorial-regression.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_main-tutorial-regression.ipynb <plot_main-tutorial-regression.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_main-tutorial-regression.py <plot_main-tutorial-regression.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_main-tutorial-regression.zip <plot_main-tutorial-regression.zip>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_