.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "examples_mondrian/1-quickstart/plot_main-tutorial-mondrian-regression.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_examples_mondrian_1-quickstart_plot_main-tutorial-mondrian-regression.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_examples_mondrian_1-quickstart_plot_main-tutorial-mondrian-regression.py:


====================================================================
Tutorial: how to ensure fairness across groups with Mondrian
====================================================================

Mondrian conformal prediction builds prediction sets (for classification) and
prediction intervals (for regression) with a group-conditional coverage guarantee. To
achieve this, it runs a separate conformal prediction procedure on each group,
and hence achieves marginal coverage within each of them.
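
To make the idea concrete, here is a minimal NumPy sketch of the per-group
procedure with absolute residuals as conformity scores. It is not MAPIE's
implementation: the helper ``conformal_threshold``, the toy data, and the
assumption that the model predicts 0 everywhere are all made up for illustration.

.. code-block:: Python

    import numpy as np


    def conformal_threshold(scores, confidence_level):
        """Return the k-th smallest score, k = ceil((n + 1) * confidence_level)."""
        n = len(scores)
        k = int(np.ceil((n + 1) * confidence_level))
        if k > n:
            return np.inf  # too few scores: only an unbounded interval is valid
        return np.sort(scores)[k - 1]


    rng = np.random.default_rng(0)
    # Toy setting: the model predicts 0 everywhere; group 1 is five times noisier.
    groups = np.repeat([0, 1], 5000)
    sigma = np.where(groups == 0, 0.1, 0.5)
    scores_conformalize = np.abs(rng.normal(0, sigma))  # absolute residuals
    scores_test = np.abs(rng.normal(0, sigma))

    # Classical approach: one threshold shared by every group.
    shared_q = conformal_threshold(scores_conformalize, 0.9)
    coverage = {}
    for g in np.unique(groups):
        in_group = groups == g
        # Mondrian approach: one threshold per group.
        group_q = conformal_threshold(scores_conformalize[in_group], 0.9)
        cov_mondrian = np.mean(scores_test[in_group] <= group_q)
        cov_shared = np.mean(scores_test[in_group] <= shared_q)
        coverage[int(g)] = (cov_mondrian, cov_shared)
        print(f"group {g}: Mondrian={cov_mondrian:.3f}, shared={cov_shared:.3f}")

In this toy setting the shared threshold over-covers the quiet group and
under-covers the noisy one, whereas the per-group thresholds bring both groups
close to the 90% target.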

In this tutorial, we compare classical conformal prediction intervals with Mondrian
conformal prediction intervals, both estimated by MAPIE, on a simple one-dimensional
ground-truth function. The function is a sinusoid with added noise, and the data is
split into 10 *disjoint* groups. In practice, such groups can be categories like
gender, or demographic segments such as age ranges. The goal is to estimate
prediction intervals for new data points and to compare the coverage of these
intervals across groups.

Please note that the coverage obtained with Mondrian depends on the size of the
groups: each group must be large enough for its coverage to accurately reflect the
model's performance on that group. If a group is too small (e.g., fewer than 200
samples in its conformalization set), the conformalization may become unstable, and
the effective coverage obtained is likely to vary widely.
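
The effect of the group size can be quantified under the usual split-conformal
assumptions (exchangeable data, no ties among conformity scores): over random draws
of a conformalization set of size ``n``, the coverage follows a Beta distribution.
The sketch below uses this known result to print the spread of the coverage for a
few group sizes. The function name ``coverage_spread`` is an assumption made for
illustration and is not part of MAPIE.

.. code-block:: Python

    import math


    def coverage_spread(n, confidence_level):
        """Mean and standard deviation of the coverage for n conformalization points.

        Assumes exchangeable data and continuous (tie-free) conformity scores.
        """
        k = math.ceil((n + 1) * confidence_level)  # rank of the threshold score
        if k > n:
            raise ValueError("n is too small for this confidence level")
        a, b = k, n + 1 - k  # coverage ~ Beta(a, b)
        mean = a / (a + b)
        std = math.sqrt(a * b / ((a + b) ** 2 * (a + b + 1)))
        return mean, std


    for n in (50, 200, 4000):
        mean, std = coverage_spread(n, 0.9)
        print(f"n={n:>4}: mean coverage={mean:.3f}, std={std:.3f}")

With 200 conformalization points the standard deviation is about 0.02, so an
observed coverage anywhere between roughly 0.86 and 0.94 would be unsurprising for
a 90% target.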


Throughout this tutorial, we will answer the following questions:

- How to use MAPIE to estimate prediction intervals for a regression problem?
- How to build Mondrian conformal prediction intervals using MAPIE for regression?
- How to compare the coverage of the prediction intervals by groups?

Here, :class:`~mapie.regression.SplitConformalRegressor` is used, along with the
``"absolute"`` conformity score.

The Mondrian method is compatible with any MAPIE estimator, except those involving
cross-conformal predictions. There are no restrictions on the conformity scores used.

.. GENERATED FROM PYTHON SOURCE LINES 39-55

.. code-block:: Python


    import os
    import warnings
    from copy import copy

    import matplotlib.pyplot as plt
    import numpy as np
    from sklearn.ensemble import RandomForestRegressor

    from mapie.metrics.regression import regression_coverage_score
    from mapie.utils import train_conformalize_test_split
    from mapie.regression import SplitConformalRegressor

    os.environ["TF_CPP_MIN_LOG_LEVEL"] = "3"
    warnings.filterwarnings("ignore")


.. GENERATED FROM PYTHON SOURCE LINES 56-60

1. Create the noisy dataset
----------------------------------------------------------------------------
We create a dataset with 10 groups of equal size, each with a different
noise level.

.. GENERATED FROM PYTHON SOURCE LINES 60-100

.. code-block:: Python


    n_points = 100000
    np.random.seed(0)
    X = np.linspace(0, 10, n_points).reshape(-1, 1)
    group_size = n_points // 10
    partition_list = []
    for i in range(10):
        partition_list.append(np.array([i] * group_size))
    # The `partition` array contains the group of each of the 100,000 data points. We
    # ensured that the groups are disjoint.
    partition = np.concatenate(partition_list)

    noise_0_1 = np.random.normal(0, 0.1, group_size)
    noise_1_2 = np.random.normal(0, 0.5, group_size)
    noise_2_3 = np.random.normal(0, 1, group_size)
    noise_3_4 = np.random.normal(0, 0.4, group_size)
    noise_4_5 = np.random.normal(0, 0.2, group_size)
    noise_5_6 = np.random.normal(0, 0.3, group_size)
    noise_6_7 = np.random.normal(0, 0.6, group_size)
    noise_7_8 = np.random.normal(0, 0.7, group_size)
    noise_8_9 = np.random.normal(0, 0.8, group_size)
    noise_9_10 = np.random.normal(0, 0.9, group_size)

    y = np.concatenate(
        [
            np.sin(X[partition == 0, 0] * 2) + noise_0_1,
            np.sin(X[partition == 1, 0] * 2) + noise_1_2,
            np.sin(X[partition == 2, 0] * 2) + noise_2_3,
            np.sin(X[partition == 3, 0] * 2) + noise_3_4,
            np.sin(X[partition == 4, 0] * 2) + noise_4_5,
            np.sin(X[partition == 5, 0] * 2) + noise_5_6,
            np.sin(X[partition == 6, 0] * 2) + noise_6_7,
            np.sin(X[partition == 7, 0] * 2) + noise_7_8,
            np.sin(X[partition == 8, 0] * 2) + noise_8_9,
            np.sin(X[partition == 9, 0] * 2) + noise_9_10,
        ],
        axis=0,
    )
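
For reference, a dataset with the same structure can be built more compactly with
``np.repeat`` and broadcasting. This is a sketch rather than the tutorial's code:
the names ``n_groups`` and ``noise_levels`` are introduced here for illustration,
and because it uses ``np.random.default_rng`` the sampled values differ from the
ones above.

.. code-block:: Python

    import numpy as np

    n_points, n_groups = 100_000, 10
    group_size = n_points // n_groups
    # Standard deviation of the noise in each group, in the tutorial's order.
    noise_levels = np.array([0.1, 0.5, 1.0, 0.4, 0.2, 0.3, 0.6, 0.7, 0.8, 0.9])

    rng = np.random.default_rng(0)
    X = np.linspace(0, 10, n_points).reshape(-1, 1)
    # Group label of every point: 0 for the first 10,000 points, then 1, and so on.
    partition = np.repeat(np.arange(n_groups), group_size)
    # Indexing by `partition` picks the noise level that matches each point's group.
    y = np.sin(2 * X[:, 0]) + rng.normal(0, noise_levels[partition])

    # Each point belongs to exactly one group, and all groups have the same size.
    print(np.bincount(partition))

Keeping the noise levels in a single array also makes it easy to change the number
of groups or their noise without editing several lines.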


.. GENERATED FROM PYTHON SOURCE LINES 101-102

We plot the dataset with the partition as colors.

.. GENERATED FROM PYTHON SOURCE LINES 102-106

.. code-block:: Python


    plt.scatter(X, y, c=partition)
    plt.show()


.. image-sg:: /examples_mondrian/1-quickstart/images/sphx_glr_plot_main-tutorial-mondrian-regression_001.png
   :alt: plot main tutorial mondrian regression
   :srcset: /examples_mondrian/1-quickstart/images/sphx_glr_plot_main-tutorial-mondrian-regression_001.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 107-109

2. Split the dataset into a training set, a conformalization set, and a test set
------------------------------------------------------------------------------------

.. GENERATED FROM PYTHON SOURCE LINES 109-127

.. code-block:: Python


    (X_train, X_conformalize, X_test, y_train, y_conformalize, y_test) = (
        train_conformalize_test_split(
            X, y, train_size=0.4, conformalize_size=0.4, test_size=0.2, random_state=0
        )
    )

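    # Split the group labels with the same sizes and `random_state` so that they
    # stay aligned with `X` and `y`: here `partition` is passed in place of `X`.
    (partition_train, partition_conformalize, partition_test, _, _, _) = (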
        train_conformalize_test_split(
            partition,
            y,
            train_size=0.4,
            conformalize_size=0.4,
            test_size=0.2,
            random_state=0,
        )
    )
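
Calling the splitter twice with the same ``random_state`` keeps the features, the
targets, and the group labels aligned. An alternative is to draw the indices once
and reuse them for every array. The sketch below does this with plain NumPy on toy
arrays; ``three_way_split_indices`` is a hypothetical helper, not a MAPIE function.

.. code-block:: Python

    import numpy as np


    def three_way_split_indices(n_samples, train_size, conformalize_size, seed=0):
        """Shuffle the indices once and cut them into three disjoint parts."""
        rng = np.random.default_rng(seed)
        indices = rng.permutation(n_samples)
        n_train = round(train_size * n_samples)
        n_conformalize = round(conformalize_size * n_samples)
        return np.split(indices, [n_train, n_train + n_conformalize])


    # Toy arrays standing in for the features, targets, and group labels.
    X_toy = np.arange(20).reshape(-1, 1)
    y_toy = np.arange(20) * 2.0
    partition_toy = np.repeat([0, 1], 10)

    idx_train, idx_conformalize, idx_test = three_way_split_indices(20, 0.4, 0.4)
    # The same indices select matching rows from every array.
    X_tr, y_tr, partition_tr = X_toy[idx_train], y_toy[idx_train], partition_toy[idx_train]
    print(len(idx_train), len(idx_conformalize), len(idx_test))

Indexing every array with the same index vectors makes the alignment explicit
instead of relying on two calls producing the same shuffle.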


.. GENERATED FROM PYTHON SOURCE LINES 128-129

We plot the training set, the conformalization set, and the test set.

.. GENERATED FROM PYTHON SOURCE LINES 129-140

.. code-block:: Python


    f, ax = plt.subplots(1, 3, figsize=(15, 5))
    ax[0].scatter(X_train, y_train, c=partition_train)
    ax[0].set_title("Train set")
    ax[1].scatter(X_conformalize, y_conformalize, c=partition_conformalize)
    ax[1].set_title("Conformalization set")
    ax[2].scatter(X_test, y_test, c=partition_test)
    ax[2].set_title("Test set")
    plt.show()


.. image-sg:: /examples_mondrian/1-quickstart/images/sphx_glr_plot_main-tutorial-mondrian-regression_002.png
   :alt: Train set, Conformalization set, Test set
   :srcset: /examples_mondrian/1-quickstart/images/sphx_glr_plot_main-tutorial-mondrian-regression_002.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 141-143

3. Fit a random forest regressor on the training set
----------------------------------------------------------------------------

.. GENERATED FROM PYTHON SOURCE LINES 143-147

.. code-block:: Python


    random_forest = RandomForestRegressor(n_estimators=100)
    random_forest.fit(X_train, y_train)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    RandomForestRegressor()

.. GENERATED FROM PYTHON SOURCE LINES 148-152

4. Build the classical conformal prediction intervals
----------------------------------------------------------------------------
In this first part, let us build the prediction intervals with MAPIE using a single
:class:`~mapie.regression.SplitConformalRegressor`.

.. GENERATED FROM PYTHON SOURCE LINES 155-157

Conformalize a SplitConformalRegressor on the conformalization set
*************************************************************************************

.. GENERATED FROM PYTHON SOURCE LINES 157-165

.. code-block:: Python


    # We aim for a coverage score of at least 90%.
    split_regressor = SplitConformalRegressor(
        random_forest, prefit=True, confidence_level=0.9
    )
    split_regressor.conformalize(X_conformalize, y_conformalize)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    <mapie.regression.regression.SplitConformalRegressor object at 0x7a60c92f1760>


.. GENERATED FROM PYTHON SOURCE LINES 166-168

Predict the prediction intervals on the test set
*************************************************************************************

.. GENERATED FROM PYTHON SOURCE LINES 168-172

.. code-block:: Python


    _, y_prediction_intervals_split = split_regressor.predict_interval(X_test)


.. GENERATED FROM PYTHON SOURCE LINES 173-175

Evaluate the coverage score by group
*************************************************************************************

.. GENERATED FROM PYTHON SOURCE LINES 175-205

.. code-block:: Python


    coverages = {}
    for group in np.unique(partition_test):
        coverages[group] = {}
        coverages[group]["split"] = regression_coverage_score(
            y_test[partition_test == group],
            y_prediction_intervals_split[partition_test == group],
        )

    # Plot the coverage by group with the SplitConformalRegressor
    plt.bar(
        np.arange(len(coverages)),
        [float(coverages[group]["split"]) for group in coverages],
        label="Split",
    )
    plt.xticks(
        np.arange(len(coverages)), [f"Group {group}" for group in coverages], rotation=45
    )
    plt.hlines(0.9, -1, 10, label="90% coverage", color="black", linestyle="--")
    plt.ylabel("Coverage")
    plt.legend(loc="upper left", bbox_to_anchor=(1, 1))
    plt.tight_layout()
    plt.show()

    # Compute the average coverage across the 10 groups in the test set
    split_coverages = [coverages[group]["split"] for group in coverages]
    average_coverage = np.mean(split_coverages)
    print("Average coverage across the 10 groups:", average_coverage)


.. image-sg:: /examples_mondrian/1-quickstart/images/sphx_glr_plot_main-tutorial-mondrian-regression_003.png
   :alt: plot main tutorial mondrian regression
   :srcset: /examples_mondrian/1-quickstart/images/sphx_glr_plot_main-tutorial-mondrian-regression_003.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Average coverage across the 10 groups: 0.9007750922701531


.. GENERATED FROM PYTHON SOURCE LINES 206-212

As the output above shows, the average coverage across the 10 groups (i.e., marginal
coverage) is slightly above the 90% target, as expected. However, the bar chart shows
that coverage varies greatly from one group to another. This is undesirable, because
we want to achieve 90% coverage in each group (i.e., conditional coverage).
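
To see why a satisfactory average can hide uneven coverage across groups, here is a
minimal NumPy-only sketch. It does not use MAPIE: the two groups, their noise levels,
and every variable name are assumptions made for illustration, and a single quantile
of absolute residuals stands in for the classical split procedure.

.. code-block:: Python

    import numpy as np

    rng = np.random.default_rng(0)

    # Synthetic setup (an assumption for illustration): two equally likely groups
    # whose noise levels differ by a factor of four.
    n_samples = 20_000
    groups = rng.integers(0, 2, size=n_samples)
    noise_scale = np.where(groups == 0, 0.5, 2.0)
    abs_residuals = np.abs(rng.normal(0.0, noise_scale))

    # The first half plays the role of the conformalization set, the second half
    # the role of the test set.
    half = n_samples // 2
    residuals_conformalize = abs_residuals[:half]
    residuals_test = abs_residuals[half:]
    groups_test = groups[half:]

    # A single threshold shared by all groups, as in the classical approach.
    threshold = np.quantile(residuals_conformalize, 0.9)
    covered = residuals_test <= threshold

    marginal_coverage = covered.mean()
    coverage_by_group = {g: covered[groups_test == g].mean() for g in (0, 1)}
    print(f"marginal coverage: {marginal_coverage:.3f}")
    print({g: round(float(c), 3) for g, c in coverage_by_group.items()})

In this sketch, the low-noise group ends up over-covered and the high-noise group
under-covered, even though the pooled coverage stays close to 90%.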

Let us see how Mondrian allows us to handle this situation.

.. GENERATED FROM PYTHON SOURCE LINES 215-218

5. Build the Mondrian conformal prediction intervals
----------------------------------------------------------------------------
In this part, let us build the prediction intervals using the Mondrian method.

.. GENERATED FROM PYTHON SOURCE LINES 221-225

Conformalize a SplitConformalRegressor on the conformalization set for each group
*************************************************************************************
For each group in the conformalization set, we conformalize a distinct
:class:`~mapie.regression.SplitConformalRegressor`.

.. GENERATED FROM PYTHON SOURCE LINES 225-241

.. code-block:: Python


    mondrian_regressor = {}

    partition_groups_conformity = np.unique(partition_conformalize)

    for group in partition_groups_conformity:
        mapie_group_estimator = SplitConformalRegressor(
            copy(random_forest), prefit=True, confidence_level=0.9
        )
        indices_groups = np.argwhere(partition_conformalize == group)[:, 0]
        X_group = X_conformalize[indices_groups]
        y_group = y_conformalize[indices_groups]
        mapie_group_estimator.conformalize(X_group, y_group)
        mondrian_regressor[group] = mapie_group_estimator
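
Conceptually, conformalizing one regressor per group amounts to computing one
threshold per group instead of a single shared one. The sketch below illustrates that
idea with NumPy only. It is not MAPIE's implementation: the helper
``groupwise_thresholds``, the synthetic residuals, and the finite-sample quantile
level are assumptions chosen for illustration, and ``method="higher"`` requires
NumPy 1.22 or later.

.. code-block:: Python

    import numpy as np


    def groupwise_thresholds(abs_residuals, partition, confidence_level=0.9):
        """Return one conformal threshold per group (hypothetical helper)."""
        thresholds = {}
        for group in np.unique(partition):
            scores = abs_residuals[partition == group]
            n = len(scores)
            # Finite-sample corrected quantile level, capped at 1.
            level = min(1.0, np.ceil((n + 1) * confidence_level) / n)
            thresholds[group] = np.quantile(scores, level, method="higher")
        return thresholds


    # Synthetic absolute residuals for two groups with different noise levels.
    rng = np.random.default_rng(0)
    partition = rng.integers(0, 2, size=20_000)
    abs_residuals = np.abs(rng.normal(0.0, np.where(partition == 0, 0.5, 2.0)))

    # Thresholds come from the first half; coverage is measured on the second.
    half = len(partition) // 2
    thresholds = groupwise_thresholds(abs_residuals[:half], partition[:half])

    test_residuals, test_partition = abs_residuals[half:], partition[half:]
    group_coverage = {
        group: float(np.mean(test_residuals[test_partition == group] <= threshold))
        for group, threshold in thresholds.items()
    }
    print(group_coverage)

With a threshold per group, the noisier group receives a wider interval, and the
coverage measured on each group should land near the 90% target.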


.. GENERATED FROM PYTHON SOURCE LINES 242-246

Predict the prediction intervals on the test set
*************************************************************************************
Next, for each group in the test set, we build the prediction intervals using the
:class:`~mapie.regression.SplitConformalRegressor` associated with the group.

.. GENERATED FROM PYTHON SOURCE LINES 246-262

.. code-block:: Python


    partition_groups_test = np.unique(partition_test)

    y_pred_mondrian = np.empty((len(X_test),))
    y_prediction_intervals_mondrian = np.empty((len(X_test), 2, 1))

    for group in partition_groups_test:
        indices_groups = np.argwhere(partition_test == group)[:, 0]
        X_group = X_test[indices_groups]
        y_pred_group, y_prediction_intervals_group = mondrian_regressor[
            group
        ].predict_interval(X_group)
        y_pred_mondrian[indices_groups] = y_pred_group
        y_prediction_intervals_mondrian[indices_groups] = y_prediction_intervals_group
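
The loop above looks up ``mondrian_regressor[group]`` for every group found in the
test set, so it raises a ``KeyError`` if a test group never appeared in the
conformalization set. A small sanity check run beforehand can catch this, and can also
flag groups that are too small. The helper ``check_partitions`` below and its demo
arrays are hypothetical, and the default of 200 samples simply mirrors the rule of
thumb given in the introduction of this tutorial.

.. code-block:: Python

    import numpy as np


    def check_partitions(partition_conformalize, partition_test, min_size=200):
        """Return test groups unseen at conformalization and undersized groups."""
        groups, counts = np.unique(partition_conformalize, return_counts=True)
        # Groups present in the test set but absent from the conformalization set.
        unseen = np.setdiff1d(np.unique(partition_test), groups)
        # Conformalization groups with fewer than ``min_size`` samples.
        too_small = groups[counts < min_size]
        return unseen, too_small


    # Illustrative partitions: group 2 is small and group 3 is never conformalized.
    partition_conformalize_demo = np.repeat([0, 1, 2], [300, 250, 50])
    partition_test_demo = np.array([0, 1, 2, 3])

    unseen, too_small = check_partitions(
        partition_conformalize_demo, partition_test_demo
    )
    print("Unseen test groups:", unseen)
    print("Undersized conformalization groups:", too_small)

How to handle such groups (merging them with a neighbor, or falling back to the
single shared regressor) is a design choice that depends on the application.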


.. GENERATED FROM PYTHON SOURCE LINES 263-266

6. Compare the coverage by group and plot both methods side by side
----------------------------------------------------------------------------
Finally, we can compare the coverage scores for each group using both methods.

.. GENERATED FROM PYTHON SOURCE LINES 266-319

.. code-block:: Python


    coverages = {}
    for group in np.unique(partition_test):
        coverages[group] = {}
        coverages[group]["split"] = regression_coverage_score(
            y_test[partition_test == group],
            y_prediction_intervals_split[partition_test == group],
        )
        coverages[group]["mondrian"] = regression_coverage_score(
            y_test[partition_test == group],
            y_prediction_intervals_mondrian[partition_test == group],
        )

    # Plot the coverage by group, plot both methods side by side
    plt.figure(figsize=(10, 5))
    plt.bar(
        np.arange(len(coverages)) * 2,
        [float(coverages[group]["split"]) for group in coverages],
        label="Split",
    )
    plt.bar(
        np.arange(len(coverages)) * 2 + 1,
        [float(coverages[group]["mondrian"]) for group in coverages],
        label="Mondrian",
    )
    plt.xticks(
        np.arange(len(coverages)) * 2 + 0.5,
        [f"Group {group}" for group in coverages],
        rotation=45,
    )
    plt.hlines(0.9, -1, 20, label="90% coverage", color="black", linestyle="--")
    plt.ylabel("Coverage")
    plt.legend(loc="upper left", bbox_to_anchor=(1, 1))
    plt.tight_layout()
    plt.show()

    # Compute the average coverage across the 10 groups in the test set with the
    # classic method
    split_coverages = [coverages[group]["split"] for group in coverages]
    average_coverage = np.mean(split_coverages)
    print(
        "Average coverage across the 10 groups with the classic method:", average_coverage
    )

    # Compute the average coverage across the 10 groups in the test set with the
    # Mondrian method
    mondrian_coverages = [coverages[group]["mondrian"] for group in coverages]
    average_coverage = np.mean(mondrian_coverages)
    print(
        "Average coverage across the 10 groups with the Mondrian method:", average_coverage
    )


.. image-sg:: /examples_mondrian/1-quickstart/images/sphx_glr_plot_main-tutorial-mondrian-regression_004.png
   :alt: plot main tutorial mondrian regression
   :srcset: /examples_mondrian/1-quickstart/images/sphx_glr_plot_main-tutorial-mondrian-regression_004.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Average coverage across the 10 groups with the classic method: 0.9007750922701531
    Average coverage across the 10 groups with the Mondrian method: 0.9000044379819128


.. GENERATED FROM PYTHON SOURCE LINES 320-324

As expected, both methods achieve an average coverage (marginal coverage) above
90% across the 10 groups.
However, the coverage that the Mondrian method achieves in each group (conditional
coverage) is much closer to the 90% target than with the classic method.


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (0 minutes 11.397 seconds)


.. _sphx_glr_download_examples_mondrian_1-quickstart_plot_main-tutorial-mondrian-regression.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_main-tutorial-mondrian-regression.ipynb <plot_main-tutorial-mondrian-regression.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_main-tutorial-mondrian-regression.py <plot_main-tutorial-mondrian-regression.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_main-tutorial-mondrian-regression.zip <plot_main-tutorial-mondrian-regression.zip>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_