.. _neuraltreeonnxrst:

======================
NeuralTreeNet and ONNX
======================

.. only:: html

    **Links:** :download:`notebook `, :downloadlink:`html `,
    :download:`PDF `, :download:`python `, :downloadlink:`slides `,
    :githublink:`GitHub|_doc/notebooks/ml/neural_tree_onnx.ipynb|*`

Converting a decision tree to ONNX can create discrepancies between the
original model and the converted one (see
`Issues when switching to float `__). The problem comes from a change of
type: every decision threshold is rounded to the float32 value closest
to its float64 (double) value. What happens if the decision tree is
first converted into a neural network? In most cases, approximating the
decision thresholds changes very little. However, comparing a feature
to a rounded threshold can produce the opposite outcome of the
comparison with the exact threshold. In that case, the decision follows
a different path in the tree.

.. code:: ipython3

    from jyquickhelper import add_notebook_menu
    add_notebook_menu()

.. contents::
    :local:

.. code:: ipython3

    %matplotlib inline

.. code:: ipython3

    %load_ext mlprodict

Dataset
-------

We build a random dataset.

.. code:: ipython3

    import numpy

    X = numpy.random.randn(10000, 10)
    y = X.sum(axis=1) / X.shape[1]
    X = X.astype(numpy.float64)
    y = y.astype(numpy.float64)

.. code:: ipython3

    middle = X.shape[0] // 2
    X_train, X_test = X[:middle], X[middle:]
    y_train, y_test = y[:middle], y[middle:]

The scikit-learn part
---------------------

Fitting a decision tree
~~~~~~~~~~~~~~~~~~~~~~~

.. code:: ipython3

    from sklearn.tree import DecisionTreeRegressor

    tree = DecisionTreeRegressor(max_depth=7)
    tree.fit(X_train, y_train)
    tree.score(X_train, y_train), tree.score(X_test, y_test)

.. parsed-literal::

    (0.6179766027481131, 0.33709933420465643)

.. code:: ipython3

    from sklearn.metrics import r2_score
    r2_score(y_test, tree.predict(X_test))

.. parsed-literal::

    0.33709933420465643

The depth of the tree is insufficient, but that is not the point here.

Conversion to ONNX
~~~~~~~~~~~~~~~~~~

.. code:: ipython3

    from mlprodict.onnx_conv import to_onnx

    onx = to_onnx(tree, X[:1].astype(numpy.float32))

.. code:: ipython3

    from mlprodict.onnxrt import OnnxInference

    x_exp = X_test
    oinf = OnnxInference(onx, runtime='onnxruntime1')
    expected = tree.predict(x_exp)
    got = oinf.run({'X': x_exp.astype(numpy.float32)})['variable']
    numpy.abs(got - expected).max()

.. parsed-literal::

    1.7421041873949668
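This maximum discrepancy is exactly the rounding effect described in
the introduction. A minimal sketch of the phenomenon, with made-up
values chosen only for illustration:

.. code:: ipython3

    import numpy

    # A float64 threshold and a feature value slightly above it.
    # 0.1 is not exactly representable in float32.
    threshold = 0.1
    x = 0.10000000099   # lies between 0.1 and numpy.float32(0.1)

    # In float64 the test fails: the sample goes down one branch.
    print(x <= threshold)                                 # False

    # Rounded to float32, the threshold moves above x: the test flips
    # and the sample follows the other branch of the tree.
    print(numpy.float32(x) <= numpy.float32(threshold))   # True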
.. code:: ipython3

    from mlprodict.plotting.text_plot import onnx_simple_text_plot
    print(onnx_simple_text_plot(onx))

.. parsed-literal::

    opset: domain='ai.onnx.ml' version=1
    opset: domain='' version=15
    input: name='X' type=dtype('float32') shape=[None, 10]
    TreeEnsembleRegressor(X, n_targets=1,
        nodes_falsenodeids=253:[128,65,34...252,0,0],
        nodes_featureids=253:[8,3,9...2,0,0],
        nodes_hitrates=253:[1.0,1.0...1.0,1.0],
        nodes_missing_value_tracks_true=253:[0,0,0...0,0,0],
        nodes_modes=253:[b'BRANCH_LEQ',b'BRANCH_LEQ'...b'LEAF',b'LEAF'],
        nodes_nodeids=253:[0,1,2...250,251,252],
        nodes_treeids=253:[0,0,0...0,0,0],
        nodes_truenodeids=253:[1,2,3...251,0,0],
        nodes_values=253:[0.00792999193072319,-0.12246682494878769...0.0,0.0],
        post_transform=b'NONE',
        target_ids=127:[0,0,0...0,0,0],
        target_nodeids=127:[7,8,10...249,251,252],
        target_treeids=127:[0,0,0...0,0,0],
        target_weights=127:[-0.9345570802688599,-0.6372960805892944...0.6169403195381165,1.0096807479858398]) -> variable
    output: name='variable' type=dtype('float32') shape=[None, 1]

After conversion to a neural network
------------------------------------

Conversion to a neural network
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

A parameter controls the slope of the sigmoid functions the network
relies on.

.. code:: ipython3

    from tqdm import tqdm
    from pandas import DataFrame
    from mlstatpy.ml.neural_tree import NeuralTreeNet

    xe = x_exp[:500]
    expected = tree.predict(xe)

    data = []
    trees = {}
    for i in tqdm([0.3, 0.4, 0.5, 0.7, 0.9, 1] + list(range(5, 61, 5))):
        root = NeuralTreeNet.create_from_tree(tree, k=i, arch='compact')
        got = root.predict(xe)[:, -1]
        me = numpy.abs(got - expected).mean()
        mx = numpy.abs(got - expected).max()
        obs = dict(k=i, max=mx, mean=me)
        data.append(obs)
        trees[i] = root

.. parsed-literal::

    100%|██████████| 18/18 [00:01<00:00, 12.49it/s]

.. code:: ipython3

    df = DataFrame(data)
    df
.. parsed-literal::

           k       max      mean
    0    0.3  0.568981  0.158758
    1    0.4  0.608304  0.132576
    2    0.5  0.692657  0.128525
    3    0.7  0.780543  0.131497
    4    0.9  0.809866  0.128368
    5    1.0  0.813889  0.124802
    6    5.0  0.392482  0.022466
    7   10.0  0.341749  0.006350
    8   15.0  0.270649  0.002939
    9   20.0  0.299713  0.002110
    10  25.0  0.305493  0.001842
    11  30.0  0.306111  0.001767
    12  35.0  0.299371  0.001665
    13  40.0  0.233556  0.001011
    14  45.0  0.233606  0.000801
    15  50.0  0.233614  0.000547
    16  55.0  0.233615  0.000499
    17  60.0  0.233615  0.000484
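Before plotting these numbers, it helps to see why a larger *k* reduces
the error: the network replaces each threshold test with a sigmoid
whose slope *k* controls, and the steeper the sigmoid, the closer it is
to the exact step function. A small sketch of the principle (the
function below is an illustration, not the exact formula used by
*mlstatpy*):

.. code:: ipython3

    import numpy

    def soft_test(x, threshold, k):
        # sigmoid approximation of the test x <= threshold
        return 1 / (1 + numpy.exp(-k * (threshold - x)))

    x = numpy.array([-0.5, -0.01, 0.01, 0.5])
    for k in [1, 10, 50]:
        # values close to 0 or 1 mean the test is sharp
        print(k, soft_test(x, 0.0, k))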
.. code:: ipython3

    df.set_index('k').plot(title="Précision de la conversion\nen réseau de neurones");

.. image:: neural_tree_onnx_20_0.png

The error improves as *k* grows, but the experiment should be repeated
several times before drawing a conclusion, so as to get a confidence
interval for this type of dataset. That will be for another time. The
result depends on the dataset and above all on how close the decision
thresholds are to each other. Nevertheless, let's compute the error on
the whole test set; it was truncated above to run faster.

.. code:: ipython3

    expected = tree.predict(x_exp)
    got = trees[50].predict(x_exp)[:, -1]
    numpy.abs(got - expected).max(), numpy.abs(got - expected).mean()

.. parsed-literal::

    (0.2336143002078063, 0.0002511855017989173)

The error can be quite large, but it remains smaller than the
conversion error introduced by ONNX.

Conversion to ONNX
~~~~~~~~~~~~~~~~~~

We first create a class following the scikit-learn API that wraps the
tree just created; it is then converted to ONNX.

.. code:: ipython3

    from mlstatpy.ml.neural_tree import NeuralTreeNetRegressor

    reg = NeuralTreeNetRegressor(trees[50])
    onx2 = to_onnx(reg, X[:1].astype(numpy.float32))

.. code:: ipython3

    print(onnx_simple_text_plot(onx2))

.. parsed-literal::

    opset: domain='' version=15
    input: name='X' type=dtype('float32') shape=[None, 10]
    init: name='Ma_MatMulcst' type=dtype('float32') shape=(1260,)
    init: name='Ad_Addcst' type=dtype('float32') shape=(126,)
    init: name='Mu_Mulcst' type=dtype('float32') shape=(1,) -- array([4.], dtype=float32)
    init: name='Ma_MatMulcst1' type=dtype('float32') shape=(16002,)
    init: name='Ad_Addcst1' type=dtype('float32') shape=(127,)
    init: name='Ma_MatMulcst2' type=dtype('float32') shape=(127,)
    init: name='Ad_Addcst2' type=dtype('float32') shape=(1,) -- array([0.], dtype=float32)
    MatMul(X, Ma_MatMulcst) -> Ma_Y02
    Add(Ma_Y02, Ad_Addcst) -> Ad_C02
    Mul(Ad_C02, Mu_Mulcst) -> Mu_C01
    Sigmoid(Mu_C01) -> Si_Y01
    MatMul(Si_Y01, Ma_MatMulcst1) -> Ma_Y01
    Add(Ma_Y01, Ad_Addcst1) -> Ad_C01
    Mul(Ad_C01, Mu_Mulcst) -> Mu_C0
    Sigmoid(Mu_C0) -> Si_Y0
    MatMul(Si_Y0, Ma_MatMulcst2) -> Ma_Y0
    Add(Ma_Y0, Ad_Addcst2) -> Ad_C0
    Identity(Ad_C0) -> variable
    output: name='variable' type=dtype('float32') shape=[None, 1]

.. code:: ipython3

    oinf2 = OnnxInference(onx2, runtime='onnxruntime1')
    expected = tree.predict(x_exp)
    got = oinf2.run({'X': x_exp.astype(numpy.float32)})['variable']
    numpy.abs(got - expected).max()

.. parsed-literal::

    1.7421041873949668

The error is the same.

Computation time
----------------

.. code:: ipython3

    x_exp32 = x_exp.astype(numpy.float32)

First, the computation time for scikit-learn.

.. code:: ipython3

    %timeit tree.predict(x_exp32)

.. parsed-literal::

    513 µs ± 7.52 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

The computation time for the decision tree in ONNX.

.. code:: ipython3

    %timeit oinf.run({'X': x_exp32})['variable']

.. parsed-literal::

    186 µs ± 3.41 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)

And the computation time for the neural network in ONNX.

.. code:: ipython3

    %timeit oinf2.run({'X': x_exp32})['variable']

.. parsed-literal::

    3.75 ms ± 311 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

This much longer computation time is expected: the model contains a
very large matrix multiplication, and above all, every threshold of the
tree is evaluated for every observation. Where the decision-tree
implementation evaluates *d* thresholds per observation, *d* being the
depth of the tree, the new implementation evaluates all of them, about
:math:`2^d` thresholds, and combines them for each of the :math:`2^d`
leaves. Even with sparse matrices, the computation can only be reduced
to roughly :math:`d \cdot 2^d` operations, which still leaves many
useless computations.
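For the tree used here, fitted with ``max_depth=7``, these counts are
easy to write down; a quick back-of-the-envelope check:

.. code:: ipython3

    d = 7                         # depth of the tree
    per_path = d                  # comparisons per observation in the tree
    all_thresholds = 2 ** d - 1   # internal nodes evaluated by the network
    sparse_cost = d * 2 ** d      # rough cost with sparse matrices
    print(per_path, all_thresholds, sparse_cost)   # 7 127 896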
.. code:: ipython3

    for node in trees[50].nodes:
        print(node.coef.shape, node.bias.shape)

.. parsed-literal::

    (126, 11) (126,)
    (127, 127) (127,)
    (128,) ()

That said, the largest matrix is sparse and can be reduced considerably.

.. code:: ipython3

    from scipy.sparse import csr_matrix

    for node in trees[50].nodes:
        csr = csr_matrix(node.coef)
        print(f"coef.shape={node.coef.shape}, size dense={node.coef.size}, "
              f"size sparse={csr.size}, ratio={csr.size / node.coef.size}")

.. parsed-literal::

    coef.shape=(126, 11), size dense=1386, size sparse=252, ratio=0.18181818181818182
    coef.shape=(127, 127), size dense=16129, size sparse=1015, ratio=0.06293012586025172
    coef.shape=(128,), size dense=128, size sparse=127, ratio=0.9921875

.. code:: ipython3

    r = numpy.random.randn(trees[50].nodes[1].coef.shape[0])
    mat = trees[50].nodes[1].coef
    %timeit mat @ r

.. parsed-literal::

    49.8 µs ± 1.25 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)

.. code:: ipython3

    csr = csr_matrix(mat)
    %timeit csr @ r

.. parsed-literal::

    7.08 µs ± 173 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)

It would be much faster with a sparse matrix, and all the more so as
the tree gets deeper. The ONNX model decomposes as follows.

.. code:: ipython3

    print(onnx_simple_text_plot(onx2))

.. parsed-literal::

    opset: domain='' version=15
    input: name='X' type=dtype('float32') shape=[None, 10]
    init: name='Ma_MatMulcst' type=dtype('float32') shape=(1260,)
    init: name='Ad_Addcst' type=dtype('float32') shape=(126,)
    init: name='Mu_Mulcst' type=dtype('float32') shape=(1,) -- array([4.], dtype=float32)
    init: name='Ma_MatMulcst1' type=dtype('float32') shape=(16002,)
    init: name='Ad_Addcst1' type=dtype('float32') shape=(127,)
    init: name='Ma_MatMulcst2' type=dtype('float32') shape=(127,)
    init: name='Ad_Addcst2' type=dtype('float32') shape=(1,) -- array([0.], dtype=float32)
    MatMul(X, Ma_MatMulcst) -> Ma_Y02
    Add(Ma_Y02, Ad_Addcst) -> Ad_C02
    Mul(Ad_C02, Mu_Mulcst) -> Mu_C01
    Sigmoid(Mu_C01) -> Si_Y01
    MatMul(Si_Y01, Ma_MatMulcst1) -> Ma_Y01
    Add(Ma_Y01, Ad_Addcst1) -> Ad_C01
    Mul(Ad_C01, Mu_Mulcst) -> Mu_C0
    Sigmoid(Mu_C0) -> Si_Y0
    MatMul(Si_Y0, Ma_MatMulcst2) -> Ma_Y0
    Add(Ma_Y0, Ad_Addcst2) -> Ad_C0
    Identity(Ad_C0) -> variable
    output: name='variable' type=dtype('float32') shape=[None, 1]

Let's see how the computation time is distributed.

.. code:: ipython3

    oinfpr = OnnxInference(
        onx2, runtime="onnxruntime1",
        runtime_options={"enable_profiling": True})
    for i in range(0, 43):
        oinfpr.run({"X": x_exp32})

.. code:: ipython3

    df = oinfpr.get_profiling(as_df=True)
    df
.. parsed-literal::

             cat    pid   tid   dur      ts  ph                         name  args_op_name  args_parameter_size  args_graph_index         args_provider  args_exec_plan_index  args_activation_size  args_output_size                              args_input_type_shape    args_output_type_shape                  args_thread_scheduling_stats
    0    Session  78116  8820   387       4   X          model_loading_array           NaN                  NaN               NaN                   NaN                   NaN                   NaN               NaN                                                NaN                       NaN                                           NaN
    1    Session  78116  8820  2532     428   X       session_initialization           NaN                  NaN               NaN                   NaN                   NaN                   NaN               NaN                                                NaN                       NaN                                           NaN
    2       Node  78116  8820     0    3294   X            gemm_fence_before          Gemm                  NaN               NaN                   NaN                   NaN                   NaN               NaN                                                NaN                       NaN                                           NaN
    3       Node  78116  8820  1315    3300   X             gemm_kernel_time          Gemm                 5544                11  CPUExecutionProvider                    11                200000           2520000  [{'float': [5000, 10]}, {'float': [10, 126]}, ...  [{'float': [5000, 126]}]  {'main_thread': {'thread_pool_name': 'session-...
    4       Node  78116  8820     0    4635   X             gemm_fence_after          Gemm                  NaN               NaN                   NaN                   NaN                   NaN               NaN                                                NaN                       NaN                                           NaN
    ..       ...    ...   ...   ...     ...  ..                          ...           ...                  ...               ...                   ...                   ...                   ...               ...                                                ...                       ...                                           ...
    986     Node  78116  8820     0  210170   X      Ma_MatMul2_fence_before        MatMul                  NaN               NaN                   NaN                   NaN                   NaN               NaN                                                NaN                       NaN                                           NaN
    987     Node  78116  8820   124  210172   X       Ma_MatMul2_kernel_time        MatMul                  508                 8  CPUExecutionProvider                     8               2540000             20000      [{'float': [5000, 127]}, {'float': [127, 1]}]    [{'float': [5000, 1]}]  {'main_thread': {'thread_pool_name': 'session-...
    988     Node  78116  8820     0  210305   X       Ma_MatMul2_fence_after        MatMul                  NaN               NaN                   NaN                   NaN                   NaN               NaN                                                NaN                       NaN                                           NaN
    989  Session  78116  8820  4378  205930   X  SequentialExecutor::Execute           NaN                  NaN               NaN                   NaN                   NaN                   NaN               NaN                                                NaN                       NaN                                           NaN
    990  Session  78116  8820  4388  205925   X                    model_run           NaN                  NaN               NaN                   NaN                   NaN                   NaN               NaN                                                NaN                       NaN                                           NaN

    [991 rows × 17 columns]
.. code:: ipython3

    set(df['args_provider'])

.. parsed-literal::

    {'CPUExecutionProvider', nan}

.. code:: ipython3

    dfp = df[df.args_provider == 'CPUExecutionProvider'].copy()
    dfp['name'] = dfp['name'].apply(lambda s: s.replace("_kernel_time", ""))
    gr_dur = dfp[['dur', "args_op_name", "name"]].groupby(
        ["args_op_name", "name"]).sum().sort_values('dur')
    gr_dur
.. parsed-literal::

                                 dur
    args_op_name name
    MatMul       Ma_MatMul2     6778
    Mul          Mu_Mul        12923
    Sigmoid      Si_Sigmoid    14849
    Mul          Mu_Mul1       15151
    Sigmoid      Si_Sigmoid1   15608
    Gemm         gemm          31763
                 gemm_token_0  99047
.. code:: ipython3

    gr_n = dfp[['dur', "args_op_name", "name"]].groupby(
        ["args_op_name", "name"]).count().sort_values('dur')
    gr_n = gr_n.loc[gr_dur.index, :]
    gr_n
.. parsed-literal::

                               dur
    args_op_name name
    MatMul       Ma_MatMul2     43
    Mul          Mu_Mul         43
    Sigmoid      Si_Sigmoid     43
    Mul          Mu_Mul1        43
    Sigmoid      Si_Sigmoid1    43
    Gemm         gemm           43
                 gemm_token_0   43
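Every kernel runs 43 times, once per call to ``run`` above, so dividing
the total duration by the number of occurrences gives an average cost
per call. A quick sketch with the two frames just built:

.. code:: ipython3

    # average duration per call for each kernel, in microseconds
    per_call = (gr_dur['dur'] / gr_n['dur']).sort_values()
    per_call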
.. code:: ipython3

    import matplotlib.pyplot as plt

    fig, ax = plt.subplots(1, 2, figsize=(12, 4))
    gr_dur.plot.barh(ax=ax[0])
    gr_n.plot.barh(ax=ax[1])
    ax[0].set_title("duration")
    ax[1].set_title("n occurrences");

.. image:: neural_tree_onnx_51_0.png

onnxruntime spends most of its time in one matrix product. Let's check
more precisely.

.. code:: ipython3

    df[(df.args_op_name == 'Gemm') & (df.dur > 0)].sort_values(
        'dur', ascending=False).head(n=2).T
.. parsed-literal::

                                                                                 127                                                 12
    cat                                                                         Node                                               Node
    pid                                                                        78116                                              78116
    tid                                                                         8820                                               8820
    dur                                                                         4603                                               4083
    ts                                                                         37173                                               5949
    ph                                                                             X                                                  X
    name                                                    gemm_token_0_kernel_time                           gemm_token_0_kernel_time
    args_op_name                                                                Gemm                                               Gemm
    args_parameter_size                                                        64516                                              64516
    args_graph_index                                                              12                                                 12
    args_provider                                               CPUExecutionProvider                               CPUExecutionProvider
    args_exec_plan_index                                                          12                                                 12
    args_activation_size                                                     2520000                                            2520000
    args_output_size                                                         2540000                                            2540000
    args_input_type_shape         [{'float': [5000, 126]}, {'float': [126, 127]}...  [{'float': [5000, 126]}, {'float': [126, 127]}...
    args_output_type_shape                                  [{'float': [5000, 127]}]                           [{'float': [5000, 127]}]
    args_thread_scheduling_stats  {'main_thread': {'thread_pool_name': 'session-...  {'main_thread': {'thread_pool_name': 'session-...

This is a matrix product of about *5000x126* by *126x127*, as the input
shapes above show.
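A rough order-of-magnitude check of what one call to that product
costs, based on the duration reported above (the throughput figure is
only indicative):

.. code:: ipython3

    # cost of one call to the big Gemm: (5000, 126) @ (126, 127)
    m, k, n = 5000, 126, 127
    flops = 2 * m * k * n   # one multiplication and one addition per term
    dur_s = 4603e-6         # slowest call above, in seconds
    print(f"{flops / 1e6:.0f} MFLOP, {flops / dur_s / 1e9:.1f} GFLOP/s")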
.. code:: ipython3

    gr_dur / gr_dur.dur.sum()
.. parsed-literal::

                                      dur
    args_op_name name
    MatMul       Ma_MatMul2      0.034561
    Mul          Mu_Mul          0.065894
    Sigmoid      Si_Sigmoid      0.075714
    Mul          Mu_Mul1         0.077254
    Sigmoid      Si_Sigmoid1     0.079584
    Gemm         gemm            0.161958
                 gemm_token_0    0.505035
.. code:: ipython3

    r = (gr_dur / gr_dur.dur.sum()).dur.max()
    r

.. parsed-literal::

    0.5050352082154203

This single Gemm takes about half of the total time. According to the
sparse-matrix experiment above, its execution time could be reduced
roughly tenfold by replacing the dense product with a sparse one (the
ratio measured above is about seven). That alone will not be enough to
make this neural network competitive: it runs in 3.75 ms compared with
186 µs for the decision tree in ONNX. With this optimization, the time
could drop to:

.. code:: ipython3

    t = 3.75  # ms
    t * (1 - r) + r * t / 12

.. parsed-literal::

    2.013941471759493

That is a real reduction of the computation time. Not bad, but not
enough.

Hummingbird
-----------

`hummingbird `__ is a library that converts a decision tree into a
neural network. Let's look at how it performs.

.. code:: ipython3

    from hummingbird.ml import convert

    model = convert(tree, 'torch')
    expected = tree.predict(x_exp)
    got = model.predict(x_exp)
    numpy.abs(got - expected).max(), numpy.abs(got - expected).mean()

.. parsed-literal::

    C:\xavierdupre\__home_\github_fork\scikit-learn\sklearn\utils\deprecation.py:103: FutureWarning: The attribute `n_features_` is deprecated in 1.0 and will be removed in 1.2. Use `n_features_in_` instead.
      warnings.warn(msg, category=FutureWarning)

.. parsed-literal::

    (4.3419181139370266e-08, 4.430287026515114e-09)

The result is much more faithful to the original model.

.. code:: ipython3

    %timeit model.predict(x_exp)

.. parsed-literal::

    1.17 ms ± 34.8 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

It remains slower than the ONNX decision tree but is much faster than
the manual solution proposed in the previous paragraphs. The converted
model holds a ``model`` attribute.

.. code:: ipython3

    from torch.nn import Module

    isinstance(model.model, Module)

.. parsed-literal::

    True

We convert this model to ONNX.

.. code:: ipython3

    import torch.onnx

    x = torch.randn(x_exp.shape[0], x_exp.shape[1], requires_grad=True)
    torch.onnx.export(model.model, x, 'tree_torch.onnx', opset_version=15,
                      input_names=['X'], output_names=['variable'],
                      dynamic_axes={'X': {0: 'batch_size'},
                                    'variable': {0: 'batch_size'}})

.. code:: ipython3

    import onnx

    onxh = onnx.load('tree_torch.onnx')

.. code:: ipython3

    print(onnx_simple_text_plot(onxh, raise_exc=False))
.. parsed-literal::

    opset: domain='' version=15
    input: name='X' type=dtype('float32') shape=['batch_size', 10]
    init: name='_operators.0.root_nodes' type=dtype('int64') shape=(0,) -- array([8], dtype=int64)
    init: name='_operators.0.root_biases' type=dtype('float32') shape=(0,) -- array([0.00792999], dtype=float32)
    init: name='_operators.0.tree_indices' type=dtype('int64') shape=(0,) -- array([0], dtype=int64)
    init: name='_operators.0.leaf_nodes' type=dtype('float32') shape=(0,) -- array([
        1.0096807 ,  0.6169403 ,  0.61055773,  0.37810475,  0.31796893,  0.13317925,  0.0193846 , -0.2317742 ,
        0.39089343,  0.23506087,  0.3711936 ,  0.10317916,  0.14956598, -0.14193445, -0.05965868, -0.27377078,
        0.4128183 ,  0.19658326,  0.25545415,  0.08118545,  0.08400188, -0.1502193 , -0.36846825, -0.79687625,
        0.35822242,  0.49021915,  0.30870998,  0.01033915,  0.6740977 ,  0.6740977 , -0.15315758, -0.41128033,
        0.42920846,  0.13145493,  0.21853392, -0.10986731,  0.4493652 ,  0.11318789,  0.12666471, -0.0623082 ,
        0.2872893 ,  0.09948976,  0.11439473, -0.08801427,  0.16091613, -0.02319027, -0.10097775, -0.37583745,
        0.18612385, -0.00453244,  0.3287116 , -0.1499349 ,  0.7919218 ,  0.04704398, -0.15423109, -0.43160027,
        0.10802375, -0.1073833 , -0.07759219, -0.29175794, -0.1528881 , -0.4909434 , -0.23361537, -0.43578717,
        0.7831867 ,  0.45349318,  0.34956965, -0.3199535 ,  0.3061573 , -0.34267113,  0.34963542,  0.04491445,
        0.35399815,  0.14815213,  0.06678926, -0.16095412,  0.3214274 ,  0.01484008, -0.1012276 , -0.3257699 ,
        0.26727676,  0.01970094,  0.10760042, -0.09169976,  0.20044112, -0.0324069 , -0.11015374, -0.28358367,
        0.8083656 ,  0.13358633, -0.07912118, -0.27182895, -0.07054728, -0.24895027, -0.20600456, -0.42033467,
        0.34701794, -0.0638995 ,  0.14252576, -0.06025055,  0.4228329 ,  0.06789401,  0.03919645, -0.17267554,
        0.07274943, -0.487512  ,  0.04517636, -0.18857062, -0.03975222, -0.2652712 , -0.30853328, -0.50844556,
        0.03321444, -0.15481217, -0.20701212, -0.40578464, -0.25884995, -0.46550158, -0.4797585 , -0.7324234 ,
        0.43939307, -0.06170902, -0.51546025, -0.19215119, -0.3705445 , -0.57504356, -0.6372961 , -0.9345571 ], dtype=float32)
    init: name='_operators.0.nodes.0' type=dtype('int64') shape=(0,) -- array([0, 3], dtype=int64)
    init: name='_operators.0.nodes.1' type=dtype('int64') shape=(0,) -- array([1, 2, 5, 9], dtype=int64)
    init: name='_operators.0.nodes.2' type=dtype('int64') shape=(0,) -- array([5, 6, 3, 7, 2, 0, 7, 1], dtype=int64)
    init: name='_operators.0.nodes.3' type=dtype('int64') shape=(0,) -- array([3, 9, 5, 3, 6, 4, 1, 3, 6, 6, 1, 6, 5, 4, 6, 2], dtype=int64)
    init: name='_operators.0.nodes.4' type=dtype('int64') shape=(0,) -- array([
        3, 2, 7, 6, 2, 4, 7, 8, 9, 5, 7, 8, 9, 4, 6, 9, 7, 9, 0, 7, 7, 9, 2, 7, 6, 4, 6, 5, 4, 0, 6, 0], dtype=int64)
    init: name='_operators.0.nodes.5' type=dtype('int64') shape=(0,) -- array([
        2, 8, 7, 6, 6, 3, 4, 9, 7, 3, 2, 6, 3, 3, 0, 1, 1, 0, 4, 7, 9, 5, 7, 9, 5, 3, 5, 9, 0, 5, 1, 4,
        9, 4, 7, 7, 1, 9, 1, 1, 6, 2, 7, 7, 6, 1, 4, 4, 0, 0, 9, 8, 8, 2, 6, 2, 0, 3, 4, 2, 5, 6, 7, 3], dtype=int64)
    init: name='_operators.0.biases.0' type=dtype('float32') shape=(0,) -- array([ 0.19169255, -0.12246682], dtype=float32)
    init: name='_operators.0.biases.1' type=dtype('float32') shape=(0,) -- array([-0.40610337, -0.1467492 , -0.01880287,  0.15879431], dtype=float32)
    init: name='_operators.0.biases.2' type=dtype('float32') shape=(0,) -- array([
        0.736786  , -0.32427853,  0.30860555,  0.17994082,  0.6917758 , -0.00594712,  0.35950053, -0.9819274 ], dtype=float32)
    init: name='_operators.0.biases.3' type=dtype('float32') shape=(0,) -- array([
       -1.3495584 , -1.082793  , -0.6906011 , -0.08978076, -0.4007622 ,  0.10756078, -0.68507075,  0.15814054,
        0.5132364 , -0.18426335,  0.13685235,  0.10721841,  0.01814443, -0.41644228, -0.59770894,  0.607365  ], dtype=float32)
    init: name='_operators.0.biases.4' type=dtype('float32') shape=(0,) -- array([
        1.4203796 , -0.49269757, -0.12210988, -0.09692484,  0.5076643 , -1.3609421 ,  1.154743  ,  2.8748922 ,
       -0.08181615,  0.7741028 ,  0.20604724,  0.666296  , -0.6474025 ,  0.6459148 ,  0.02262808, -0.42282397,
        0.46360654, -0.10058792,  0.25486696,  0.60041225, -0.06933744,  0.21294908,  0.96443814,  0.07923891,
        0.4797698 ,  1.2852331 ,  0.24348404, -0.3404966 , -0.07175394, -0.8248828 , -0.74071133, -1.2140133 ], dtype=float32)
    init: name='_operators.0.biases.5' type=dtype('float32') shape=(0,) -- array([
        1.0626682 ,  1.4745288 ,  0.01898679,  0.5451088 ,  0.15444604,  1.0631477 , -0.7555804 , -1.7192128 ,
       -0.20905146,  0.19752283, -0.40471953,  0.13069782,  0.60331047,  1.5060809 ,  0.        , -1.8283446 ,
       -0.8124372 , -1.381897  ,  0.59209645,  0.3239226 , -0.42840806, -0.43624896,  0.58229303, -1.0196047 ,
       -0.5632828 ,  0.91483426,  1.8038778 , -0.5665638 , -1.2530733 , -0.6500004 , -1.3069727 ,  0.48267984,
        0.73503745, -1.871724  , -1.4965518 ,  1.3147466 ,  0.03919952, -0.885836  ,  0.5479692 , -0.8086383 ,
       -0.74240863,  0.14582941,  0.6496967 , -0.00911551,  2.4541488 , -0.90482277,  0.26108736,  0.7569448 ,
       -1.0786855 , -0.45229852,  1.2146595 , -0.6756766 , -2.3066258 ,  0.7911504 ,  0.57490873, -0.40741247,
        0.24633038, -1.2022957 , -0.65162694, -0.04244827,  1.558136  , -1.6220782 ,  0.1574643 , -1.4209061 ], dtype=float32)
    Constant(value=[-1]) -> onnx::Reshape_27
    Gather(X, _operators.0.root_nodes, axis=1) -> onnx::LessOrEqual_17
    LessOrEqual(onnx::LessOrEqual_17, _operators.0.root_biases) -> onnx::Cast_18
    Cast(onnx::Cast_18, to=7) -> onnx::Add_19
    Add(onnx::Add_19, _operators.0.tree_indices) -> onnx::Reshape_20
    Constant(value=[-1]) -> onnx::Reshape_21
    Reshape(onnx::Reshape_20, onnx::Reshape_21, allowzero=0) -> onnx::Gather_22
    Gather(_operators.0.nodes.0, onnx::Gather_22, axis=0) -> onnx::Reshape_23
    Constant(value=[-1, 1]) -> onnx::Reshape_24
    Reshape(onnx::Reshape_23, onnx::Reshape_24, allowzero=0) -> onnx::GatherElements_25
    GatherElements(X, onnx::GatherElements_25, axis=1) -> onnx::Reshape_26
    Reshape(onnx::Reshape_26, onnx::Reshape_27, allowzero=0) -> onnx::LessOrEqual_28
    Constant(value=2) -> onnx::Mul_29
    Mul(onnx::Gather_22, onnx::Mul_29) -> onnx::Add_30
    Gather(_operators.0.biases.0, onnx::Gather_22, axis=0) -> onnx::LessOrEqual_31
    LessOrEqual(onnx::LessOrEqual_28, onnx::LessOrEqual_31) -> onnx::Cast_32
    Cast(onnx::Cast_32, to=7) -> onnx::Add_33
    Add(onnx::Add_30, onnx::Add_33) -> onnx::Gather_34
    Gather(_operators.0.nodes.1, onnx::Gather_34, axis=0) -> onnx::Reshape_35
    Constant(value=[-1, 1]) -> onnx::Reshape_36
    Reshape(onnx::Reshape_35, onnx::Reshape_36, allowzero=0) -> onnx::GatherElements_37
    GatherElements(X, onnx::GatherElements_37, axis=1) -> onnx::Reshape_38
    Constant(value=[-1]) -> onnx::Reshape_39
    Reshape(onnx::Reshape_38, onnx::Reshape_39, allowzero=0) -> onnx::LessOrEqual_40
    Constant(value=2) -> onnx::Mul_41
    Mul(onnx::Gather_34, onnx::Mul_41) -> onnx::Add_42
    Gather(_operators.0.biases.1, onnx::Gather_34, axis=0) -> onnx::LessOrEqual_43
    LessOrEqual(onnx::LessOrEqual_40, onnx::LessOrEqual_43) -> onnx::Cast_44
    Cast(onnx::Cast_44, to=7) -> onnx::Add_45
    Add(onnx::Add_42, onnx::Add_45) -> onnx::Gather_46
    Gather(_operators.0.nodes.2, onnx::Gather_46, axis=0) -> onnx::Reshape_47
    Constant(value=[-1, 1]) -> onnx::Reshape_48
    Reshape(onnx::Reshape_47, onnx::Reshape_48, allowzero=0) -> onnx::GatherElements_49
    GatherElements(X, onnx::GatherElements_49, axis=1) -> onnx::Reshape_50
    Constant(value=[-1]) -> onnx::Reshape_51
    Reshape(onnx::Reshape_50, onnx::Reshape_51, allowzero=0) -> onnx::LessOrEqual_52
    Constant(value=2) -> onnx::Mul_53
    Mul(onnx::Gather_46, onnx::Mul_53) -> onnx::Add_54
    Gather(_operators.0.biases.2, onnx::Gather_46, axis=0) -> onnx::LessOrEqual_55
    LessOrEqual(onnx::LessOrEqual_52, onnx::LessOrEqual_55) -> onnx::Cast_56
    Cast(onnx::Cast_56, to=7) -> onnx::Add_57
    Add(onnx::Add_54, onnx::Add_57) -> onnx::Gather_58
    Gather(_operators.0.nodes.3, onnx::Gather_58, axis=0) -> onnx::Reshape_59
    Constant(value=[-1, 1]) -> onnx::Reshape_60
    Reshape(onnx::Reshape_59, onnx::Reshape_60, allowzero=0) -> onnx::GatherElements_61
    GatherElements(X, onnx::GatherElements_61, axis=1) -> onnx::Reshape_62
    Constant(value=[-1]) -> onnx::Reshape_63
    Reshape(onnx::Reshape_62, onnx::Reshape_63, allowzero=0) -> onnx::LessOrEqual_64
    Constant(value=2) -> onnx::Mul_65
    Mul(onnx::Gather_58, onnx::Mul_65) -> onnx::Add_66
    Gather(_operators.0.biases.3, onnx::Gather_58, axis=0) -> onnx::LessOrEqual_67
    LessOrEqual(onnx::LessOrEqual_64, onnx::LessOrEqual_67) -> onnx::Cast_68
    Cast(onnx::Cast_68, to=7) -> onnx::Add_69
    Add(onnx::Add_66, onnx::Add_69) -> onnx::Gather_70
    Gather(_operators.0.nodes.4, onnx::Gather_70, axis=0) -> onnx::Reshape_71
    Constant(value=[-1, 1]) -> onnx::Reshape_72
    Reshape(onnx::Reshape_71, onnx::Reshape_72, allowzero=0) -> onnx::GatherElements_73
    GatherElements(X, onnx::GatherElements_73, axis=1) -> onnx::Reshape_74
    Constant(value=[-1]) -> onnx::Reshape_75
    Reshape(onnx::Reshape_74, onnx::Reshape_75, allowzero=0) -> onnx::LessOrEqual_76
    Constant(value=2) -> onnx::Mul_77
    Mul(onnx::Gather_70, onnx::Mul_77) -> onnx::Add_78
    Gather(_operators.0.biases.4, onnx::Gather_70, axis=0) -> onnx::LessOrEqual_79
    LessOrEqual(onnx::LessOrEqual_76, onnx::LessOrEqual_79) -> onnx::Cast_80
    Cast(onnx::Cast_80, to=7) -> onnx::Add_81
    Add(onnx::Add_78, onnx::Add_81) -> onnx::Gather_82
    Gather(_operators.0.nodes.5, onnx::Gather_82, axis=0) -> onnx::Reshape_83
    Constant(value=[-1, 1]) -> onnx::Reshape_84
    Reshape(onnx::Reshape_83, onnx::Reshape_84, allowzero=0) -> onnx::GatherElements_85
    GatherElements(X, onnx::GatherElements_85, axis=1) -> onnx::Reshape_86
    Constant(value=[-1]) -> onnx::Reshape_87
    Reshape(onnx::Reshape_86, onnx::Reshape_87, allowzero=0) -> onnx::LessOrEqual_88
    Constant(value=2) -> onnx::Mul_89
    Mul(onnx::Gather_82, onnx::Mul_89) -> onnx::Add_90
    Gather(_operators.0.biases.5, onnx::Gather_82, axis=0) -> onnx::LessOrEqual_91
    LessOrEqual(onnx::LessOrEqual_88, onnx::LessOrEqual_91) -> onnx::Cast_92
    Cast(onnx::Cast_92, to=7) -> onnx::Add_93
    Add(onnx::Add_90, onnx::Add_93) -> onnx::Gather_94
    Gather(_operators.0.leaf_nodes, onnx::Gather_94, axis=0) -> onnx::Reshape_95
    Constant(value=[-1, 1, 1]) -> onnx::Reshape_96
    Reshape(onnx::Reshape_95, onnx::Reshape_96, allowzero=0) -> output
    Constant(value=[1]) -> onnx::ReduceSum_98
    ReduceSum(output, onnx::ReduceSum_98, keepdims=0) -> variable
    output: name='variable' type=dtype('float32') shape=['batch_size', 'ReduceSumvariable_dim_1']

.. code:: ipython3

    %onnxview onxh

The library reimplements the decision of a decision tree with one set
of matrix operations per level of the tree. All thresholds are
evaluated. The matrices do not need to be sparse because only the
required features are gathered. The decision threshold is implemented
with an actual comparison and not a sigmoid, so this model is
identical, in terms of predictions, to the initial model.
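That per-level pattern is visible in the graph above: a ``Gather``
picks the feature tested by each sample's current node,
``GatherElements`` extracts its value, ``LessOrEqual`` compares it to
the node's threshold, and the 0/1 outcome selects the node of the next
level. A minimal numpy sketch of this evaluation scheme (the function
and its arguments are illustrative, not hummingbird's actual code):

.. code:: ipython3

    import numpy

    def level_step(X, node_ids, feature_ids, thresholds):
        # Gather: the feature tested by each sample's current node
        feats = feature_ids[node_ids]
        # GatherElements: the value of that feature for each sample
        vals = X[numpy.arange(X.shape[0]), feats]
        # LessOrEqual + Cast: the comparison outcome as 0/1
        test = (vals <= thresholds[node_ids]).astype(numpy.int64)
        # index of the node in the next level's table: 2 * node + outcome
        return 2 * node_ids + test

    # one step from the root, using the root tables shown above
    sample = x_exp32[:5]
    root_ids = numpy.zeros(sample.shape[0], dtype=numpy.int64)
    level_step(sample, root_ids,
               numpy.array([8]), numpy.array([0.00792999], dtype=numpy.float32))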
.. code:: ipython3

    oinfh = OnnxInference(onxh, runtime='onnxruntime1')
    expected = tree.predict(x_exp)
    got = oinfh.run({'X': x_exp.astype(numpy.float32)})['variable']
    numpy.abs(got - expected).max()

.. parsed-literal::

    1.7421041873949668

The conversion to float32 remains imperfect here as well.

.. code:: ipython3

    %timeit oinfh.run({'X': x_exp32})['variable']

.. parsed-literal::

    3.13 ms ± 445 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

And this computation time is also longer.

Training
--------

The idea behind all this is also to be able to re-estimate the
coefficients of the neural network once the tree has been converted.

.. code:: ipython3

    x_train = X_train[:100]
    expected = tree.predict(x_train)
    reg = NeuralTreeNetRegressor(trees[1], verbose=1, max_iter=10, lr=1e-4)

.. code:: ipython3

    got = reg.predict(x_train)
    numpy.abs(got - expected).max(), numpy.abs(got - expected).mean()

.. parsed-literal::

    (1.0246115055833722, 0.24094382754240642)

The difference is large.

.. code:: ipython3

    reg.fit(x_train, expected)

.. parsed-literal::

    0/10: loss: 3.201 lr=0.0001 max(coef): 6.5 l1=0/1.5e+03 l2=0/2.5e+03
    1/10: loss: 2.593 lr=9.95e-06 max(coef): 6.5 l1=2e+03/1.5e+03 l2=1.3e+03/2.5e+03
    2/10: loss: 2.506 lr=7.05e-06 max(coef): 6.5 l1=1.4e+02/1.5e+03 l2=6.2/2.5e+03
    3/10: loss: 2.461 lr=5.76e-06 max(coef): 6.5 l1=1.2e+03/1.5e+03 l2=6.8e+02/2.5e+03
    4/10: loss: 2.429 lr=4.99e-06 max(coef): 6.5 l1=6.5e+02/1.5e+03 l2=2.1e+02/2.5e+03
    5/10: loss: 2.405 lr=4.47e-06 max(coef): 6.5 l1=1.9e+02/1.5e+03 l2=13/2.5e+03
    6/10: loss: 2.392 lr=4.08e-06 max(coef): 6.5 l1=1.6e+02/1.5e+03 l2=6.8/2.5e+03
    7/10: loss: 2.375 lr=3.78e-06 max(coef): 6.5 l1=1.8e+02/1.5e+03 l2=9.5/2.5e+03
    8/10: loss: 2.358 lr=3.53e-06 max(coef): 6.5 l1=1.1e+02/1.5e+03 l2=7/2.5e+03
    9/10: loss: 2.345 lr=3.33e-06 max(coef): 6.5 l1=3.7e+02/1.5e+03 l2=56/2.5e+03
    10/10: loss: 2.333 lr=3.16e-06 max(coef): 6.5 l1=6.1e+02/1.5e+03 l2=1.3e+02/2.5e+03
.. parsed-literal::

    NeuralTreeNetRegressor(estimator=None, lr=0.0001, max_iter=10, verbose=1)
.. code:: ipython3

    got = reg.predict(x_train)
    numpy.abs(got - expected).max(), numpy.abs(got - expected).mean()

.. parsed-literal::

    (1.256860512819292, 0.25663312220721907)

It does not work as well as expected. It would probably take more
iterations and some tuning of the training parameters.