XGNN: Towards Model-Level Explanations of Graph Neural Networks

Hao Yuan (hao.yuan@tamu.edu), Texas A&M University, College Station, Texas, United States 77840
Jiliang Tang (tangjili@msu.edu), Michigan State University, East Lansing, Michigan, United States 48824
Xia Hu (hu@cse.tamu.edu), Texas A&M University, College Station, Texas, United States 77840
Shuiwang Ji (sji@tamu.edu), Texas A&M University, College Station, Texas, United States 77840

(2020)

Abstract.

Graph neural networks (GNNs) learn node features by aggregating and combining neighbor information, and they have achieved promising performance on many graph tasks. However, GNNs are mostly treated as black boxes and lack human-intelligible explanations. As long as GNN models cannot be explained, they cannot be fully trusted or used in certain application domains. In this work, we propose a novel approach, known as XGNN, to interpret GNNs at the model-level. Our approach can provide high-level insights and a generic understanding of how GNNs work. In particular, we propose to explain GNNs by training a graph generator so that the generated graph patterns maximize a certain prediction of the model. We formulate the graph generation as a reinforcement learning task, where for each step, the graph generator predicts how to add an edge into the current graph. The graph generator is trained via a policy gradient method based on information from the trained GNNs. In addition, we incorporate several graph rules to encourage the generated graphs to be valid. Experimental results on both synthetic and real-world datasets show that our proposed methods help understand and verify the trained GNNs. Furthermore, our experimental results indicate that the generated graphs can provide guidance on how to improve the trained GNNs.

Keywords: Deep learning, Interpretability, Graph Neural Networks

Published in: Proceedings of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’20), August 23–27, 2020, Virtual Event, CA, USA. DOI: 10.1145/3394486.3403085. ISBN: 978-1-4503-7998-4/20/08. CCS concepts: Computing methodologies (Neural networks; Artificial intelligence); Mathematics of computing (Graph algorithms).

1. Introduction

Graph Neural Networks (GNNs) have shown their effectiveness and achieved state-of-the-art performance on different graph tasks, such as node classification (Gao and Ji, 2019b; Veličković et al., 2018), graph classification (Xu et al., 2019; Zhang et al., 2018), and link prediction (Zhang and Chen, 2018). In addition, extensive efforts have been made towards different graph operations, such as graph convolution (Kipf and Welling, 2017; Gilmer et al., 2017; Hamilton et al., 2017), graph pooling (Yuan and Ji, 2020; Lee et al., 2019), and graph attention (Veličković et al., 2018; Thekumparampil et al., 2018; Gao and Ji, 2019a). Since graph data widely exist in different real-world applications, such as social networks, chemistry, and biology, GNNs are becoming increasingly important and useful. Despite their great performance, GNNs share the same drawback as other deep learning models; that is, they are usually treated as black boxes and lack human-intelligible explanations. Without understanding and verifying the inner working mechanisms, GNNs cannot be fully trusted, which prevents their use in critical applications pertaining to fairness, privacy, and safety (Doshi-Velez and Kim, 2017; Ying et al., 2019). For example, we can train a GNN model to predict the effects of drugs where we treat each drug as a molecular graph. Without exploring the working mechanisms, we do not know what chemical groups in a molecular graph lead to the predictions. Then we cannot verify whether the rules of the GNN model are consistent with real-world chemical rules, and hence we cannot fully trust the GNN model. This raises the need to develop interpretation techniques for GNNs.

Recently, several interpretation techniques have been proposed to explain deep learning models on image and text data. Depending on what kind of interpretations are provided, existing techniques can be categorized into example-level (Simonyan et al., 2013; Smilkov et al., 2017; Yuan et al., 2019; Selvaraju et al., 2017; Zhou et al., 2016; Zeiler and Fergus, 2014; Fong and Vedaldi, 2017; Dabkowski and Gal, 2017) or model-level (Erhan et al., 2009; Nguyen et al., 2017; Nguyen et al., 2015) methods. Example-level interpretations explain the prediction for a given input example, by determining important features in the input or the decision procedure for this input through the model. Common techniques in this category include gradient-based methods (Simonyan et al., 2013; Smilkov et al., 2017; Yuan et al., 2019), visualizations of intermediate feature maps (Selvaraju et al., 2017; Zhou et al., 2016), and occlusion-based methods (Zeiler and Fergus, 2014; Fong and Vedaldi, 2017; Dabkowski and Gal, 2017). Instead of providing input-dependent explanations, model-level interpretations aim to explain the general behavior of the model by investigating what input patterns can lead to a certain prediction, without respect to any specific input example. Input optimization (Erhan et al., 2009; Nguyen et al., 2017; Nguyen et al., 2015; Olah et al., 2017) is the most popular model-level interpretation method. These two categories of interpretation methods aim at explaining deep models from different views. Since the ultimate goal of interpretations is to verify and understand deep models, we need to manually check the interpretation results and conclude whether the deep models work in the way we expect. For example-level methods, we may need to explore the explanations for a large number of examples before we can trust the models. However, this is time-consuming and requires extensive expert effort. For model-level methods, the explanations are more general and high-level, and hence need less human supervision. However, the explanations of model-level methods are less precise compared with example-level interpretations. Overall, both model-level and example-level methods are important for interpreting and understanding deep models.

Interpreting deep learning models on graph data has become increasingly important but is still less explored. To the best of our knowledge, there is no existing study on interpreting GNNs at the model-level. Existing studies (Ying et al., 2019; Baldassarre and Azizpour, 2019) only provide example-level explanations for graph models. As a radical departure from existing work, we propose a novel interpretation technique, known as XGNN, for explaining deep graph models at the model-level. We propose to investigate what graph patterns can maximize a certain prediction. Specifically, we propose to train a graph generator such that the generated graph patterns can be used to explain deep graph models. We formulate it as a reinforcement learning problem in which, at each step, the graph generator predicts how to add an edge to a given graph and form a new graph. Then the generator is trained based on the feedback from the trained graph models using policy gradient (Sutton et al., 2000). We also incorporate several graph rules to encourage the generated graphs to be valid. Note that the graph generation part in our XGNN framework can be generalized to any suitable graph generation method, determined by the dataset at hand and the GNNs to be interpreted. Finally, we train GNN models on both real-world and synthetic datasets that yield good performance. Then we employ our proposed XGNN to explain these trained models. Experimental results show that our proposed XGNN can find the desired graph patterns and explain these models. With our generated graph patterns, we can verify, understand, and even improve the trained GNN models.

2. Related Work

2.1. Graph Neural Networks

Graphs are widely employed to represent data in different real-world domains and graph neural networks have shown promising performance on these data. Different from image and text data, a graph is represented by a feature matrix and an adjacency matrix. Formally, a graph $G$ with $n$ nodes is represented by its feature matrix $X\in\mathbb{R}^{n\times d}$ and its adjacency matrix $A\in\{0,1\}^{n\times n}$. Note that we assume each node has a $d$-dimensional vector to represent its features. Graph neural networks learn node features based on these matrices. Even though there are several variants of GNNs, such as graph convolution networks (GCNs) (Kipf and Welling, 2017), graph attention networks (GATs) (Veličković et al., 2018), and graph isomorphism networks (GINs) (Xu et al., 2019), they share a similar feature learning strategy. For each node, GNNs update its node features by aggregating the features from its neighbors and combining them with its own features. We take GCNs as an example to illustrate the neighborhood information aggregation scheme. The operation of GCNs is defined as

(1)

X_{i+1}=f(D^{-\frac{1}{2}}\hat{A}D^{-\frac{1}{2}}X_{i}W_{i}),

where $X_{i}\in\mathbb{R}^{n\times d_{i}}$ and $X_{i+1}\in\mathbb{R}^{n\times d_{i+1}}$ are the input and output feature matrices of the $i^{th}$ graph convolution layer. In addition, $\hat{A}=A+I$ is used to add self-loops to the adjacency matrix, and $D$ denotes the diagonal node degree matrix used to normalize $\hat{A}$. The matrix $W_{i}\in\mathbb{R}^{d_{i}\times d_{i+1}}$ is a trainable matrix for layer $i$ and is used to perform linear feature transformation, and $f(\cdot)$ denotes a non-linear activation function. By stacking $j$ graph convolution layers, the $j$-hop neighborhood information can be aggregated. Due to its superior performance, we incorporate the graph convolution in Equation (1) as our graph neural network operator.
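
As an illustration of Equation (1), the following is a minimal NumPy sketch of a single graph convolution. It is not the authors' implementation: the function name `gcn_layer`, the `tanh` activation, and the toy path graph are assumptions made for the example, and $D$ is computed from $\hat{A}$ as in Kipf and Welling (2017).

```python
import numpy as np

def gcn_layer(X, A, W, activation=np.tanh):
    """One graph convolution following Equation (1)."""
    A_hat = A + np.eye(A.shape[0])        # add self-loops: A_hat = A + I
    deg = A_hat.sum(axis=1)               # degrees of A_hat (always >= 1)
    D_inv_sqrt = np.diag(deg ** -0.5)     # D^{-1/2}
    return activation(D_inv_sqrt @ A_hat @ D_inv_sqrt @ X @ W)

# Toy path graph 0 - 1 - 2 with 2-dimensional node features (assumed data).
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
X = np.arange(6, dtype=float).reshape(3, 2)
W = np.random.default_rng(0).normal(size=(2, 4))   # d_i = 2, d_{i+1} = 4
X_next = gcn_layer(X, A, W)                         # shape (3, 4)
```

Each output row mixes a node's own features with those of its direct neighbors, so applying the function $j$ times reaches the $j$-hop neighborhood.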

2.2. Model-level Interpretations

Next, we briefly discuss popular model-level interpretation techniques for deep learning models on image data, known as input optimization methods (Erhan et al., 2009; Nguyen et al., 2017; Nguyen et al., 2015; Olah et al., 2017). These methods generally generate optimized input that can maximize a certain behavior of deep models. They randomly initialize the input and iteratively update the input towards an objective, such as maximizing a class score. Then such optimized input can be regarded as the explanations for the target behavior. Such a procedure is known as optimization and is similar to training deep neural networks. The main difference is that in such input optimization techniques, all network parameters are fixed while the input is treated as trainable variables. While such methods can provide meaningful model-level explanations for deep models on images, they cannot be directly applied to interpret GNNs due to three challenges. First, the structural information of a graph is represented by a discrete adjacency matrix, which cannot be directly optimized via back-propagation. Second, for images, the optimized input is an abstract image and the visualization shows high-level patterns and meanings. In the case of graphs, the abstract graph is not meaningful and hard to visualize. Third, the obtained graphs may not be valid for chemical or biological rules since non-differentiable graph rules cannot be directly incorporated into optimization. For example, the node degree of an atom should not exceed its maximum chemical valency.
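
To make the input optimization procedure concrete, here is a toy NumPy sketch under strong simplifying assumptions: the "deep model" is replaced by a fixed linear softmax classifier with random weights, and the input is a plain real-valued vector. All names (`optimize_input`, `W`, `b`) are hypothetical. The model parameters stay fixed while the input is updated by gradient ascent on the log-probability of the target class.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def optimize_input(W, b, target, steps=200, lr=0.1, seed=0):
    """Gradient ascent on log p(target | x) with the model (W, b) held fixed."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=W.shape[1])            # random initial input
    history = []                               # target-class probability per step
    for _ in range(steps):
        p = softmax(W @ x + b)
        history.append(p[target])
        one_hot = np.eye(len(b))[target]
        x = x + lr * (W.T @ (one_hot - p))     # gradient of log p[target] w.r.t. x
    return x, history

rng = np.random.default_rng(1)
W, b = rng.normal(size=(3, 5)), np.zeros(3)    # toy 3-class linear classifier
x_star, history = optimize_input(W, b, target=2)
```

The update relies on the input being continuous. A binary adjacency matrix offers no such gradient step, which is the first of the three challenges listed above.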

2.3. Graph Model Interpretations

To the best of our knowledge, there are only a few existing studies focusing on the interpretability of deep graph models (Ying et al., 2019; Baldassarre and Azizpour, 2019). The recent GNN interpretation tool GNN Explainer (Ying et al., 2019) proposes to explain deep graph models at the example-level by learning soft masks. For a given example, it applies soft masks to graph edges and node features and updates the masks such that the prediction remains the same as the original one. Then some graph edges and node features are selected by thresholding the masks, and they are treated as important edges and features for making the prediction for the given example.

Figure 1. Illustrations of our proposed XGNN for graph interpretation via graph generation. The GNNs represent a trained graph classification model that we try to explain. All graph examples in the graph set are classified to the third class. The left part shows that we can manually conclude the key graph patterns for the third class but it is challenging. The right part shows that we propose to train a graph generator to generate graphs that can maximize the class score and be valid according to graph rules.

The other work (Baldassarre and Azizpour, 2019) also focuses on the example-level interpretations of deep graph models. It applies several well-known image interpretation methods to graph models, such as sensitivity analysis (SA) (Gevrey et al., 2003), guided backpropagation (GBP) (Springenberg et al., 2014), and layer-wise relevance propagation (LRP) (Bach et al., 2015). The SA and GBP methods are based on the gradients while the LRP method computes the saliency maps by decomposing the output prediction into a combination of its inputs. In addition, both of these studies generate input-dependent explanations for individual examples. To verify and understand a deep model, humans need to check explanations for all examples, which is time-consuming or even not feasible.

While input-dependent explanations are important for understanding deep models, model-level interpretations should not be ignored. However, none of the existing work investigates the model-level interpretations of deep graph models. In this work, we argue that model-level interpretations can provide higher-level insights and a more general understanding of how a deep learning model works. Therefore, we aim at providing model-level interpretations for GNNs. We propose a novel method, known as XGNN, to explain GNNs by graph generation such that the generated graphs can maximize a certain behavior.

3. XGNN: Explainable Graph Neural Networks

3.1. Model-Level GNN Interpretation

Intuitively, given a trained GNN model, the model-level interpretations for it should explain what graph patterns or sub-graph patterns lead to a certain prediction. For example, one possible type of pattern is the network motif. Network motifs represent simple building blocks of complex networks (graphs) and widely exist in graphs from biochemistry, neurobiology, ecology, and engineering (Milo et al., 2002; Alon, 2006, 2007; Shen-Orr et al., 2002). Different motif sets can be found in graphs with different functions (Milo et al., 2002; Alon, 2006), which means different motifs may directly relate to the functions of graphs. However, it is still unknown whether GNNs make predictions based on such motifs or other graph information. By identifying the relationships between graph patterns and the predictions of GNNs, we can better understand the models and verify whether a model works as expected. Therefore, we propose our XGNN, which explains GNNs using such graph patterns. Specifically, in this work, we investigate the model-level interpretations of GNNs for graph classification tasks, and the graph patterns are obtained by graph generation.

Formally, let $f(\cdot)$ denote a trained GNN classification model, and $y\in\{c_{1},\cdots,c_{\ell}\}$ denote the classification prediction. Given $f(\cdot)$ and a chosen class $c_{i}$ , $i\in\{1,\cdots,\ell\}$ , our goal is to investigate what input graph patterns maximize the predicted probability for this class. The obtained patterns can be treated as model-level interpretations with respect to $c_{i}$ . Formally, the task can be defined as

(2)

G^{*}=\operatorname*{argmax}_{G}P(f(G)=c_{i}),

where $G^{*}$ is the optimized input graph we need. A popular way to obtain such optimized input for interpreting image and text models is known as input optimization (Yuan et al., 2019; Erhan et al., 2009; Nguyen et al., 2017; Nguyen et al., 2015; Olah et al., 2017). However, as discussed in Section 2.2, such optimization methods cannot be applied to interpret graph models because of the special representations of graph data. Instead, we propose to obtain the optimized graph $G^{*}$ via graph generation. The general illustration of our proposed method is shown in Figure 1. Given a pre-trained graph classification model, we interpret it by providing explanations for its third class. We may manually conclude the graph patterns from the graph dataset. By evaluating all graph examples in the dataset, we can obtain the graphs that are predicted to be the third class. Then we can manually check which graph patterns are common among these graphs. For example, the left part of Figure 1 shows that a set of four graphs are classified into the third class. Based on human observations, we know that the important graph pattern leading to the prediction is the triangle pattern consisting of a red node, a yellow node, and a blue node. However, such manual analysis is time-consuming and not applicable to large-scale and complex graph datasets. As shown in the right part, we propose to train a graph generator to generate graph patterns that can maximize the prediction score of the third class. In addition, we incorporate graph rules, such as the chemical valency check, to encourage valid and human-intelligible explanations. Finally, we can analyze the generated graphs to obtain model-level explanations for the third class. Compared with direct manual analysis of the original dataset, our proposed method generates small-scale and less complex graphs, which can significantly reduce the cost of further manual analysis.

3.2. Interpreting GNNs via Graph Generation

Recent advances in graph generation have led to many successful graph generation models, such as GraphGAN (Wang et al., 2018), ORGAN (Guimaraes et al., 2017), Junction Tree VAE (Jin et al., 2018), DGMG (Li et al., 2018), and Graph Convolutional Policy Network (GCPN) (You et al., 2018). Inspired by these methods, we propose to train a graph generator which generates $G^{*}$ step by step. For each step, the graph generator generates a new graph based on the current graph. Formally, we define the partially generated graph at step $t$ as $G_{t}$, which contains $n_{t}$ nodes. It is represented as a feature matrix $X_{t}\in\mathbb{R}^{n_{t}\times d}$ and an adjacency matrix $A_{t}\in\{0,1\}^{n_{t}\times n_{t}}$, assuming each node has a $d$-dimensional feature vector. Then we define a $\theta$-parameterized graph generator as $g_{\theta}(\cdot)$, which takes $G_{t}$ as input and outputs a new graph $G_{t+1}$ such that

(3)

X_{t+1},A_{t+1}=g_{\theta}(X_{t},A_{t}).

Then the generator is trained with the guidance from the pre-trained GNNs $f(\cdot)$ . Since generating the new graph $G_{t+1}$ from $G_{t}$ is non-differentiable, we formulate the generation procedure as a reinforcement learning problem. Specifically, assuming there are $k$ types of nodes in the dataset, we define a candidate set $C=\{s_{1},s_{2},\cdots,s_{k}\}$ denoting these possible node types. For example, in a chemical molecular dataset, the candidate set can be $C=\{Carbon,Nitrogen,\cdots,Oxygen,Fluorine\}$ . In a social network dataset where nodes are not labeled, the candidate set only contains a single node type. Then at each step $t$ , based on the partially generated graph $G_{t}$ , the generator $g(\cdot)$ generates $G_{t+1}$ by predicting how to add an edge to the current graph $G_{t}$ . Note that the generator may add an edge between two nodes in the current graph $G_{t}$ or add a node from the candidate set $C$ to the current graph $G_{t}$ and connect it with an existing node in $G_{t}$ . Formally, we formulate it as a reinforcement learning problem, which consists of four elements: state, action, policy, and reward.

State: The state of the reinforcement learning environment at step $t$ is the partially generated graph $G_{t}$ . The initial graph at the first step can be either a random node from the candidate set $C$ or manually designed based on prior domain knowledge. For example, for the dataset describing organic molecules, we can set the initial graph as a single node labeled with carbon atom since any organic compound contains carbon generally (Seager and Slabaugh, 2013).

Action: The action at step $t$ , denoted as $a_{t}$ , is to generate the new graph $G_{t+1}$ based on the current graph $G_{t}$ . Specifically, given the current state $G_{t}$ , the action $a_{t}$ is to add an edge to $G_{t}$ by determining the starting node and the ending node of the edge. Note that the starting node $a_{t,start}$ can be any node from the current graph $G_{t}$ while the ending node $a_{t,end}$ is selected from the union of the current graph $G_{t}$ and the candidate set $C$ excluding the selected starting node $a_{t,start}$ , denoted as $(G_{t}\bigcup C)\setminus a_{t,start}$ . Note that with the predefined maximum action step and maximum node number, we can control the termination of graph generation.

Policy: We employ graph neural networks to serve as the policy. The policy determines the action $a_{t}$ based on the state $G_{t}$ . Specifically, the policy is the graph generator $g_{\theta}(\cdot)$ , which takes $G_{t}$ and $C$ as the input and outputs the probabilities of possible actions. With the reward function, the generator $g_{\theta}(\cdot)$ can be trained via policy gradient (Sutton et al., 2000).

Reward: The reward for step $t$, denoted as $R_{t}$, is employed to evaluate the action at step $t$ and consists of two parts. The first part is the guidance from the trained GNNs $f(\cdot)$, which encourages the generated graph to maximize the class score of class $c_{i}$. By feeding the generated graphs to $f(\cdot)$, we can obtain the predicted probabilities for class $c_{i}$ and use them as the feedback to update $g_{\theta}(\cdot)$. The second part encourages the generated graphs to be valid in terms of certain graph rules. For example, for social network datasets, it may not be allowed to add multiple edges between two nodes. In addition, for chemical molecular datasets, the degree of an atom cannot exceed its chemical valency. Note that for each step, we include both intermediate rewards and overall rewards to evaluate the action.

While we formulate the graph generation as a reinforcement learning problem, it is noteworthy that our proposed XGNN is a novel and general framework for interpreting GNNs at the model-level. The graph generation part in this framework can be generalized to any suitable graph generation method, determined by the dataset at hand and the GNNs to be interpreted.
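
The state and action definitions above can be sketched as a small state-transition function. This is an illustrative sketch only: the graph is stored as a list of node types plus a set of undirected edges, and the name `apply_action` and the index convention (the ending node indexes the list of current nodes followed by the candidate types) are assumptions, not details given in the text.

```python
def apply_action(node_types, edges, candidates, start, end):
    """Return (node_types, edges) of G_{t+1} after adding one edge to G_t.

    `start` indexes a node of G_t; `end` indexes the merged list
    [nodes of G_t] + [candidate types], so end >= n_t selects a new node.
    """
    n_t = len(node_types)
    if not 0 <= start < n_t:
        raise ValueError("starting node must belong to the current graph")
    if end == start or not 0 <= end < n_t + len(candidates):
        raise ValueError("invalid ending node")
    new_types = list(node_types)
    new_edges = set(edges)
    if end >= n_t:                        # ending node comes from the candidate set
        new_types.append(candidates[end - n_t])
        end = n_t                         # index of the freshly added node
    new_edges.add(frozenset((start, end)))
    return new_types, new_edges

C = ["C", "N", "O"]                       # toy candidate set
types, edges = ["C"], set()               # initial graph: a single carbon node
types, edges = apply_action(types, edges, C, start=0, end=3)   # attach a new O
types, edges = apply_action(types, edges, C, start=0, end=3)   # attach a new N
types, edges = apply_action(types, edges, C, start=1, end=2)   # close a triangle
```

Because edges are stored in a set, repeating an existing edge leaves the graph unchanged here. In the framework described above, such invalid actions are instead discouraged through the reward.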

3.3. Graph Generator

For step $t$, the graph generator $g_{\theta}(\cdot)$ incorporates the partially generated graph $G_{t}$ and the candidate set $C$ to predict the probabilities of different actions, denoted as $p_{t,start}$ and $p_{t,end}$. Assuming there are $n_{t}$ nodes in $G_{t}$ and $k$ nodes in $C$, both $p_{t,start}$ and $p_{t,end}$ have dimensionality $n_{t}+k$. Then the action $a_{t}=(a_{t,start},a_{t,end})$ is sampled from the probabilities $p_{t}=(p_{t,start},p_{t,end})$. Next, we can obtain the new graph $G_{t+1}$ based on the action $a_{t}$. Specifically, in our generator, we first employ several graph convolutional layers to aggregate neighborhood information and learn node features. Mathematically, it can be written as

(4)

\widehat{X}=\mbox{GCNs}(G_{t},C),

where $\widehat{X}$ denotes the learnt node features. Note that the graph $G_{t}$ and the candidate set $C$ are combined as the input of GCNs. We merge all nodes in $C$ into $G_{t}$ without adding any edge and then obtain the new node feature matrix and adjacency matrix. Then Multilayer Perceptrons (MLPs) are used to predict the probabilities of the starting node, $p_{t,start}$, and the action $a_{t,start}$ is sampled from this probability distribution. Mathematically, it can be written as

(5)

p_{t,start}=\mbox{Softmax}(\mbox{MLPs}(\widehat{X})),

(6)

a_{t,start}\sim p_{t,start}\cdot m_{t,start},

where $\cdot$ means element-wise product and $m_{t,start}$ is to mask out all candidate nodes since the starting node can be only selected from the current graph $G_{t}$ . Let $\widehat{x}_{start}$ denote the features of the node selected by the start action $a_{t,start}$ . Then conditioned on the selected node, we employ the second MLPs to compute the probability distribution of the ending node $p_{t,end}$ from which we sample the ending node action $a_{t,end}$ . Note that since the starting node and the ending node cannot be the same, we apply a mask $m_{t,end}$ to mask out the node selected by $a_{t,start}$ . Mathematically, it can be written as

(7)

p_{t,end}=\mbox{Softmax}(\mbox{MLPs}([\widehat{X},\widehat{x}_{start}])),

(8)

a_{t,end}\sim p_{t,end}\cdot m_{t,end},

where $[\cdot,\cdot]$ denotes broadcasting and concatenation. In addition, $m_{t,end}$ is the mask consisting of all 1s except the position indicating $a_{t,start}$. Note that the same graph generator $g_{\theta}(\cdot)$ is shared by different time steps, and our generator is capable of handling graphs with variable sizes.
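
One generation step following Equations (4) to (8) can be sketched in NumPy as below. This is a hedged sketch, not the authors' code: the weights are random and untrained, each "MLPs" block is reduced to a single linear layer, ReLU is assumed as the GCN activation, and the masked probabilities are renormalized before sampling (the text only specifies the product of the probabilities and the mask). All function and parameter names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def gcn(X, A, W):
    A_hat = A + np.eye(len(A))
    d = A_hat.sum(axis=1) ** -0.5
    return np.maximum(0.0, (d[:, None] * A_hat * d[None, :]) @ X @ W)  # ReLU

def masked_sample(p, mask):
    q = p * mask
    q = q / q.sum()                        # renormalization is an assumption
    return int(rng.choice(len(q), p=q))

def generator_step(X_t, A_t, X_cand, params):
    n_t, k = len(X_t), len(X_cand)
    # Merge G_t and C: stack features, pad adjacency with isolated candidates.
    X = np.vstack([X_t, X_cand])
    A = np.zeros((n_t + k, n_t + k))
    A[:n_t, :n_t] = A_t
    X_hat = gcn(gcn(X, A, params["W1"]), A, params["W2"])            # Eq. (4)
    p_start = softmax(X_hat @ params["w_start"])                     # Eq. (5)
    m_start = np.concatenate([np.ones(n_t), np.zeros(k)])            # hide candidates
    a_start = masked_sample(p_start, m_start)                        # Eq. (6)
    x_start = np.broadcast_to(X_hat[a_start], X_hat.shape)
    p_end = softmax(np.hstack([X_hat, x_start]) @ params["w_end"])   # Eq. (7)
    m_end = np.ones(n_t + k)
    m_end[a_start] = 0.0                                             # no self-loop
    a_end = masked_sample(p_end, m_end)                              # Eq. (8)
    return a_start, a_end, p_start, p_end

d, h = 3, 8                                # 3 node types, hidden width 8 (assumed)
params = {"W1": rng.normal(size=(d, h)), "W2": rng.normal(size=(h, h)),
          "w_start": rng.normal(size=h), "w_end": rng.normal(size=2 * h)}
X_t = np.eye(d)[[0, 0, 1, 2]]              # 4 nodes in G_t, one-hot types
A_t = np.array([[0, 1, 0, 0], [1, 0, 1, 0],
                [0, 1, 0, 1], [0, 0, 1, 0]], dtype=float)
X_cand = np.eye(d)                         # one candidate node per type
a_start, a_end, p_start, p_end = generator_step(X_t, A_t, X_cand, params)
```

The same `params` are reused whatever the number of rows in `X_t`, which is how a single generator can be shared across steps and graph sizes.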

We illustrate our graph generator in Figure 2, where we show the graph generation procedure for one step. The current graph $G_{t}$ consists of 4 nodes and the candidate set has 3 available nodes. They are combined to serve as the input of the graph generator. The embeddings of candidate nodes are concatenated to the feature matrix of $G_{t}$ while the adjacency matrix of $G_{t}$ is expanded accordingly. Then multiple graph convolutional layers are employed to learn features for all nodes. With the first MLPs, we obtain the probabilities of selecting different nodes as the starting node, from which we sample node 1 as the starting node. Then, based on the features of node 1 and all node features, the second MLPs predict the ending node. We sample from the probabilities and select node 7 as the ending node, which corresponds to the red node in the candidate set. Finally, a new graph is obtained by including a red node and connecting it with node 1.

3.4. Training the Graph Generator

The graph generator is trained to generate specific graphs that can maximize the class score of class $c_{i}$ and be valid with respect to graph rules. Since such guidance is not differentiable, we employ policy gradient (Sutton et al., 2000) to train the generator. According to (Lei et al., 2016; Yu et al., 2017), the loss function for the action $a_{t}$ at step $t$ can be mathematically written as

(9)

\mathcal{L}_{g}=-R_{t}(\mathcal{L}_{CE}(p_{t,start},a_{t,start})+\mathcal{L}_{CE}(p_{t,end},a_{t,end})),

where $\mathcal{L}_{CE}(\cdot,\cdot)$ denotes the cross entropy loss and $R_{t}$ denotes the reward function for step $t$. Intuitively, the reward $R_{t}$ indicates whether $a_{t}$ has a large chance of generating a graph that is valid and has a high class score for class $c_{i}$. Hence, the reward $R_{t}$ consists of two parts. The first part $R_{t,f}$ is the feedback from the trained model $f(\cdot)$ and the second part $R_{t,r}$ is from the graph rules. Specifically, for step $t$, the reward $R_{t,f}$ contains both an intermediate reward and a final graph reward for graph $G_{t+1}$:

(10)

R_{t,f}=R_{t,f}(G_{t+1})+\lambda_{1}\frac{\sum_{i=1}^{m}R_{t,f}(\mbox{Rollout}({G_{t+1}}))}{m},

where $\lambda_{1}$ is a hyper-parameter, and the first term is the intermediate reward which can be obtained by feeding $G_{t+1}$ to the trained GNNs $f(\cdot)$ and checking the predicted probability for class $c_{i}$ . Mathematically, it can be computed as

(11)

R_{t,f}(G_{t+1})=p(f(G_{t+1})=c_{i})-1/\ell,

where $\ell$ denotes the number of possible classes for $f(\cdot)$ . In addition, the second term in Equation (10) is the final graph reward for $G_{t+1}$ which can be obtained by performing Rollout (Yu et al., 2017) $m$ times on the intermediate graph $G_{t+1}$ . Each time, a final graph is generated based on $G_{t+1}$ until termination and then evaluated by $f(\cdot)$ using Equation (11). Then the evaluations for $m$ final graphs are averaged to serve as the final graph reward. Overall, $R_{t,f}$ is positive when the obtained graph tends to yield high score for class $c_{i}$ , and vice versa.
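
The model feedback $R_{t,f}$ in Equations (10) and (11) can be sketched as follows. The functions `intermediate_reward` and `model_reward` mirror the two equations. Everything below the marked comment is a hypothetical stand-in needed only to run the sketch: a toy two-class "classifier" that scores a connected graph by whether it contains a cycle, and a random rollout policy in place of the generator-driven Rollout of Yu et al. (2017).

```python
import random

def intermediate_reward(prob_ci, num_classes):
    """Equation (11): predicted probability of the target class minus 1/l."""
    return prob_ci - 1.0 / num_classes

def model_reward(graph, class_prob, rollout, num_classes, lam1=1.0, m=10):
    """Equation (10): intermediate reward plus lam1 times the mean rollout reward."""
    r_now = intermediate_reward(class_prob(graph), num_classes)
    finals = [rollout(graph) for _ in range(m)]
    r_final = sum(intermediate_reward(class_prob(g), num_classes) for g in finals) / m
    return r_now + lam1 * r_final

# --- Hypothetical stand-ins, only to make the sketch executable ---
def toy_class_prob(graph):
    """Pretend f(.) gives 0.9 to the 'cyclic' class iff the connected graph has a cycle."""
    n_nodes, edges = graph
    return 0.9 if len(edges) >= n_nodes else 0.1

def random_rollout(graph, max_steps=3, rng=random.Random(0)):
    """Finish the graph by adding random edges between existing nodes."""
    n_nodes, edges = graph
    edges = set(edges)
    for _ in range(max_steps):
        u, v = rng.sample(range(n_nodes), 2)
        edges.add(frozenset((u, v)))
    return n_nodes, edges

path = (3, {frozenset((0, 1)), frozenset((1, 2))})
triangle = (3, path[1] | {frozenset((0, 2))})
r_path = model_reward(path, toy_class_prob, random_rollout, num_classes=2)
r_triangle = model_reward(triangle, toy_class_prob, random_rollout, num_classes=2)
```

Subtracting $1/\ell$ centers the reward at chance level, so a graph that the classifier considers unlikely to belong to $c_{i}$ contributes a negative value.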

Algorithm 1 The algorithm of our proposed XGNN.

1: Given the trained GNNs for graph classification, denoted as $f(\cdot)$, we try to interpret it and set the target class as $c_{i}$.
2: Let $C$ define the candidate node set and $g(\cdot)$ mean our graph generator. We predefine the maximum generation step as $S_{max}$ and the number of Rollout as $m$.
3: Define the initial graph as $G_{1}$.
4: for step $t=1,\ldots,S_{max}$ do
5:   Merge the current graph $G_{t}$ and the candidate set $C$.
6:   Obtain the action $a_{t}$ from the generator $g(\cdot)$ that $a_{t}=(a_{t,start},a_{t,end})$ with Equations (4)-(8).
7:   Obtain the new graph $G_{t+1}$ based on $a_{t}$.
8:   Evaluate $G_{t+1}$ with Equations (10)-(12) and obtain $R_{t}$.
9:   Update the generator $g(\cdot)$ with Equation (9).
10:   if $R_{t}<0$ then roll back and set $G_{t+1}=G_{t}$.
11:   end if
12: end for

In addition, the reward $R_{t,r}$ is obtained from graph rules and is employed to encourage the generated graphs to be valid and human-intelligible. The first rule we employ is that only one edge is allowed to be added between any two nodes. Second, the generated graph cannot contain more nodes than the predefined maximum node number. In addition, we incorporate dataset-specific rules to guide the graph generation. For example, in a chemical dataset, each node represents an atom so that its degree cannot exceed the valency of the corresponding atom. When any of these rules is violated, a negative reward will be applied for $R_{t,r}$. Finally, by combining $R_{t,f}$ and $R_{t,r}$, we can obtain the reward for step $t$:

(12)

R_{t}=R_{t,f}(G_{t+1})+\lambda_{1}\frac{\sum_{i=1}^{m}R_{t,f}(\mbox{Rollout}({G_{t+1}}))}{m}+\lambda_{2}R_{t,r},

where $\lambda_{1}$ and $\lambda_{2}$ are hyper-parameters. We illustrate the training procedure in Algorithm 1. Note that we roll back the graph $G_{t+1}$ to $G_{t}$ when the action $a_{t}$ is evaluated as not promising, i.e., when $R_{t}<0$.
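
The rule-based reward, the combination in Equation (12), and the roll-back test can be sketched as below. The penalty value of -1, the function names, and the small valency table are assumptions chosen for illustration. The text only states that a negative reward is applied when a rule is violated.

```python
# Illustrative maximum degrees for a few atom types (assumed values).
MAX_VALENCY = {"C": 4, "N": 3, "O": 2}

def rule_reward(node_types, edges, new_edge, max_nodes, penalty=-1.0):
    """R_{t,r}: `penalty` if G_{t+1} breaks a graph rule, otherwise 0.

    `node_types` lists the nodes of G_{t+1}; `edges` holds the edges of G_t.
    """
    if new_edge in edges:                        # rule 1: one edge per node pair
        return penalty
    if len(node_types) > max_nodes:              # rule 2: node budget
        return penalty
    degree = [0] * len(node_types)
    for edge in edges | {new_edge}:
        for v in edge:
            degree[v] += 1
    if any(degree[i] > MAX_VALENCY[t] for i, t in enumerate(node_types)):
        return penalty                           # rule 3: valency (dataset-specific)
    return 0.0

def total_reward(r_intermediate, r_rollout_mean, r_rule, lam1=1.0, lam2=1.0):
    """Equation (12)."""
    return r_intermediate + lam1 * r_rollout_mean + lam2 * r_rule

def next_state(G_t, G_next, reward):
    """Algorithm 1, line 10: keep G_{t+1} only if the reward is non-negative."""
    return G_t if reward < 0 else G_next

# An oxygen (node 0) already bonded to two carbons; node 3 is a newly added carbon.
types = ["O", "C", "C", "C"]
edges = {frozenset((0, 1)), frozenset((0, 2))}
r_bad = rule_reward(types, edges, frozenset((0, 3)), max_nodes=10)   # third bond on O
r_ok = rule_reward(types, edges, frozenset((1, 3)), max_nodes=10)    # bond on a carbon
kept = next_state("G_t", "G_next", total_reward(0.3, 0.2, r_bad))
```

With these toy numbers, the invalid action drives the total reward below zero and the step is rolled back, while the valid action keeps the new graph.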

4. Experimental Studies

4.1. Dataset and Experimental Setup

We evaluate our proposed XGNN on both synthetic and real-world datasets. We report the summary statistics of these datasets in Table 1. Since there is no existing work investigating model-level interpretations of GNNs, we have no baseline to compare with. Note that existing studies (Ying et al., 2019; Baldassarre and Azizpour, 2019) only focus on interpreting GNNs at the example-level while ignoring model-level explanations. We do not compare with them since example-level and model-level interpretations are two entirely different directions.
Synthetic dataset: Since our XGNN generates model-level explanations for deep GNNs, we build a synthetic dataset, known as Is_Acyclic, where the ground truth explanations are available. The graphs are labeled based on whether any cycle exists in the graph. The graphs are obtained using the NetworkX software package (Hagberg et al., 2008). The first class refers to cyclic graphs, including grid-like graphs, cycle graphs, wheel graphs, and circular ladder graphs. The second class denotes acyclic graphs, containing star-like graphs, binary tree graphs, path graphs, and full r-ary tree graphs (Storer, 2012). Note that all nodes in this dataset are unlabeled and we focus on investigating the ability of GNNs to capture graph structures.
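
A dataset of this kind could be assembled with NetworkX roughly as follows. This is a sketch under assumptions: the exact generator calls, graph sizes, and number of graphs used for Is_Acyclic are not given in the text, so the size range, the label encoding (0 for cyclic, 1 for acyclic), and the function name `build_is_acyclic` are illustrative.

```python
import networkx as nx

def build_is_acyclic(sizes=range(3, 8)):
    """Return a list of (graph, label) pairs; label 0 = cyclic, 1 = acyclic."""
    data = []
    for n in sizes:
        cyclic = [nx.grid_2d_graph(n, n), nx.cycle_graph(n),
                  nx.wheel_graph(n + 1), nx.circular_ladder_graph(n)]
        acyclic = [nx.star_graph(n), nx.balanced_tree(2, n - 2),
                   nx.path_graph(n), nx.full_rary_tree(3, n)]
        data += [(g, 0) for g in cyclic] + [(g, 1) for g in acyclic]
    return data

dataset = build_is_acyclic()
# Sanity check of the ground truth: a graph is acyclic exactly when it is a forest.
labels_ok = all((not nx.is_forest(g)) == (label == 0) for g, label in dataset)
```
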
Real-world dataset: We conduct experiments on the real-world dataset MUTAG. The MUTAG dataset contains graphs representing chemical compounds where nodes represent different atoms and edges represent chemical bonds. The graphs are labeled into two different classes according to their mutagenic effect on a bacterium (Debnath et al., 1991). Each node is labeled based on its type of atom and there are seven possible atom types: Carbon, Nitrogen, Oxygen, Fluorine, Iodine, Chlorine, Bromine. Note that the edge labels are ignored for simplicity. For this dataset, we investigate the ability of GNNs to capture both graph structures and node labels.
Graph classification models: We train graph classification models using these datasets and then try to explain these models. These models share a similar pipeline that first learns node features using multiple GCN layers, then obtains graph-level embeddings by averaging all node features, and finally employs fully-connected layers to perform graph classification. For the synthetic dataset Is_Acyclic, we use the node degrees as the initial features for all nodes. Then we apply two GCN layers with output dimensions equal to 8 and 16, respectively, and perform global averaging to obtain the graph representations. Finally, we employ one fully-connected layer as the classifier. Meanwhile, for the real-world dataset MUTAG, since all nodes are labeled, we employ the corresponding one-hot representations as the initial node features. Then we employ three GCN layers with output dimensions equal to 32, 48, and 64, respectively, and average all node features. The final classifier contains two fully-connected layers in which the hidden dimension is set to 32. Note that for all GCN layers, we apply the GCN version shown in Equation (1). In addition, we employ Sigmoid as the non-linear function in GCNs for dataset Is_Acyclic while we use ReLU for dataset MUTAG. These models are implemented using PyTorch (Paszke et al., 2017) and trained using the Adam optimizer (Kingma and Ba, 2014). The training accuracies of these models are reported in Table 1, which shows that the models we try to interpret achieve reasonable performance.
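
The feature-learning part of the Is_Acyclic classifier (degree features, two GCN layers with output dimensions 8 and 16, Sigmoid, global averaging) can be sketched as follows. This is an illustrative forward pass only: it assumes that Equation (1) is the normalized propagation rule of Kipf and Welling (2017), uses constant placeholder weights rather than trained ones, omits the final fully-connected classifier, and is written in plain Python rather than PyTorch so that it runs without dependencies.

```python
import math
from typing import Callable, List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of two nested-list matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def gcn_layer(adj: Matrix, x: Matrix, w: Matrix, act: Callable[[float], float]) -> Matrix:
    """One propagation step: act(D^{-1/2} (A + I) D^{-1/2} X W) (assumed form)."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    d_inv_sqrt = [1.0 / math.sqrt(sum(row)) for row in a_hat]
    a_norm = [[d_inv_sqrt[i] * a_hat[i][j] * d_inv_sqrt[j] for j in range(n)]
              for i in range(n)]
    return [[act(v) for v in row] for row in matmul(matmul(a_norm, x), w)]


def mean_pool(x: Matrix) -> List[float]:
    """Graph-level embedding: average of all node feature vectors."""
    return [sum(col) / len(x) for col in zip(*x)]


def sigmoid(v: float) -> float:
    return 1.0 / (1.0 + math.exp(-v))


# A triangle graph with node degrees as the initial one-dimensional features.
adj = [[0.0, 1.0, 1.0], [1.0, 0.0, 1.0], [1.0, 1.0, 0.0]]
x0 = [[sum(row)] for row in adj]           # shape 3 x 1
w1 = [[0.1] * 8]                           # placeholder weights, shape 1 x 8
w2 = [[0.1] * 16 for _ in range(8)]        # placeholder weights, shape 8 x 16

h = gcn_layer(adj, gcn_layer(adj, x0, w1, sigmoid), w2, sigmoid)
graph_embedding = mean_pool(h)             # one 16-dimensional vector per graph
```

Because mean pooling produces a fixed-size vector regardless of the number of nodes, the same classifier can score the small generated graphs and the much larger dataset graphs.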

Table 1. Statistics and properties of datasets. Note that the edge number and node number are averaged numbers.

Dataset	Classes	# of Edges	# of Nodes	Accuracy
Is_Acyclic	2	30.04	28.46	0.978
MUTAG	2	19.79	17.93	0.963

Graph generators: For both datasets, our graph generators share the same structure. Our generator first employs a fully-connected layer to map node features to the dimension of 8. Then three GCN layers are employed with output dimensions equal to 16, 24, and 32, respectively. The first MLP consists of two fully-connected layers with the hidden dimension equal to 16 and a ReLU6 non-linear function. The second MLP also has two fully-connected layers, with the hidden dimension set to 24 and ReLU6 applied. The initial features for input graphs are the same as mentioned above. For dataset Is_Acyclic, we set $\lambda_{1}=1$, $\lambda_{2}=1$, and $R_{t,r}=-1$ if the generated graph violates any graph rule. For dataset MUTAG, we set $\lambda_{1}=1$, $\lambda_{2}=2$, and the total reward $R_{t}=-1$ if the generated graph violates any graph rule. In addition, we perform rollout $m=10$ times at each step to obtain final graph rewards. The models are implemented using PyTorch (Paszke et al., 2017) and trained using the Adam optimizer (Kingma and Ba, 2014) with $\beta_{1}=0.9$ and $\beta_{2}=0.999$. The learning rate for graph generator training is set to 0.01.
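
The two datasets handle rule violations slightly differently: for Is_Acyclic only the rule term $R_{t,r}$ is set to $-1$, whereas for MUTAG the whole step reward $R_{t}$ is set to $-1$. The sketch below makes that difference explicit. The names `RewardConfig` and `combine` are hypothetical, the feedback values are made up, and the choice $R_{t,r}=0$ when no rule is violated is an assumption that the text does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RewardConfig:
    lam1: float
    lam2: float
    override_total_on_violation: bool  # True: R_t = -1; False: only R_{t,r} = -1
    m: int = 10        # number of rollouts per step
    lr: float = 0.01   # learning rate for generator training


IS_ACYCLIC = RewardConfig(lam1=1.0, lam2=1.0, override_total_on_violation=False)
MUTAG = RewardConfig(lam1=1.0, lam2=2.0, override_total_on_violation=True)


def combine(cfg: RewardConfig, feedback: float, rollout_mean: float,
            violated: bool) -> float:
    """Step reward given the two feedback terms and a rule-violation flag."""
    if violated and cfg.override_total_on_violation:
        return -1.0
    rule_term = -1.0 if violated else 0.0  # assumed: no penalty when rules hold
    return feedback + cfg.lam1 * rollout_mean + cfg.lam2 * rule_term


# Same (made-up) feedback, with and without a rule violation.
r_acyclic_violation = combine(IS_ACYCLIC, 0.25, 0.25, violated=True)
r_mutag_violation = combine(MUTAG, 0.25, 0.25, violated=True)
r_mutag_valid = combine(MUTAG, 0.25, 0.25, violated=False)
```

In both settings a violation with this feedback yields a negative step reward, so the offending action would be rolled back.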

4.2. Experimental Results on Synthetic Data

We first conduct experiments on the synthetic dataset Is_Acyclic where the ground truth is available. As shown in Table 1, the trained GNN classifier reaches a promising performance. Since the dataset is built synthetically according to whether the graph contains any cycle, we can check whether the trained GNN classifier makes predictions in such a way. We explain the model with our proposed XGNN and report the generated interpretations in Figure 3. We show the explanations for the class “cyclic” in the first row and the results for the class “acyclic” in the second row. In addition, we also report different generated explanations obtained by setting different maximum node limits.

First, by comparing the graphs generated for different classes, we can easily see that the explanations for the class “cyclic” always contain cycles while the results for the class “acyclic” contain no cycle at all. Second, to verify whether our explanations can maximize the class probability for a certain class, as shown in Equation (2), we feed each generated graph to the trained GNN classifier and report the predicted probability for the corresponding class. The results show that our generated graph patterns consistently yield high predicted probabilities. Note that even though the graph obtained for the class “cyclic” with maximum node number equal to 3 only leads to $p=0.7544$, it is still the highest probability among all possible graphs with 3 nodes. Finally, based on these results, we can understand what patterns maximize the predicted probabilities for different classes. In our results, the trained GNN classifier very likely distinguishes different classes by detecting cyclic structures, which is consistent with our expectations. Hence, such explanations help understand the model and increase its trustworthiness as a detector of cyclic graphs. In addition, it is noteworthy that our generated graphs are easier to analyze than the graphs in the dataset. Our generated graphs have significantly fewer nodes and simpler structures, and yield higher predicted probabilities, while the graphs from the dataset have an average of 28 nodes and 30 edges, as shown in Table 1.

4.3. Experimental Results on Real-World Data

We also evaluate our proposed XGNN using real-world data. For dataset MUTAG, there is no ground truth for the interpretations. Since all nodes are labeled as different types of atoms, we investigate whether the trained GNN classifier can capture both graph structures and node labels. We interpret the trained GNN with our proposed method and report selected results in Figure 4 and Figure 5. Note that the generated graphs may not represent real chemical compounds because, for simplicity, we only incorporate a simple chemical rule that the degree of an atom cannot exceed its maximum chemical valency. In addition, since nodes are labeled, we can set the initial graphs as different types of atoms.
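
The chemical rule used here, that the degree of an atom cannot exceed its maximum valency, amounts to a simple degree check because edge labels (bond types) are ignored. The following is a minimal sketch of such a check. The valency table `MAX_VALENCY` holds assumed illustrative values (the text does not list the values actually used), and `valency_violations` is a hypothetical helper name.

```python
from collections import Counter
from typing import List, Tuple

# Assumed illustrative valencies for the seven MUTAG atom types.
MAX_VALENCY = {"C": 4, "N": 3, "O": 2, "F": 1, "I": 1, "Cl": 1, "Br": 1}


def valency_violations(atoms: List[str], edges: List[Tuple[int, int]]) -> List[int]:
    """Return the indices of atoms whose degree exceeds their assumed valency."""
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return [i for i, atom in enumerate(atoms) if degree[i] > MAX_VALENCY[atom]]


# A carbon bonded to a nitro group: C-N, N-O, N-O (every degree is within limits).
nitro_ok = valency_violations(["C", "N", "O", "O"], [(0, 1), (1, 2), (1, 3)])

# A chlorine bridging two carbons: degree 2 exceeds the assumed valency of 1.
chlorine_bad = valency_violations(["C", "Cl", "C"], [(0, 1), (1, 2)])
```

A non-empty result would correspond to a rule violation and hence to the negative reward described for MUTAG.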

We first set the initial graph as a single carbon atom and report the results in Figure 4, since generally, any organic compound contains carbon (Seager and Slabaugh, 2013). The first row reports explanations for the class “non-mutagenic” while the second row shows the results for the class “mutagenic”. We report the generated graphs with different node limits and the GNN predicted probabilities. For the class “mutagenic”, we observe that carbon rings and $NO_{2}$ groups are common patterns, which is consistent with the chemical fact that carbon rings and $NO_{2}$ chemical groups are mutagenic (Debnath et al., 1991). Such observations indicate that the trained GNN classifier may capture these key graph patterns to make predictions. In addition, for the class “non-mutagenic”, we observe that the atom Chlorine appears frequently in the generated graphs and that the combination of Chlorine, Bromine, and Fluorine always leads to “non-mutagenic” predictions. By analyzing such explanations, we can better understand the trained GNN model.

We also explore different initial graphs and report the results in Figure 5. We fix the maximum node limit as 5 and generate explanations for the class “mutagenic”. First, no matter how we set the initial graph, our proposed method can always find graph patterns maximizing the predicted probability of class “mutagenic”. For the first five graphs, where the initial graph is set to a single node of Carbon, Nitrogen, Oxygen, Iodine, or Fluorine, some generated graphs still share common patterns such as carbon rings and $NO_{2}$ chemical groups. These observations further confirm that these key patterns are captured by the trained GNNs. In addition, we notice that the generator can still produce graphs with Chlorine that are predicted as “mutagenic”, which is contrary to our observation above. If all graphs with Chlorine should be identified as “non-mutagenic”, such explanations reveal the limitations of the trained GNNs. These generated explanations can then provide guidance for improving the trained GNNs. For example, we may place more emphasis on the graphs containing Chlorine when training the GNNs. Furthermore, the generated explanations may also be used to retrain and improve the GNN models so that they correctly capture our desired patterns. Overall, the experimental results show that our proposed interpretation method XGNN can help verify, understand, and even improve the trained GNN models.

5. Conclusions

Graph neural networks have been widely studied recently and have shown great performance on multiple graph tasks. However, graph models are still treated as black-boxes and hence cannot be fully trusted. This raises the need to investigate interpretation techniques for graph neural networks. It is still a less explored area, where existing methods focus only on example-level explanations for graph models. None of the existing work investigates model-level interpretations of graph models, which are more general and high-level. Hence, in this work, we propose a novel method, XGNN, to interpret graph models at the model level. Specifically, we propose to find graph patterns that can maximize a certain prediction via graph generation. We formulate it as a reinforcement learning problem and generate graph patterns iteratively. We train a graph generator and, for each step, it predicts how to add an edge into the current graph. In addition, we incorporate several graph rules to encourage the generated graphs to be valid and human-intelligible. Finally, we conduct experiments on both synthetic and real-world datasets to demonstrate the effectiveness of our proposed XGNN. Experimental results show that the generated graphs help discover what patterns will maximize a certain prediction of the trained GNNs. The generated explanations help verify and better understand whether the trained GNNs make predictions in our expected way. Furthermore, our results also show that the generated explanations can help improve the trained models.

ACKNOWLEDGMENTS

This work was supported in part by National Science Foundation grants DBI-2028361, IIS-1714741, IIS-1715940, IIS-1845081, IIS-1900990 and Defense Advanced Research Projects Agency grant N66001-17-2-4031.

References

Alon (2006) Uri Alon. 2006. An introduction to systems biology: design principles of biological circuits. Chapman and Hall/CRC.
Alon (2007) Uri Alon. 2007. Network motifs: theory and experimental approaches. Nature Reviews Genetics 8, 6 (2007), 450.
Bach et al. (2015) Sebastian Bach, Alexander Binder, Grégoire Montavon, Frederick Klauschen, Klaus-Robert Müller, and Wojciech Samek. 2015. On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PloS one 10, 7 (2015).
Baldassarre and Azizpour (2019) Federico Baldassarre and Hossein Azizpour. 2019. Explainability Techniques for Graph Convolutional Networks. In International Conference on Machine Learning (ICML) Workshops, 2019 Workshop on Learning and Reasoning with Graph-Structured Representations.
Dabkowski and Gal (2017) Piotr Dabkowski and Yarin Gal. 2017. Real time image saliency for black box classifiers. In Advances in Neural Information Processing Systems. 6967–6976.
Debnath et al. (1991) Asim Kumar Debnath, Rosa L Lopez de Compadre, Gargi Debnath, Alan J Shusterman, and Corwin Hansch. 1991. Structure-activity relationship of mutagenic aromatic and heteroaromatic nitro compounds. correlation with molecular orbital energies and hydrophobicity. Journal of medicinal chemistry 34, 2 (1991), 786–797.
Doshi-Velez and Kim (2017) Finale Doshi-Velez and Been Kim. 2017. Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608 (2017).
Erhan et al. (2009) Dumitru Erhan, Yoshua Bengio, Aaron Courville, and Pascal Vincent. 2009. Visualizing higher-layer features of a deep network. Technical Report, University of Montreal 1341, 3 (2009), 1.
Fong and Vedaldi (2017) Ruth C Fong and Andrea Vedaldi. 2017. Interpretable explanations of black boxes by meaningful perturbation. In Proceedings of the IEEE International Conference on Computer Vision. 3429–3437.
Gao and Ji (2019a) Hongyang Gao and Shuiwang Ji. 2019a. Graph representation learning via hard and channel-wise attention networks. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 741–749.
Gao and Ji (2019b) Hongyang Gao and Shuiwang Ji. 2019b. Graph U-Net. In International conference on machine learning. 2083–2092.
Gevrey et al. (2003) Muriel Gevrey, Ioannis Dimopoulos, and Sovan Lek. 2003. Review and comparison of methods to study the contribution of variables in artificial neural network models. Ecological modelling 160, 3 (2003), 249–264.
Gilmer et al. (2017) Justin Gilmer, Samuel S Schoenholz, Patrick F Riley, Oriol Vinyals, and George E Dahl. 2017. Neural message passing for quantum chemistry. In Proceedings of the 34th International Conference on Machine Learning-Volume 70. JMLR. org, 1263–1272.
Guimaraes et al. (2017) Gabriel Lima Guimaraes, Benjamin Sanchez-Lengeling, Carlos Outeiral, Pedro Luis Cunha Farias, and Alán Aspuru-Guzik. 2017. Objective-reinforced generative adversarial networks (ORGAN) for sequence generation models. arXiv preprint arXiv:1705.10843 (2017).
Hagberg et al. (2008) Aric Hagberg, Pieter Swart, and Daniel S Chult. 2008. Exploring network structure, dynamics, and function using NetworkX. Technical Report. Los Alamos National Lab.(LANL), Los Alamos, NM (United States).
Hamilton et al. (2017) Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in neural information processing systems. 1024–1034.
Jin et al. (2018) Wengong Jin, Regina Barzilay, and Tommi Jaakkola. 2018. Junction Tree Variational Autoencoder for Molecular Graph Generation. In Proceedings of the 35th International Conference on Machine Learning. 2323–2332.
Kingma and Ba (2014) Diederik P Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Optimization. In Proceedings of the 3rd International Conference on Learning Representations.
Kipf and Welling (2017) Thomas N Kipf and Max Welling. 2017. Semi-supervised classification with graph convolutional networks. In Proceedings of the International Conference on Learning Representations.
Lee et al. (2019) Junhyun Lee, Inyeop Lee, and Jaewoo Kang. 2019. Self-Attention Graph Pooling. In International Conference on Machine Learning. 3734–3743.
Lei et al. (2016) Tao Lei, Regina Barzilay, and Tommi Jaakkola. 2016. Rationalizing Neural Predictions. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. 107–117.
Li et al. (2018) Yujia Li, Oriol Vinyals, Chris Dyer, Razvan Pascanu, and Peter Battaglia. 2018. Learning deep generative models of graphs. arXiv preprint arXiv:1803.03324 (2018).
Milo et al. (2002) Ron Milo, Shai Shen-Orr, Shalev Itzkovitz, Nadav Kashtan, Dmitri Chklovskii, and Uri Alon. 2002. Network motifs: simple building blocks of complex networks. Science 298, 5594 (2002), 824–827.
Nguyen et al. (2017) Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, and Jason Yosinski. 2017. Plug & play generative networks: Conditional iterative generation of images in latent space. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4467–4477.
Nguyen et al. (2015) Anh Nguyen, Jason Yosinski, and Jeff Clune. 2015. Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 427–436.
Olah et al. (2017) Chris Olah, Alexander Mordvintsev, and Ludwig Schubert. 2017. Feature visualization. Distill 2, 11 (2017), e7.
Paszke et al. (2017) Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic differentiation in PyTorch. In Proceedings of the International Conference on Learning Representations.
Seager and Slabaugh (2013) Spencer L Seager and Michael R Slabaugh. 2013. Chemistry for today: General, organic, and biochemistry. Cengage learning.
Selvaraju et al. (2017) Ramprasaath R Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. 2017. Grad-CAM: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE International Conference on Computer Vision. 618–626.
Shen-Orr et al. (2002) Shai S Shen-Orr, Ron Milo, Shmoolik Mangan, and Uri Alon. 2002. Network motifs in the transcriptional regulation network of Escherichia coli. Nature genetics 31, 1 (2002), 64.
Simonyan et al. (2013) Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. 2013. Deep inside convolutional networks: Visualising image classification models and saliency maps. arXiv preprint arXiv:1312.6034 (2013).
Smilkov et al. (2017) Daniel Smilkov, Nikhil Thorat, Been Kim, Fernanda Viégas, and Martin Wattenberg. 2017. Smoothgrad: removing noise by adding noise. arXiv preprint arXiv:1706.03825 (2017).
Springenberg et al. (2014) Jost Tobias Springenberg, Alexey Dosovitskiy, Thomas Brox, and Martin Riedmiller. 2014. Striving for simplicity: The all convolutional net. arXiv preprint arXiv:1412.6806 (2014).
Storer (2012) James Andrew Storer. 2012. An introduction to data structures and algorithms. Springer Science & Business Media.
Sutton et al. (2000) Richard S Sutton, David A McAllester, Satinder P Singh, and Yishay Mansour. 2000. Policy gradient methods for reinforcement learning with function approximation. In Advances in neural information processing systems. 1057–1063.
Thekumparampil et al. (2018) Kiran K Thekumparampil, Chong Wang, Sewoong Oh, and Li-Jia Li. 2018. Attention-based graph neural network for semi-supervised learning. arXiv preprint arXiv:1803.03735 (2018).
Veličković et al. (2018) Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. 2018. Graph Attention Networks. In International Conference on Learning Representations. https://openreview.net/forum?id=rJXMpikCZ
Wang et al. (2018) Hongwei Wang, Jia Wang, Jialin Wang, Miao Zhao, Weinan Zhang, Fuzheng Zhang, Xing Xie, and Minyi Guo. 2018. GraphGAN: Graph representation learning with generative adversarial nets. In Thirty-Second AAAI Conference on Artificial Intelligence. 2508–2515.
Xu et al. (2019) Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka. 2019. How Powerful are Graph Neural Networks?. In International Conference on Learning Representations. https://openreview.net/forum?id=ryGs6iA5Km
Ying et al. (2019) Zhitao Ying, Dylan Bourgeois, Jiaxuan You, Marinka Zitnik, and Jure Leskovec. 2019. GNNExplainer: Generating Explanations for Graph Neural Networks. In Advances in Neural Information Processing Systems 32. 9244–9255.
You et al. (2018) Jiaxuan You, Bowen Liu, Zhitao Ying, Vijay Pande, and Jure Leskovec. 2018. Graph convolutional policy network for goal-directed molecular graph generation. In Advances in Neural Information Processing Systems. 6410–6421.
Yu et al. (2017) L Yu, W Zhang, J Wang, and Y Yu. 2017. Seqgan: sequence generative adversarial nets with policy gradient. In AAAI-17: Thirty-First AAAI Conference on Artificial Intelligence, Vol. 31. Association for the Advancement of Artificial Intelligence (AAAI), 2852–2858.
Yuan et al. (2019) Hao Yuan, Yongjun Chen, Xia Hu, and Shuiwang Ji. 2019. Interpreting Deep Models for Text Analysis via Optimization and Regularization Methods. In Thirty-Third AAAI Conference on Artificial Intelligence. 5717–5724.
Yuan and Ji (2020) Hao Yuan and Shuiwang Ji. 2020. StructPool: Structured Graph Pooling via Conditional Random Fields. In International Conference on Learning Representations. https://openreview.net/forum?id=BJxg_hVtwH
Zeiler and Fergus (2014) Matthew D Zeiler and Rob Fergus. 2014. Visualizing and understanding convolutional networks. In European Conference on Computer Vision. Springer, 818–833.
Zhang and Chen (2018) Muhan Zhang and Yixin Chen. 2018. Link prediction based on graph neural networks. In Advances in Neural Information Processing Systems. 5165–5175.
Zhang et al. (2018) Muhan Zhang, Zhicheng Cui, Marion Neumann, and Yixin Chen. 2018. An end-to-end deep learning architecture for graph classification. In Thirty-Second AAAI Conference on Artificial Intelligence. 4438–4445.
Zhou et al. (2016) Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba. 2016. Learning deep features for discriminative localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2921–2929.
