skip to main content
10.1145/3534678.3539347acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Public Access

Graph Rationalization with Environment-based Augmentations

Authors Info & Claims
Published:14 August 2022Publication History

ABSTRACT

Rationale is defined as a subset of input features that best explains or supports the prediction by machine learning models. Rationale identification has improved the generalizability and interpretability of neural networks on vision and language data. In graph applications such as molecule and polymer property prediction, identifying representative subgraph structures named as graph rationales plays an essential role in the performance of graph neural networks. Existing graph pooling and/or distribution intervention methods suffer from the lack of examples to learn to identify optimal graph rationales. In this work, we introduce a new augmentation operation called environment replacement that automatically creates virtual data examples to improve rationale identification. We propose an efficient framework that performs rationale-environment separation and representation learning on the real and augmented examples in latent spaces to avoid the high complexity of explicit graph decoding and encoding. Comparing against recent techniques, experiments on seven molecular and four polymer datasets demonstrate the effectiveness and efficiency of the proposed augmentation-based graph rationalization framework. Data and the implementation of the proposed framework are publicly available https://github.com/liugangcode/GREA.

References

  1. Martin Arjovsky, Léon Bottou, Ishaan Gulrajani, and David Lopez-Paz. 2019. Invariant risk minimization. In arXiv:1907.02893 .Google ScholarGoogle Scholar
  2. Shiyu Chang, Yang Zhang, Mo Yu, and Tommi Jaakkola. 2020. Invariant rationalization. In ICML. 1448--1458.Google ScholarGoogle Scholar
  3. Deli Chen, Yankai Lin, Wei Li, Peng Li, Jie Zhou, and Xu Sun. 2020. Measuring and relieving the over-smoothing problem for graph neural networks from the topological view. In AAAI, Vol. 34. 3438--3445.Google ScholarGoogle ScholarCross RefCross Ref
  4. Lihua Chen, Ghanshyam Pilania, Rohit Batra, Tran Doan Huan, Chiho Kim, Christopher Kuenneth, and Rampi Ramprasad. 2021. Polymer informatics: Current status and critical next steps. Materials Science and Engineering: R: Reports , Vol. 144 (2021), 100595.Google ScholarGoogle ScholarCross RefCross Ref
  5. Shaohua Fan, Xiao Wang, Chuan Shi, Peng Cui, and Bai Wang. 2021. Generalizing Graph Neural Networks on Out-Of-Distribution Graphs. In arXiv:2111.10657 .Google ScholarGoogle Scholar
  6. Hongyang Gao and Shuiwang Ji. 2021. Graph U-Nets. IEEE TPAMI (2021).Google ScholarGoogle Scholar
  7. Zhichun Guo, Chuxu Zhang, Wenhao Yu, John Herr, Olaf Wiest, Meng Jiang, and Nitesh V Chawla. 2021. Few-Shot Graph Learning for Molecular Property Prediction. In WWW. 2559--2567.Google ScholarGoogle Scholar
  8. William L Hamilton, Rex Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In NeurIPS. 1025--1035.Google ScholarGoogle Scholar
  9. Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. 2020. Open Graph Benchmark: Datasets for Machine Learning on Graphs. In NeurIPS .Google ScholarGoogle Scholar
  10. Meng Jiang, Taeho Jung, Ryan Karl, and Tong Zhao. 2022. Federated Dynamic Graph Neural Networks with Secure Aggregation for Video-based Distributed Surveillance. TIST , Vol. 13, 4 (2022), 1--23.Google ScholarGoogle Scholar
  11. Chiho Kim, Anand Chandrasekaran, Tran Doan Huan, Deya Das, and Rampi Ramprasad. 2018. Polymer genome: a data-powered polymer informatics platform for property predictions. The Journal of Physical Chemistry C , Vol. 122, 31 (2018), 17575--17585.Google ScholarGoogle ScholarCross RefCross Ref
  12. Thomas N Kipf and Max Welling. 2017. Semi-supervised classification with graph convolutional networks. In ICLR .Google ScholarGoogle Scholar
  13. Greg Landrum. 2013. RDKit: A software suite for cheminformatics, computational chemistry, and predictive modeling.Google ScholarGoogle Scholar
  14. Junhyun Lee, Inyeop Lee, and Jaewoo Kang. 2019. Self-attention graph pooling. In ICML. 3734--3743.Google ScholarGoogle Scholar
  15. Haoyang Li, Xin Wang, Ziwei Zhang, and Wenwu Zhu. 2021. OOD-GNN: Out-of-Distribution Generalized Graph Neural Network. In arXiv:2112.03806 .Google ScholarGoogle Scholar
  16. Ruimin Ma, Zeyu Liu, Quanwei Zhang, Zhiyu Liu, and Tengfei Luo. 2019. Evaluating polymer representations via quantifying structure--property relationships. Journal of chemical information and modeling , Vol. 59, 7 (2019), 3110--3119.Google ScholarGoogle ScholarCross RefCross Ref
  17. Ruimin Ma and Tengfei Luo. 2020. PI1M: a benchmark database for polymer informatics. Journal of Chemical Information and Modeling , Vol. 60, 10 (2020), 4684.Google ScholarGoogle ScholarCross RefCross Ref
  18. Yao Ma, Xiaorui Liu, Tong Zhao, Yozen Liu, Jiliang Tang, and Neil Shah. 2021. A unified view on graph neural networks as graph signal denoising. In CIKM . 1202--1211.Google ScholarGoogle Scholar
  19. Diego Mesquita, Amauri Souza, and Samuel Kaski. 2020. Rethinking pooling in graph neural networks. In NeurIPS .Google ScholarGoogle Scholar
  20. Shingo Otsuka, Isao Kuwajima, Junko Hosoya, Yibin Xu, and Masayoshi Yamazaki. 2011. PoLyInfo: Polymer database for polymeric materials design. In International Conference on Emerging Intelligent Data and Web Technologies. 22.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Hyeonjin Park, Seunghun Lee, Sihyeon Kim, Jinyoung Park, Jisu Jeong, Kyung-Min Kim, Jung-Woo Ha, and Hyunwoo J Kim. 2021. Metropolis-Hastings Data Augmentation for Graph Neural Networks. In NeurIPS .Google ScholarGoogle Scholar
  22. Yu Rong, Wenbing Huang, Tingyang Xu, and Junzhou Huang. 2019. DropEdge: Towards Deep Graph Convolutional Networks on Node Classification. In ICLR .Google ScholarGoogle Scholar
  23. Elan Rosenfeld, Pradeep Kumar Ravikumar, and Andrej Risteski. 2021. The Risks of Invariant Risk Minimization. In ICLR .Google ScholarGoogle Scholar
  24. David F Sanders, Zachary P Smith, Ruilan Guo, Lloyd M Robeson, James E McGrath, Donald R Paul, and Benny D Freeman. 2013. Energy-efficient polymeric gas separation membranes for a sustainable future: A review. Polymer , Vol. 54, 18 (2013), 4729--4761.Google ScholarGoogle ScholarCross RefCross Ref
  25. A Thornton, L Robeson, B Freeman, and D Uhlmann. 2012. Polymer Gas Separation Membrane Database.Google ScholarGoogle Scholar
  26. Petar Velivc ković , Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. 2018. Graph Attention Networks. In ICLR .Google ScholarGoogle Scholar
  27. Daheng Wang, Meng Jiang, Munira Syed, Oliver Conway, Vishal Juneja, Sriram Subramanian, and Nitesh V Chawla. 2020 a. Calendar graph neural networks for modeling time structures in spatiotemporal user behaviors. In KDD. 2581--2589.Google ScholarGoogle Scholar
  28. Daheng Wang, Zhihan Zhang, Yihong Ma, Tong Zhao, Tianwen Jiang, Nitesh Chawla, and Meng Jiang. 2021 a. Modeling co-evolution of attributed and structural information in graph sequence. IEEE TKDE (2021).Google ScholarGoogle Scholar
  29. Daheng Wang, Zhihan Zhang, Yihong Ma, Tong Zhao, Tianwen Jiang, Nitesh Chawla, and Meng Jiang. 2021 b. Modeling co-evolution of attributed and structural information in graph sequence. IEEE TKDE (2021).Google ScholarGoogle Scholar
  30. Daheng Wang, Tong Zhao, Nitesh V Chawla, and Meng Jiang. 2021 c. Dynamic Attributed Graph Prediction with Conditional Normalizing Flows. In ICDM. IEEE, 1385--1390.Google ScholarGoogle Scholar
  31. Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, and Bryan Hooi. 2020 b. Graphcrop: Subgraph cropping for graph classification. In arXiv:2009.10564 .Google ScholarGoogle Scholar
  32. Yiwei Wang, Wei Wang, Yuxuan Liang, Yujun Cai, Juncheng Liu, and Bryan Hooi. 2020 c. Nodeaug: Semi-supervised node classification with data augmentation. In KDD . 207--217.Google ScholarGoogle Scholar
  33. Xingfei Wei, Zhi Wang, Zhiting Tian, and Tengfei Luo. 2021. Thermal Transport in Polymers: A Review. Journal of Heat Transfer , Vol. 143, 7 (2021), 072101.Google ScholarGoogle ScholarCross RefCross Ref
  34. Yingxin Wu, Xiang Wang, An Zhang, Xiangnan He, and Tat-Seng Chua. 2022. Discovering Invariant Rationales for Graph Neural Networks. In ICLR .Google ScholarGoogle Scholar
  35. Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. 2020. A comprehensive survey on graph neural networks. IEEE TNNLS , Vol. 32, 1 (2020), 4--24.Google ScholarGoogle Scholar
  36. Zhenqin Wu, Bharath Ramsundar, Evan N Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S Pappu, Karl Leswing, and Vijay Pande. 2018. MoleculeNet: a benchmark for molecular machine learning. Chemical science (2018), 513--530.Google ScholarGoogle Scholar
  37. Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka. 2019. How Powerful are Graph Neural Networks?. In ICLR .Google ScholarGoogle Scholar
  38. Jason Yang, Lei Tao, Jinlong He, Jeffrey McCutcheon, and Ying Li. 2021. Discovery of Innovative Polymers for Next-Generation Gas-Separation Membranes using Interpretable Machine Learning. In chemrxiv-2021-p4g7z .Google ScholarGoogle Scholar
  39. Rex Ying, Dylan Bourgeois, Jiaxuan You, Marinka Zitnik, and Jure Leskovec. 2019. Gnnexplainer: Generating explanations for graph neural networks. In NeurIPS .Google ScholarGoogle Scholar
  40. Rex Ying, Jiaxuan You, Christopher Morris, Xiang Ren, William L Hamilton, and Jure Leskovec. 2018. Hierarchical graph representation learning with differentiable pooling. In NeurIPS . 4805--4815.Google ScholarGoogle Scholar
  41. Yuning You, Tianlong Chen, Yongduo Sui, Ting Chen, Zhangyang Wang, and Yang Shen. 2020. Graph contrastive learning with augmentations. In NeurIPS. 5812--5823.Google ScholarGoogle Scholar
  42. Ziwei Zhang, Peng Cui, and Wenwu Zhu. 2020. Deep learning on graphs: A survey. IEEE TKDE (2020).Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Tong Zhao, Tianwen Jiang, Neil Shah, and Meng Jiang. 2021a. A synergistic approach for graph anomaly detection with pattern mining and feature learning. IEEE TNNLS (2021).Google ScholarGoogle ScholarCross RefCross Ref
  44. Tong Zhao, Gang Liu, Stephan Günnemann, and Meng Jiang. 2022a. Graph Data Augmentation for Graph Machine Learning: A Survey. arXiv preprint arXiv:2202.08871 (2022).Google ScholarGoogle Scholar
  45. Tong Zhao, Gang Liu, Daheng Wang, Wenhao Yu, and Meng Jiang. 2022b. Learning from Counterfactual Links for Link Prediction. ICML (2022).Google ScholarGoogle Scholar
  46. Tong Zhao, Yozen Liu, Leonardo Neves, Oliver Woodford, Meng Jiang, and Neil Shah. 2021b. Data Augmentation for Graph Neural Networks. In AAAI. 11015.Google ScholarGoogle Scholar
  47. Tong Zhao, Bo Ni, Wenhao Yu, Zhichun Guo, Neil Shah, and Meng Jiang. 2021c. Action Sequence Augmentation for Early Graph-based Anomaly Detection. In CIKM . 2668--2678.Google ScholarGoogle Scholar
  48. Jiajun Zhou, Jie Shen, and Qi Xuan. 2020. Data Augmentation for Graph Classification. In CIKM. 2341--2344.Google ScholarGoogle Scholar
  49. Yanqiao Zhu, Yichen Xu, Feng Yu, Qiang Liu, Shu Wu, and Liang Wang. 2021. Graph contrastive learning with adaptive augmentation. In WWW . 2069--2080.Google ScholarGoogle Scholar

Index Terms

  1. Graph Rationalization with Environment-based Augmentations

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
        August 2022
        5033 pages
        ISBN:9781450393850
        DOI:10.1145/3534678

        Copyright © 2022 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 14 August 2022

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader