Abstract
In the area of geographic information processing, there are few researches on geographic text classification. However, the application of this task in Chinese is relatively rare. In our work, we intend to implement a method to extract text containing geographical entities from a large number of network texts. The geographic information in these texts is of great practical significance to transportation, urban and rural planning, disaster relief, and other fields. We use the method of graph convolutional neural network with attention mechanism to achieve this function. Graph attention networks (GAT) is an improvement of graph convolutional neural networks (GCN). Compared with GCN, the advantage of GAT is that the attention mechanism is proposed to weight the sum of the characteristics of adjacent vertices. In addition, We construct a Chinese dataset containing geographical classification from multiple datasets of Chinese text classification. The Macro-F Score of the geoGAT we used reached 95% on the new Chinese dataset.
- Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals, Yujia Li, and Razvan Pascanu. 2018. Relational inductive biases, deep learning, and graph networks. Retrieved from https://arXiv:cs.LG/1806.01261.Google Scholar
- Davide Buscaldi and Paulo Rosso. 2008. A conceptual density-based approach for the disambiguation of toponyms. Int. J. Geogr. Info. Sci. 22, 3 (2008), 301–313. DOI:https://doi.org/10.1080/13658810701626251 Google Scholar
Digital Library
- H. Cai, V. W. Zheng, and K. C. Chang. 2018. A comprehensive survey of graph embedding: problems, techniques, and applications. IEEE Trans. Knowl. Data Eng. 30, 9 (2018), 1616–1637.Google Scholar
Digital Library
- Jie Chen, Tengfei Ma, and Cao Xiao. 2018. FastGCN: Fast Learning with Graph Convolutional Networks via Importance Sampling. Retrieved from https://arXiv:cs.LG/1801.10247.Google Scholar
- Ross Girshick, Jeff Donahue, Trevor Darrell, and Jitendra Malik. 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’14). Google Scholar
Digital Library
- M. Gori, G. Monfardini, and F. Scarselli. 2005. A new model for learning in graph domains. In Proceedings of the IEEE International Joint Conference on Neural Networks, Vol. 2. 729–734 vol. 2.Google Scholar
- Sepp Hochreiter and Jurgen Schmidhuber. 1997. Long short-term memory. Neural Comput. 9, 8 (1997), 1735–1780. DOI:https://doi.org/10.1162/neco.1997.9.8.1735 Google Scholar
Digital Library
- Yingjie Hu. 2018. Geo-text data and data-driven geospatial semantics. Geography Compass 12, 11 (2018), e12404. DOI:https://doi.org/10.1111/gec3.12404Google Scholar
Cross Ref
- Gao Huang, Zhuang Liu, Laurens van der Maaten, and Kilian Q. Weinberger. 2017. Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).Google Scholar
- Rie Johnson and Tong Zhang. 2017. Deep pyramid convolutional neural networks for text categorization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 562–570. DOI:https://doi.org/10.18653/v1/P17-1052Google Scholar
Cross Ref
- Armand Joulin, Edouard Grave, Piotr Bojanowski, and Tomas Mikolov. 2016. Bag of Tricks for Efficient Text Classification. Retrieved from https://arXiv:cs.CL/1607.01759.Google Scholar
- Nal Kalchbrenner, Edward Grefenstette, and Phil Blunsom. 2014. A Convolutional Neural Network for Modelling Sentences. Retrieved from https://arXiv:cs.CL/1404.2188.Google Scholar
- Yoon Kim. 2014. Convolutional Neural Networks for Sentence Classification. Retrieved from https://arXiv:cs.CL/1408.5882.Google Scholar
- Thomas N. Kipf and Max Welling. 2016. Semi-Supervised Classification with Graph Convolutional Networks. Retrieved from https://arXiv:cs.LG/1609.02907.Google Scholar
- Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner. 1998. Gradient-based learning applied to document recognition. Proc. IEEE 86, 11 (1998), 2278–2324.Google Scholar
Cross Ref
- John Boaz Lee, Ryan A. Rossi, Sungchul Kim, Nesreen K. Ahmed, and Eunyee Koh. 2018. Attention Models in Graphs: A Survey. Retrieved from https://arXiv:cs.AI/1807.07984.Google Scholar
- Yujia Li, Daniel Tarlow, Marc Brockschmidt, and Richard Zemel. 2015. Gated Graph Sequence Neural Networks. Retrieved from https://arXiv:cs.LG/1511.05493.Google Scholar
- Michael D. Lieberman and Hanan Samet. 2011. Multifaceted toponym recognition for streaming news. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information (SIGIR’11). ACM Press, 843. DOI:https://doi.org/10.1145/2009916.2010029 Google Scholar
Digital Library
- Zachary C. Lipton, John Berkowitz, and Charles Elkan. 2015. A Critical Review of Recurrent Neural Networks for Sequence Learning. Retrieved from https://arXiv:cs.LG/1506.00019.Google Scholar
- Pengfei Liu, Xipeng Qiu, and Xuanjing Huang. 2016. Recurrent Neural Network for Text Classification with Multi-Task Learning. Retrieved from https://arXiv:cs.CL/1605.05101. Google Scholar
Digital Library
- M. E. Maron and J. L. Kuhns. 1960. On relevance, probabilistic indexing and information retrieval. J. ACM 7, 3 (July 1960), 216–244. DOI:https://doi.org/10.1145/321033.321035 Google Scholar
Digital Library
- A. Micheli. 2009. Neural network for graphs: A contextual constructive approach. IEEE Trans. Neural Netw. 20, 3 (2009), 498–511. Google Scholar
Digital Library
- Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. Retrieved from https://arXiv:cs.CL/1301.3781.Google Scholar
- Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems 26, C. J. C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Q. Weinberger (Eds.). Curran Associates, 3111–3119. Retrieved from http://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf. Google Scholar
Digital Library
- Simon Overell and Stefan Ruger. 2008. Using co-occurrence models for placename disambiguation. Int. J. Geogr. Info. Sci. 22, 3 (2008), 265–287. DOI:https://doi.org/10.1080/13658810701626236 Google Scholar
Digital Library
- Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: Global vectors for word representation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’14). Association for Computational Linguistics, 1532–1543. DOI:https://doi.org/10.3115/v1/D14-1162Google Scholar
Cross Ref
- Afshin Rahimi, Trevor Cohn, and Timothy Baldwin. 2018. Semi-supervised User Geolocation via Graph Convolutional Networks. Retrieved from https://arXiv:cs.CL/1804.08049.Google Scholar
- Francois Rousseau, Emmanouil Kiagias, and Michalis Vazirgiannis. 2015. Text categorization as a graph classification problem. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Association for Computational Linguistics, 1702–1712. DOI:https://doi.org/10.3115/v1/P15-1164Google Scholar
- M. Sun, J. Li, Z. Guo, Z. Yu, Y. Zheng, X. Si, and Z. Liu. 2016. Thuctc: An efficient chinese text classifier. Retrieved from GitHub Repository.Google Scholar
- V. David Sanchez A. 2003. Advanced support vector machines and kernel methods. Neurocomputing 55, 1 (2003), 5–20. DOI:https://doi.org/10.1016/S0925-2312(03)00373-4Support Vector Machines.Google Scholar
Cross Ref
- Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. 2017. Graph Attention Networks. Retrieved from https://arXiv:stat.ML/1710.10903.Google Scholar
- Canhui Wang, Min Zhang, Shaoping Ma, and Liyun Ru. 2008. Automatic online news issue construction in web environment. In Proceedings of the 17th international conference on World Wide Web. 457–466. Google Scholar
Digital Library
- Jimin Wang, Yingjie Hu, and Kenneth Joseph. 2020. NeuroTPR: A neuro-net toponym recognition model for extracting locations from social media messages. Trans. GIS 24, 3 (2020), 719–735. DOI:https://doi.org/10.1111/tgis.12627 Retrieved from arXiv:https://onlinelibrary.wiley.com/doi/pdf/10.1111/tgis.12627.Google Scholar
Cross Ref
- Liang Yao, Chengsheng Mao, and Yuan Luo. 2018. Graph convolutional networks for text classification. Retrieved from http://arxiv.org/abs/1809.05679.Google Scholar
- Peng Zhou, Wei Shi, Jun Tian, Zhenyu Qi, Bingchen Li, Hongwei Hao, and Bo Xu. 2016. Attention-based bidirectional long short-term memory networks for relation classification. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 207–212. DOI:https://doi.org/10.18653/v1/P16-2034Google Scholar
Cross Ref
Index Terms
geoGAT: Graph Model Based on Attention Mechanism for Geographic Text Classification
Recommendations
A radical-aware attention-based model for Chinese text classification
AAAI'19/IAAI'19/EAAI'19: Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial IntelligenceRecent years, Chinese text classification has attracted more and more research attention. However, most existing techniques which specifically aim at English materials may lose effectiveness on this task due to the huge difference between Chinese and ...
Chinese text classification by the Naïve Bayes Classifier and the associative classifier with multiple confidence threshold values
Each type of classifier has its own advantages as well as certain shortcomings. In this paper, we take the advantages of the associative classifier and the Naive Bayes Classifier to make up the shortcomings of each other, thus improving the accuracy of ...
Graph Fusion Network for Text Classification
AbstractText classification is an important and classical problem in natural language processing. Recently, Graph Neural Networks (GNNs) have been widely applied in text classification and achieved outstanding performance. Despite the success ...
Highlights- We transform external knowledge into structural information to build better graphs.






Comments