Abstract
We focus on using unstructured, natural-language textual Knowledge Bases (KBs) to answer questions from community-based Question-and-Answer (Q&A) websites. We propose a novel framework that integrates multi-level tag recommendation with external KBs to retrieve the KB articles most relevant to user-posted questions. Unlike many existing efforts that rely primarily on the Q&A sites' own historical data (e.g., user answers), retrieving answers from authoritative external KBs (e.g., online programming documentation repositories) has the potential to provide rich information that helps users better understand the problem, acquire the underlying knowledge, and hence avoid asking similar questions in the future. The proposed multi-level tag recommendation exploits the rich tag information by first categorizing tags into different semantic levels based on their usage frequencies. A post-tag co-clustering model, augmented by a two-step tag recommender, is used to predict tags at different levels for a given user-posted question. A KB article retrieval component uses the recommended multi-level tags to select the appropriate KBs and then to search and rank the matching articles within them. We conduct extensive experiments using real-world data from a Q&A site and multiple external KBs to demonstrate the effectiveness of the proposed question-answering framework.
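The first step described above, grouping tags into levels by usage frequency, can be illustrated with a minimal sketch. The abstract does not state the rule used to separate the levels, so the top-N cutoff below, the two-level split, and the names `split_tags_by_frequency` and `n_high` are assumptions made only for illustration, not the authors' method.

```python
from collections import Counter


def split_tags_by_frequency(posts, n_high):
    """Split tags into two levels by how often they are used.

    posts:  iterable of tag lists, one list per question.
    n_high: hypothetical cutoff; the n_high most-used tags form the
            high level, all remaining tags form the low level.
    """
    # Count how many times each tag appears across all questions.
    counts = Counter(tag for tags in posts for tag in tags)
    # Rank tags from most used to least used.
    ranked = [tag for tag, _ in counts.most_common()]
    return {"high": ranked[:n_high], "low": ranked[n_high:]}


# Toy data: each inner list holds the tags of one question.
posts = [
    ["java", "generics"],
    ["java", "arraylist"],
    ["python", "django"],
    ["java", "python"],
]
levels = split_tags_by_frequency(posts, n_high=2)
```

In this toy data the frequently used tags name broad topics (a programming language), while the rarely used ones name narrower topics, which is the intuition behind treating usage frequency as a proxy for semantic level.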
Integrating Multi-level Tag Recommendation with External Knowledge Bases for Automatic Question Answering