Abstract
Community Q&A forum is a special type of social media that provides a platform to raise questions and to answer them (both by forum participants), to facilitate online information sharing. Currently, community Q&A forums in professional domains have attracted a large number of users by offering professional knowledge. To support information access and save users’ efforts of raising new questions, they usually come with a question retrieval function, which retrieves similar existing questions (and their answers) to a user’s query. However, it can be difficult for community Q&A forums to cover all domains, especially those emerging lately with little labeled data but great discrepancy from existing domains. We refer to this scenario as cross-domain question retrieval. To handle the unique challenges of cross-domain question retrieval, we design a model based on adversarial training, namely,
- [1] . 2009. Supervised domain adaption for WSD. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL’09), , , and (Eds.). ACL, 42–50. Google Scholar
Digital Library
- [2] . 2006. Domain adaptation with structural correspondence learning. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’06). ACL, 120–128. Google Scholar
Digital Library
- [3] . 1993. Signature verification using A “Siamese” time delay neural network. Int. J. Pattern Recogn. Artif. Intell. 7, 4 (1993), 669–688.Google Scholar
Cross Ref
- [4] . 2011. Learning the latent topics for question retrieval in community QA. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP’11). ACL, 273–281.Google Scholar
- [5] . 2016. A semantic graph-based topic model for question retrieval in community question answering. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining, , , , and (Eds.). ACM, 287–296. Google Scholar
Digital Library
- [6] . 2018. Enhancing sentence embedding with generalized pooling. In Proceedings of the 27th International Conference on Computational Linguistics (COLING’18). ACL, 1815–1826.Google Scholar
- [7] . 2017. Enhanced LSTM for natural language inference. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL’17). ACL, 1657–1668.Google Scholar
Cross Ref
- [8] . 2018. Question retrieval for community-based question answering via heterogeneous social influential network. Neurocomputing 285 (2018), 117–124.Google Scholar
Cross Ref
- [9] . 2016. Together we stand: Siamese networks for similar question retrieval. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL’16). ACL.Google Scholar
Cross Ref
- [10] . 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT’19). ACL, 4171–4186.Google Scholar
- [11] . 2020. Adversarial and domain-aware BERT for cross-domain sentiment analysis. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL’20). ACL, 4019–4028.Google Scholar
Cross Ref
- [12] . 2008. Searching questions by identifying question topic and question focus. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL’08: HLT). ACL, 156–164. Retrieved from https://www.aclweb.org/anthology/P08-1019.Google Scholar
- [13] . 2008. Modeling transfer relationships between learning tasks for improved inductive transfer. In Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML/PKDD’08)(
Lecture Notes in Computer Science , Vol. 5211), , , and (Eds.). Springer, 317–332. Google ScholarDigital Library
- [14] . 2016. Domain-Adversarial training of neural networks. J. Mach. Learn. Res. 17 (2016), 59:1–59:35. Google Scholar
Digital Library
- [15] . 2018. Natural language inference over interaction space. In Proceedings of the 6th International Conference on Learning Representations (ICLR’18). OpenReview.net.Google Scholar
- [16] . 2015. Explaining and harnessing adversarial examples. In Proceedings of the 3rd International Conference on Learning Representations (ICLR’15).Google Scholar
- [17] . 2005. Finding similar questions in large question and answer archives. In Proceedings of the ACM CIKM International Conference on Information and Knowledge Management, , , , , and (Eds.). ACM, 84–90. Google Scholar
Digital Library
- [18] . 2012. Question-answer topic model for question retrieval in community question answering. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12), , , , and (Eds.). ACM, 2471–2474. Google Scholar
Digital Library
- [19] . 2007. Instance weighting for domain adaptation in NLP. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL’07). ACL.Google Scholar
- [20] . 2018. Cross-Domain labeled LDA for cross-domain text classification. In Proceedings of the IEEE International Conference on Data Mining (ICDM’18). IEEE Computer Society, 187–196.Google Scholar
Cross Ref
- [21] . 2013. Frequently asked questions retrieval for Croatian based on semantic textual similarity. In Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing ([email protected]’13), , , , and (Eds.). ACL, 24–33.Google Scholar
- [22] . 2020. TextAT: Adversarial training for natural language understanding with token-level perturbation. Retrieved from https://arxiv.org/abs/2004.14543.Google Scholar
- [23] . 2020. Weakly-supervised domain adaption for aspect extraction via multi-level interaction transfer. Retrieved from https://arxiv.org/abs/2006.09235.Google Scholar
- [24] . 2005. Logistic regression with an auxiliary data source. In Proceedings of the 22nd International Conference (ICML’05)(
ACM International Conference Proceeding Series , Vol. 119), and (Eds.). ACM, 505–512. Google ScholarDigital Library
- [25] . 2018. Domain adaptation for disease phrase matching with adversarial networks. In Proceedings of the Biomedical Natural Language Processing Workshop (BioNLP’18). ACL, 137–141.Google Scholar
Cross Ref
- [26] . 2006. Effective self-training for parsing. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics. ACL. Google Scholar
Digital Library
- [27] . 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of the 27th Annual Conference on Neural Information Processing Systems. 3111–3119. Google Scholar
Digital Library
- [28] . 2015. Distributional smoothing with virtual adversarial training. Retrieved from https://arxiv.org/abs/1507.00677.Google Scholar
- [29] . 2016. Siamese recurrent architectures for learning sentence similarity. In Proceedings of the 30th AAAI Conference on Artificial Intelligence. AAAI Press, 2786–2792. Google Scholar
Digital Library
- [30] . 2019. Manhattan siamese LSTM for question retrieval in community question answering. In Proceedings of the On the Move to Meaningful Internet Systems (OTM’19) Confederated International Conferences: CoopIS, ODBASE, and C&TC(
Lecture Notes in Computer Science , Vol. 11877), , , , , , and (Eds.). Springer, 661–677.Google ScholarDigital Library
- [31] . 2020. Multi-Group transfer learning on multiple latent spaces for text classification. IEEE Access 8 (2020), 64120–64130.Google Scholar
Cross Ref
- [32] . 2021. Few-shot text classification by leveraging bi-directional attention and cross-class knowledge. Sci. China Inf. Sci. 64, 3 (2021).Google Scholar
Cross Ref
- [33] . 2018. Cross-Domain sentiment classification with target domain specific information. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL’18). ACL, 2505–2513.Google Scholar
Cross Ref
- [34] . 2014. Glove: Global vectors for word representation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’14). ACL, 1532–1543.Google Scholar
Cross Ref
- [35] . 2018. Deep contextualized word representations. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT’18), , , and (Eds.). ACL, 2227–2237.Google Scholar
Cross Ref
- [36] . 2018. Predicting the semantic textual similarity with siamese CNN and LSTM. In Proceedings of the Actes de la Conférence TALN (CORIA-TALN-RJC’18). ATALA, 311–320.Google Scholar
- [37] . 2019. FAQ retrieval using query-question similarity and BERT-based query-answer relevance. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’19), , , , , , and (Eds.). ACM, 1113–1116. Google Scholar
Digital Library
- [38] . 2003. Example selection for bootstrapping statistical parsers. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL’03). ACL. Google Scholar
Digital Library
- [39] . 2014. Intriguing properties of neural networks. In Proceedings of the 2nd International Conference on Learning Representations (ICLR’14).Google Scholar
- [40] . 2020. Effective FAQ retrieval and question matching with unsupervised knowledge injection. Retrieved from https://arxiv.org/abs/2010.14049.Google Scholar
- [41] . 2019. GLUE: A multi-task benchmark and analysis platform for natural language understanding. In Proceedings of the 7th International Conference on Learning Representations (ICLR’19). OpenReview.net.Google Scholar
- [42] . 2017. A compare-aggregate model for matching text sequences. In Proceedings of the 5th International Conference on Learning Representations (ICLR’17). OpenReview.net.Google Scholar
- [43] . 2017. Bilateral multi-perspective matching for natural language sentences. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI’17). ijcai.org, 4144–4150. Google Scholar
Digital Library
- [44] . 2016. Multi-Perspective context matching for machine comprehension. Retrieved from https://arxiv.org/abs/1612.04211.Google Scholar
- [45] . 2018. A broad-coverage challenge corpus for sentence understanding through inference. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT’18), , , and (Eds.). ACL, 1112–1122.Google Scholar
Cross Ref
- [46] . 2008. Retrieval models for question and answer archives. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’08), , , , , and (Eds.). ACM, 475–482. Google Scholar
Digital Library
- [47] . 2016. ABCNN: Attention-based convolutional neural network for modeling sentence pairs. Trans. Assoc. Comput. Linguist. 4 (2016), 259–272.Google Scholar
Cross Ref
- [48] . 2014. Question retrieval with high quality answers in community question answering. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management (CIKM’14), , , , , , and (Eds.). ACM, 371–380. Google Scholar
Digital Library
- [49] . 2018. An unsupervised model with attention autoencoders for question retrieval. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI’18), the 30th Innovative Applications of Artificial Intelligence (IAAI’18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI’18), and (Eds.). AAAI Press, 4978–4986. Google Scholar
Digital Library
- [50] . 2011. Phrase-Based translation model for question retrieval in community question answer archives. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, , , and (Eds.). ACL, 653–662. Google Scholar
Digital Library
- [51] . 2015. Learning continuous word embedding with metadata for question retrieval in community question answering. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL’15). ACL, 250–259.Google Scholar
Cross Ref
- [52] . 2017. Modeling and learning distributed word representation with metadata for question retrieval. IEEE Trans. Knowl. Data Eng. 29, 6 (2017), 1226–1239. Google Scholar
Digital Library
- [53] . 2013. Improving question retrieval in community question answering using world knowledge. In Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI’13), (Ed.). IJCAI/AAAI, 2239–2245. Google Scholar
Digital Library
- [54] . 2020. FreeLB: Enhanced adversarial training for natural language understanding. In Proceedings of the 8th International Conference on Learning Representations (ICLR’20). OpenReview.net.Google Scholar
Index Terms
Adversarial Cross-domain Community Question Retrieval
Recommendations
Cross-View Adaptation Network for Cross-Domain Relation Extraction
Chinese Computational LinguisticsAbstractIn relation extraction, directly adopting a model trained in the source domain to the target domain will suffer greatly performance decrease. Existing studies extract the shared features between domains in a coarse-grained way, which inevitably ...
Pairwise Adversarial Training for Unsupervised Class-imbalanced Domain Adaptation
KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data MiningUnsupervised domain adaptation (UDA) has become an appealing approach for knowledge transfer from a labeled source domain to an unlabeled target domain. However, when the classes in source and target domains are imbalanced, most existing UDA methods ...
Unsupervised domain adaptation with adversarial distribution adaptation network
AbstractAdversarial domain adaptation is a powerful approach to transfer the knowledge of the label-rich source domain to the label-scarce target domain by mitigating domain shifts across distributions. Existing domain adaptation methods align either the ...






Comments