Abstract
Search engines are still the most common way to find information on the Web. However, they are largely unable to provide satisfactory answers to time- and location-specific queries. Such queries can best, and often only, be answered by people who are currently on-site. Although online platforms for community question answering are very popular, only a few of them take users’ current physical locations into account. In this article, we present CloseUp, our prototype for the seamless integration of community-driven live search into a Google-like search experience. Our efforts focus on overcoming the defining differences between traditional Web search and community question answering, namely the formulation of search requests (keyword-based queries vs. well-formed questions) and the expected response times (milliseconds vs. minutes or hours). To this end, the system features a deep learning pipeline that analyzes submitted queries and translates relevant queries into questions. Searching users can then submit the suggested questions to a community of mobile users. CloseUp provides a stand-alone mobile application for submitting, browsing, and replying to questions. Replies from mobile users are presented as live results in the search interface. In a field study, we evaluated the feasibility and practicability of our approach.
CloseUp—A Community-Driven Live Online Search Engine