The Tip of the Buyer: Extracting Product Tips from Reviews

Published: 23 February 2023

Abstract

Product reviews play a key role in e-commerce platforms. Studies show that many users read product reviews before a purchase and trust them to the same extent as personal recommendations. However, in many cases, the number of reviews per product is large and extracting useful information becomes a challenging task. Several websites have recently added an option to post tips—short, concise, practical, and self-contained pieces of advice about the products. These tips are complementary to the reviews and usually add a new non-trivial insight about the product, beyond its title, attributes, and description. Yet, most if not all major e-commerce platforms lack the notion of a tip as a first-class citizen and customers typically express their advice through other means, such as reviews.

In this work, we propose an extractive method for tip generation from product reviews. We focus on five popular e-commerce domains whose reviews tend to contain useful non-trivial tips that are beneficial for potential customers. We formally define the task of tip extraction in e-commerce by providing the list of tip types, tip timing (before and/or after the purchase), and connection to the surrounding context sentences. To extract the tips, we propose a supervised approach and leverage a publicly available dataset, annotated by human editors, containing 14,000 product reviews. To demonstrate the potential of our approach, we compare different tip generation methods and evaluate them both manually and over the labeled set. Our approach demonstrates particularly high performance for popular products in the Baby, Home Improvement, and Sports & Outdoors domains, with precision of over 95% for the top 3 tips per product. In addition, we evaluate the performance of our methods on previously unseen domains. Finally, we discuss the practical usage of our approach in real-world applications. Concretely, we explain how tips generated from user reviews can be integrated in various use cases within e-commerce platforms and benefit both buyers and sellers.

Published in

ACM Transactions on Internet Technology, Volume 23, Issue 1
February 2023
564 pages
ISSN: 1533-5399
EISSN: 1557-6051
DOI: 10.1145/3584863
Editor: Ling Liu

Publisher

Association for Computing Machinery, New York, NY, United States

Publication History

• Published: 23 February 2023
• Online AM: 14 July 2022
• Accepted: 21 June 2022
• Revised: 27 February 2022
• Received: 29 November 2021