DOI: 10.1145/3323933.3324084 (ICCTA Conference Proceedings)
research-article
Open Access

Information Extraction from Unstructured Recipe Data

ABSTRACT

Online food recipes are an important source of information for many individuals, who use them to learn how to cook new dishes and to choose their meals. However, recipes often lack structured information, which is useful for improving the search and recommendation systems of food recipe websites and for calculating accurate nutritional information, both of which bring additional value to users. To solve this problem, FRIES was developed. FRIES automatically extracts the names, quantities, units and cooking methods for each ingredient in a recipe. The system uses mainly rule-based methods and achieves an average F-measure of 0.89 for the extraction of the cooking methods present in a recipe and an average F-measure of 0.83 for the extraction of associations linking cooking methods to ingredients. These results show that FRIES can accurately and automatically extract information from cooking recipes. This information can be used to estimate the nutritional information of food recipes and to support recommendation systems.
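The abstract does not describe the actual rules used by FRIES, so the following Python sketch only illustrates what rule-based extraction and F-measure scoring can look like in general. It assumes a small hand-written unit vocabulary, a small cooking-method vocabulary, and a simple regular expression; the names UNITS, COOKING_METHODS, parse_ingredient, find_cooking_methods and f_measure are hypothetical and are not taken from the paper.

```python
import re

# Hypothetical vocabularies for illustration only; not taken from FRIES.
UNITS = {"cup", "cups", "tablespoon", "tablespoons",
         "teaspoon", "teaspoons", "g", "kg", "ml", "l"}
COOKING_METHODS = {"bake", "boil", "fry", "grill", "roast", "simmer", "steam"}

# A leading number (integer, decimal, or simple fraction) followed by the rest.
INGREDIENT_RE = re.compile(
    r"^\s*(?P<quantity>\d+(?:\.\d+)?(?:/\d+)?)\s+(?P<rest>.+?)\s*$"
)


def parse_ingredient(line):
    """Split an ingredient line into quantity, unit and name using simple rules."""
    match = INGREDIENT_RE.match(line)
    if match is None:
        # No leading number: treat the whole line as the ingredient name.
        return {"quantity": None, "unit": None, "name": line.strip()}
    words = match.group("rest").split()
    unit = None
    if words and words[0].lower() in UNITS:
        unit = words[0].lower()
        words = words[1:]
    if words and words[0].lower() == "of":
        words = words[1:]  # drop the connective in "teaspoon of salt"
    return {"quantity": match.group("quantity"),
            "unit": unit,
            "name": " ".join(words)}


def find_cooking_methods(step):
    """Return the known cooking-method words that occur in an instruction step."""
    tokens = re.findall(r"[a-z]+", step.lower())
    return {token for token in tokens if token in COOKING_METHODS}


def f_measure(predicted, gold):
    """Balanced F-measure (harmonic mean of precision and recall) over two sets."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)
```

For example, parse_ingredient("1/2 teaspoon of salt") separates the quantity "1/2", the unit "teaspoon" and the name "salt", and f_measure compares a set of extracted cooking methods against a manually annotated set. A real system would need far larger vocabularies and linguistic analysis to link each cooking method to the ingredients it applies to.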
