skip to main content
research-article

I Know What You Bought At Chipotle for $9.81 by Solving A Linear Inverse Problem

Authors Info & Claims
Published:15 June 2021Publication History
Skip Abstract Section

Abstract

We consider the question of identifying which set of products are purchased and at what prices in a given transaction by observing only the total amount spent in the transaction, and nothing more. The ability to solve such an inverse problem can lead to refined information about consumer spending by simply observing anonymized credit card transactions data. Indeed, when considered in isolation, it is impossible to identify the products purchased and their prices from a given transaction just based on the transaction total. However, given a large number of transactions, there may be a hope. As the main contribution of this work, we provide a robust estimation algorithm for decomposing transaction totals into the underlying, individual product(s) purchased by utilizing a large corpus of transactions. Our method recovers a (product prices) vector $p \in \mathbbR ^N_>0 $ of unknown dimension (number of products) N as well as matrix $A \in \mathbbZ ^M \times N _\geq0 $ simply from M observations (transaction totals) $y \in \mathbbR ^M_>0 $ such that $y = A p + η$ with η \in \mathbbR ^M$ representing noise (taxes, discounts, etc.). We formally establish that our algorithm identifies $N, A$ precisely and p approximately, as long as each product is purchased individually at least once, i.e. $M \geq N$ and A has rank N. Computationally, the algorithm runs in polynomial time (with respect to problem parameters), and thus we provide a computationally efficient and statistically robust method for solving such inverse problems. We apply the algorithm to a large corpus of anonymized consumer credit card transactions in the period 2016-2019, with data obtained from a commercial data vendor. The transactions are associated with spending at Apple, Chipotle, Netflix, and Spotify. From just transactions data, our algorithm identifies (i) key price points (without access to the listed prices), (ii) products purchased within a transaction, (iii) product launches, and (iv) evidence of a new 'secret' product from Netflix - rumored to be in limited release.

References

  1. AlternativeData.org. Alternativedata.org database point of sale data. https://alternativedata.org/data-providers/category,point-of-sale. Accessed: 2019-05--19.Google ScholarGoogle Scholar
  2. Mark Bergen and Jennifer Surane. Google and mastercard cut a secret ad deal to track retail sales. https://www.bloomberg.com/news/articles/2018-08--30/google-and-mastercard-cut-a-secret-ad-deal-to-track-retail-sales, August 2018. Accessed: 2019-05--19.Google ScholarGoogle Scholar
  3. Radu Berinde, Anna C Gilbert, Piotr Indyk, Howard Karloff, and Martin J Strauss. Combining geometry and combinatorics: A unified approach to sparse signal recovery. In 2008 46th Annual Allerton Conference on Communication, Control, and Computing, pages 798--805. IEEE, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  4. Florentin Butaru, QingQing Chen, Brian Clark, Sanmay Das, Andrew W Lo, and Akhtar Siddique. Risk and risk management in the credit card industry. Working Paper 21305, National Bureau of Economic Research, June 2015.Google ScholarGoogle Scholar
  5. Emmanuel Candes and Terence Tao. Near optimal signal recovery from random projections: Universal encoding strategies. arXiv preprint math/0410542, 2004.Google ScholarGoogle Scholar
  6. Emmanuel J Candes. The restricted isometry property and its implications for compressed sensing. Comptes rendus mathematique, 346(9--10):589--592, 2008.Google ScholarGoogle Scholar
  7. Chipotle. Chipotle online ordering. https://order.chipotle.com/Meal/Index/1597'showloc=1, 2019. Accessed: 2019-05-01.Google ScholarGoogle Scholar
  8. Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. Introduction to Algorithms, Third Edition. The MIT Press, 3rd edition, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Ryan Dezember. Your smartphone's location data is worth big money to wall street. https://www.wsj.com/articles/your-smartphones-location-data-is-worth-big-money-to-wall-street-1541131260, November 2018. Accessed: 2018--11-04.Google ScholarGoogle Scholar
  10. David L Donoho. Compressed sensing. IEEE Transactions on information theory, 52(4):1289--1306, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. DVD.COM. Dvd.com choose a plan. https://dvd.netflix.com/Plans?dsrc=DVDWEB_NMHOME_NMHEADER_PLANS. Accessed: 2019-05--27.Google ScholarGoogle Scholar
  12. Amir Efrati. U.S. slowdown at Uber and Lyft. https://www.theinformation.com/articles/u-s-slowdown-at-uber-and-lyft, September 2018. Accessed: 2018--10--25.Google ScholarGoogle Scholar
  13. Michael Fleder and Devavrat Shah. Forecasting with alternative data. In Abstracts of the 2020 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS '20, page 23--24, New York, NY, USA, 2020. Association for Computing Machinery. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Bradley Hope. Provider of personal finance tools tracks bank cards sells data to investors. https://www.wsj.com/articles/provider-of-personal-finance-tools-tracks-bank-cards-sells-data-to-investors-1438914620, April 2015. Accessed: 2018-05--10.Google ScholarGoogle Scholar
  15. IO&C. The big trends in data reshaping financial industry. https://ioandc.com/the-big-trends-in-data-reshaping-financial-industry, April 2019. Accessed: 2019-04-07.Google ScholarGoogle Scholar
  16. Jon Kleinberg and Eva Tardos. Algorithm design. Pearson Education India, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S.P Kothari. Capital markets research in accounting. Journal of Accounting and Economics, 31(1):105 -- 231, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  18. Tze Leung Lai, Ching Zong Wei, et al. Least squares estimates in stochastic regression models with applications to identification and control of dynamic systems. The Annals of Statistics, 10(1):154--166, 1982.Google ScholarGoogle ScholarCross RefCross Ref
  19. Netflix. Netflix pick your price. https://www.netflix.com. Accessed: 2019-05--27.Google ScholarGoogle Scholar
  20. Sarah Perez. Spotify and Hulu launch a discounted entertainment bundle for $$12.99$. https://techcrunch.com/2018/04/11/spotify-and-hulu-launch-a-discounted-entertainment-bundle-for-12--99-per-month, April 2018. Accessed: 2019-06--11.Google ScholarGoogle Scholar
  21. Ashley Rodriguez. A history of netflix us price hikes, charted. https://qz.com/1524449/netflix-just-raised-prices-in-the-us-a-history-of-hikes-charted. Accessed: 2019-05--27.Google ScholarGoogle Scholar
  22. Second Measure. Data points. https://secondmeasure.com/datapoints. Accessed: 2019-05--19.Google ScholarGoogle Scholar
  23. Todd Spangler. Netflix testing out pricier new "Ultra" plan at $16.99 per month. https://variety.com/2018/digital/news/netflix-ultra-plan-hdr-ultrahd-test-1202865305, July 2018. Accessed: 2019-05--27.Google ScholarGoogle Scholar
  24. Robin Wigglesworth. Asset management's fight for alternative data analysts heats up. https://www.ft.com/content/2f454550-02c8--11e8--9650--9c0ad2d7c5b5, January 2018. Accessed: 2018-05-07.Google ScholarGoogle Scholar

Index Terms

  1. I Know What You Bought At Chipotle for $9.81 by Solving A Linear Inverse Problem

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!