ABSTRACT
Content delivery networks serve a major fraction of the Internet traffic, and their geographically deployed infrastructure makes them a good vantage point to observe traffic access patterns. We perform a large-scale investigation to characterize Web traffic patterns observed from a major CDN infrastructure. Specifically, we discover that responses with application/json content-type form a growing majority of all HTTP requests. As a result, we seek to understand what types of devices and applications are requesting JSON objects and explore opportunities to optimize CDN delivery of JSON traffic. Our study shows that mobile applications account for at least 52% of JSON traffic on the CDN and embedded devices account for another 12% of all JSON traffic. We also find that more than 55% of JSON traffic on the CDN is uncacheable, showing that a large portion of JSON traffic on the CDN is dynamic. By further looking at patterns of periodicity in requests, we find that 6.3% of JSON traffic is periodically requested and reflects the use of (partially) autonomous software systems, IoT devices, and other kinds of machine-to-machine communication. Finally, we explore dependencies in JSON traffic through the lens of ngram models and find that these models can capture patterns between subsequent requests. We can potentially leverage this to prefetch requests, improving the cache hit ratio.
References
- Ecmascript 5.1 language specification. https://www.ecma-international.org/ecma-262/5.1/#sec-15.12.2.Google Scholar
- Edge Device Characteristics - Akamai. https://learn.akamai.com/en-us/webhelp/ion/oca/GUID-8DC8807F-B65E-40EC-BB14-896C9F12026E.html.Google Scholar
- Fact & Figures - Akamai. https://www.akamai.com/us/en/about/facts-figures.jsp.Google Scholar
- ION Web Performance Optimization - Akamai. https://www.akamai.com/us/en/products/performance/web-performance-optimization.jsp.Google Scholar
- The json data interchange syntax. http://www.ecma-international.org/publications/files/ECMA-ST/ECMA-404.pdf.Google Scholar
- Kona Site Defender - Akamai. https://www.akamai.com/us/en/products/security/kona-site-defender.jsp.Google Scholar
- Media types - iana. https://www.iana.org/assignments/media-types/media-types.xhtml.Google Scholar
- Mime types - mdn. https://developer.mozilla.org/en-US/docs/Web/HTTP/Basics_of_HTTP/MIME_types.Google Scholar
- Progressive web apps - the app shell model. https://developers.google.com/web/fundamentals/architecture/app-shell.Google Scholar
- Symmantec Sitereview. https://sitereview.bluecoat.com.Google Scholar
- User agent string database. http://www.useragentstring.com/.Google Scholar
- Lecture on Ngrams and Backoff Models, 2009. http://l2r.cs.uiuc.edu/~danr/Teaching/CS546--09/Lectures/Lec5-Stat-09-ext.pdf.Google Scholar
- Butkiewicz, M., Wang, D., Wu, Z., Madhyastha, H. V., and Sekar, V. Klotski: Reprioritizing web content to improve user experience on mobile devices. In 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI 15) (Oakland, CA, May 2015), USENIX Association, pp. 439--453.Google Scholar
- Choi, B., Kim, J., Cho, D., Kim, S., and Han, D. Appx: an automated app acceleration framework for low latency mobile app. In Proceedings of the 14th International Conference on emerging Networking EXperiments and Technologies (2018), ACM, pp. 27--40.Google Scholar
Digital Library
- Fielding, R., and Reschke, J. Rfc 7231-hypertext transfer protocol (http/1.1): Semantics and content. Internet Engineering Task Force (IETF) (2014). https://tools.ietf.org/html/rfc7231.Google Scholar
- Gerber, A., and Doverspike, R. Traffic types and growth in backbone networks. In Optical Fiber Communication Conference/National Fiber Optic Engineers Conference 2011 (2011), Optical Society of America.Google Scholar
Cross Ref
- Higgins, B. D., Flinn, J., Giuli, T. J., Noble, B., Peplin, C., and Watson, D. Informed mobile prefetching. In Proceedings of the 10th international conference on Mobile systems, applications, and services (2012), ACM, pp. 155--168.Google Scholar
Digital Library
- Kline, J., Barford, P., Cahn, A., and Sommers, J. On the structure and characteristics of user agent string. In Proceedings of the 2017 Internet Measurement Conference (New York, NY, USA, 2017), IMC '17, ACM, pp. 184--190.Google Scholar
- Miskovic, S., Lee, G. M., Liao, Y., and Baldi, M. Appprint: automatic fingerprinting of mobile applications in network traffic. In International Conference on Passive and Active Network Measurement (2015), Springer, pp. 57--69.Google Scholar
Cross Ref
- Nejati, J., and Balasubramanian, A. An in-depth study of mobile browser performance. In Proceedings of the 25th International Conference on World Wide Web (2016), International World Wide Web Conferences Steering Committee, pp. 1305--1315.Google Scholar
- Netravali, R., and Mickens, J. Prophecy: Accelerating mobile page loads using final-state write logs. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI 18) (Renton, WA, Apr. 2018), USENIX Association, pp. 249--266.Google Scholar
- Nygren, E., Sitaraman, R. K., and Sun, J. The Akamai network: A platform for high-performance internet applications. ACM SIGOPS Operating Systems Review 44, 3 (2010), 2--19.Google Scholar
Digital Library
- Poese, I., Frank, B., Smaragdakis, G., Uhlig, S., Feldmann, A., and Maggs, B. Enabling content-aware traffic engineering. ACM SIGCOMM Computer Communication Review 42, 5 (2012), 21--28.Google Scholar
Digital Library
- Pujol, E., Richter, P., Chandrasekaran, B., Smaragdakis, G., Feldmann, A., Maggs, B. M., and Ng, K.-C. Back-office web traffic on the internet. In Proceedings of the 2014 Conference on Internet Measurement Conference (2014), ACM, pp. 257--270.Google Scholar
Digital Library
- Richter, P., Padmanabhan, R., Spring, N., Berger, A., and CLark, D. Advancing the Art of Internet Edge Outage Detection. In Proceedings of ACM IMC 2018 (Boston, MA, November 2018).Google Scholar
- Richter, P., Padmanabhan, R., Spring, N., Berger, A., and Clark, D. Advancing the art of internet edge outage detection. In Proceedings of the Internet Measurement Conference 2018 (2018), ACM, pp. 350--363.Google Scholar
Digital Library
- Sivakumar, A., Puzhavakath Narayanan, S., Gopalakrishnan, V., Lee, S., Rao, S., and Sen, S. Parcel: Proxy assisted browsing in cellular networks for energy and latency reduction. In Proceedings of the 10th ACM International on Conference on Emerging Networking Experiments and Technologies (New York, NY, USA, 2014), CoNEXT '14, ACM, pp. 325--336.Google Scholar
Digital Library
- Trevisan, M., Giordano, D., Drago, I., Mellia, M., and Munafo, M. Five years at the edge: watching internet from the isp network. In Proceedings of the 14th International Conference on Emerging Networking Experiments and Technologies (2018), ACM, pp. 1--12.Google Scholar
Digital Library
- Vlachos, M., Yu, P., and Castelli, V. On periodicity detection and structural periodic similarity. In Proceedings of the 2005 SIAM international conference on data mining (2005), SIAM, pp. 449--460.Google Scholar
Cross Ref
- Wang, X. S., Balasubramanian, A., Krishnamurthy, A., and Wetherall, D. Demystifying page load performance with WProf. In Presented as part of the 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13) (Lombard, IL, 2013), USENIX, pp. 473--485.Google Scholar
- Wang, X. S., Krishnamurthy, A., and Wetherall, D. Speeding up web page loads with shandian. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16) (Santa Clara, CA, 2016), USENIX Association, pp. 109--122.Google Scholar
Digital Library
- Xu, Q., Andrews, T., Liao, Y., Miskovic, S., Mao, Z. M., Baldi, M., and Nucci, A. Flowr: a self-learning system for classifying mobileapplication traffic. ACM SIGMETRICS Performance Evaluation Review 42, 1 (2014), 569--570.Google Scholar
Digital Library
- Xu, Q., Erman, J., Gerber, A., Mao, Z., Pang, J., and Venkataraman, S. Identifying diverse usage behaviors of smartphone apps. In Proceedings of the 2011 ACM SIGCOMM conference on Internet measurement conference (2011), ACM, pp. 329--344.Google Scholar
Digital Library
- Yao, H., Ranjan, G., Tongaonkar, A., Liao, Y., and Mao, Z. M. Samples: Self adaptive mining of persistent lexical snippets for classifying mobile application traffic. In Proceedings of the 21st Annual International Conference on Mobile Computing and Networking (2015), ACM, pp. 439--451.Google Scholar
Digital Library
- Zhao, Y., Laser, M. S., Lyu, Y., and Medvidovic, N. Leveraging program analysis to reduce user-perceived latency in mobile applications. In Proceedings of the 40th International Conference on Software Engineering (2018), ACM, pp. 176--186.Google Scholar
Digital Library
Index Terms
Characterizing JSON Traffic Patterns on a CDN





Comments