Neural Network Meets DCN: Traffic-driven Topology Adaptation with Deep Learning

Published: 13 June 2018

Abstract

Emerging optical/wireless topology reconfiguration technologies have shown great potential in improving the performance of data center networks. However, they also pose a major challenge: how to find the best topology configuration to support dynamic traffic demands. In this work, we present xWeaver, a traffic-driven deep learning solution that infers high-performance network topologies online. xWeaver supports a powerful network model that enables topology optimization over different performance metrics and network architectures. With properly structured neural networks, it automatically derives the critical traffic patterns from data traces and learns the underlying mapping between traffic patterns and topology configurations specific to the target data center. After offline training, xWeaver generates an optimized (or near-optimal) topology configuration online, and can also smoothly update its model parameters for new traffic patterns. We build an optical-circuit-switch-based testbed to demonstrate the functionality and transmission efficiency of our proposed solution. We further perform extensive simulations to show the significant performance gain of xWeaver in supporting higher network throughput and shorter flow completion time.
