Abstract
With the ever-increasing diffusion of smart devices and Internet of Things (IoT) applications, a completely new set of challenges have been added to the Data Mining domain. Edge Mining and Cloud Mining refer to Data Mining tasks aimed at IoT scenarios and performed according to, respectively, Cloud or Edge computing principles. Given the orthogonality and interdependence among the Data Mining task goals (e.g., accuracy, support, precision), the requirements of IoT applications (mainly bandwidth, energy saving, responsiveness, privacy preserving, and security) and the features of Edge/Cloud deployments (de-centralization, reliability, and ease of management), we propose EdgeMiningSim, a simulation-driven methodology inspired by software engineering principles for enabling IoT Data Mining. Such a methodology drives the domain experts in disclosing actionable knowledge, namely descriptive or predictive models for taking effective actions in the constrained and dynamic IoT scenario. A Smart Monitoring application is instantiated as a case study, aiming to exemplify the EdgeMiningSim approach and to show its benefits in effectively facing all those multifaceted aspects that simultaneously impact on IoT Data Mining.
- Charu C. Aggarwal, Naveen Ashish, and Amit Sheth. 2013. The internet of things: A survey from the data-centric perspective. In Managing and Mining Sensor Data. Springer, 383–428.Google Scholar
- Furqan Alam, Rashid Mehmood, Iyad Katib, and Aiiad Albeshri. 2016. Analysis of eight data mining algorithms for smarter Internet of Things (IoT). Proc. Comput. Sci. 98 (2016), 437–442. https://www.sciencedirect.com/science/article/pii/S187705091632213X. Google Scholar
Digital Library
- Luigi Atzori, Antonio Iera, Giacomo Morabito, and Michele Nitti. 2012. The social internet of things (siot)–When social networks meet the internet of things: Concept, architecture and network characterization. Comput. Netw. 56, 16 (2012), 3594–3608. Google Scholar
Digital Library
- Laura Belli, Simone Cirani, Luca Davoli, Gianluigi Ferrari, Lorenzo Melegari, Màrius Montón, and Marco Picone. 2015. A scalable big stream cloud architecture for the internet of things. Int. J. Syst. Service-Orient. Eng. 5, 4 (2015), 26–53. Google Scholar
Digital Library
- Kanishka Bhaduri and Marco Stolpe. 2013. Distributed data mining in sensor networks. In Managing and Mining Sensor Data. Springer, 211–236.Google Scholar
- Md Zakirul Alam Bhuiyan, Jie Wu, Gary M. Weiss, Thaier Hayajneh, Tian Wang, and Guojun Wang. 2020. Event detection through differential pattern mining in cyber-physical systems. IEEE Trans. Big Data 6, 4 (2020), 652--665. DOI:10.1109/TBDATA.2017.2731838Google Scholar
Cross Ref
- Shen Bin, Liu Yuan, and Wang Xiaoyi. 2010. Research on data mining models for the internet of things. In Proceedings of the 2010 International Conference on Image Analysis and Signal Processing. IEEE, 127–132.Google Scholar
- Antonio Brogi and Stefano Forti. 2017. QoS-aware deployment of IoT applications through the fog. IEEE IoT J. 4, 5 (2017), 1185–1192.Google Scholar
- James Byrne, Sergej Svorobej, Anna Gourinovitch, Divyaa Manimaran Elango, Paul Liston, Peter J. Byrne, and Theo Lynn. 2017. RECAP simulator: Simulation of cloud/edge/fog computing scenarios. In Proceedings of the 2017 Winter Simulation Conference (WSC’17). IEEE, 4568–4569.Google Scholar
Cross Ref
- Rodrigo N. Calheiros, Rajiv Ranjan, Anton Beloglazov, César A. F. De Rose, and Rajkumar Buyya. 2011. CloudSim: A toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Softw.: Pract. Exper. 41, 1 (2011), 23–50. Google Scholar
Digital Library
- Longbing Cao and Chengqi Zhang. 2007. The evolution of KDD: Towards domain-driven data mining. Int. J. Pattern Recogn. Artif. Intell. 21, 4 (2007), 677–692.Google Scholar
Cross Ref
- Roberto Casadei, Giancarlo Fortino, Danilo Pianini, Wilma Russo, Claudio Savaglio, and Mirko Viroli. 2019. A development approach for collective opportunistic Edge-of-Things services. Inf. Sci. 498 (2019), 154–169.Google Scholar
Digital Library
- Roberto Casadei, Giancarlo Fortino, Danilo Pianini, Wilma Russo, Claudio Savaglio, and Mirko Viroli. 2019. Modelling and simulation of opportunistic IoT services with aggregate computing. Fut. Gener. Comput. Syst. 91 (2019), 252–262.Google Scholar
Cross Ref
- Feng Chen, Pan Deng, Jiafu Wan, Daqiang Zhang, Athanasios V. Vasilakos, and Xiaohui Rong. 2015. Data mining for the internet of things: Literature review and challenges. Int. J. Distrib. Sens. Netw. 11, 8 (2015), 431047. Google Scholar
Digital Library
- Ying Cheng, Ken Chen, Hemeng Sun, Yongping Zhang, and Fei Tao. 2018. Data and knowledge mining with big data towards smart production. J. Industr. Inf. Integr. 9 (2018), 1–13.Google Scholar
- Marcos Dias de Assuncao, Alexandre da Silva Veith, and Rajkumar Buyya. 2018. Distributed data stream processing and edge computing: A survey on resource elasticity and future directions. J. Netw. Comput. Appl. 103 (2018), 1–17. Google Scholar
Digital Library
- Inderjit S. Dhillon and Dharmendra S. Modha. 2002. A data-clustering algorithm on distributed memory multiprocessors. In Large-scale Parallel Data Mining. Springer, 245–260. Google Scholar
Digital Library
- Sofia Dimitriadou and Helen Karatza. 2010. Job scheduling in a distributed system using backfilling with inaccurate runtime computations. In Proceedings of the 2010 International Conference on Complex, Intelligent and Software Intensive Systems. IEEE, 329–336. Google Scholar
Digital Library
- Usama Fayyad, Gregory Piatetsky-Shapiro, and Padhraic Smyth. 1996. From data mining to knowledge discovery in databases. AI Mag. 17, 3 (1996), 37–37.Google Scholar
- Giancarlo Fortino, Raffaele Gravina, Wilma Russo, and Claudio Savaglio. 2017. Modeling and simulating internet-of-things systems: A hybrid agent-oriented approach. Comput. Sci. Eng. 19, 5 (2017), 68–76.Google Scholar
Cross Ref
- Giancarlo Fortino, Anna Rovella, Wilma Russo, and Claudio Savaglio. 2016. Towards cyberphysical digital libraries: Integrating IoT smart objects into digital libraries. In Management of Cyber Physical Objects in the Future Internet of Things. Springer, 135–156.Google Scholar
- Giancarlo Fortino and Wilma Russo. 2012. ELDAMeth: An agent-oriented methodology for simulation-based prototyping of distributed agent systems. Inf. Softw. Technol. 54, 6 (2012), 608–624. Google Scholar
Digital Library
- Giancarlo Fortino, Wilma Russo, Claudio Savaglio, Weiming Shen, and Mengchu Zhou. 2017. Agent-oriented cooperative smart objects: From IoT system design to implementation. IEEE Trans. Syst. Man Cybernet.: Syst. 48, 11 (2017), 1939–1956.Google Scholar
Cross Ref
- Elena I. Gaura, James Brusey, Michael Allen, Ross Wilkins, Dan Goldsmith, and Ramona Rednic. 2013. Edge mining the internet of things. IEEE Sens. J. 13, 10 (2013), 3816–3825.Google Scholar
Cross Ref
- Jie Han and Michael Orshansky. 2013. Approximate computing: An emerging paradigm for energy-efficient design. In Proceedings of the 2013 18th IEEE European Test Symposium (ETS’13). IEEE, 1–6.Google Scholar
Cross Ref
- Devki Nandan Jha, Khaled Alwasel, Areeb Alshoshan, Xianghua Huang, Ranesh Kumar Naha, Sudheer Kumar Battula, Saurabh Garg, Deepak Puthal, Philip James, Albert Zomaya, et al. 2020. IoTSim-Edge: A simulation framework for modeling the behavior of Internet of Things and edge computing environments. Softw.: Pract. Exper. 50, 6 (2020), 844--867.Google Scholar
Cross Ref
- Jiong Jin, Jayavardhana Gubbi, Slaven Marusic, and Marimuthu Palaniswami. 2014. An information framework for creating a smart city through internet of things. IEEE IoT J. 1, 2 (2014), 112–121.Google Scholar
- Cheonshik Kim, San-Yep Nam, Duk-Je Park, Injung Park, and Taek-Young Hyun. 2006. Product control system using RFID tag information and data mining. In Proceedings of the International Conference on Ubiquitous Convergence Technology. Springer, 100–109. Google Scholar
Digital Library
- Karthik Kumar and Yung-Hsiang Lu. 2010. Cloud computing for mobile users: Can offloading computation save energy? Computer 43, 4 (2010), 51–56. Google Scholar
Digital Library
- Xiaolei Ma, Yao-Jan Wu, Yinhai Wang, Feng Chen, and Jianfeng Liu. 2013. Mining smart card data for transit riders’ travel patterns. Transport. Res. C: Emerg. Technol. 36 (2013), 1–12.Google Scholar
Cross Ref
- Óscar Marbán, Gonzalo Mariscal, and Javier Segovia. 2009. A data mining & knowledge discovery process model. In Data Mining and Knowledge Discovery in Real Life Applications. IntechOpen.Google Scholar
- Gonzalo Mariscal, Oscar Marban, and Covadonga Fernandez. 2010. A survey of data mining and knowledge discovery process models and methodologies. Knowledge Eng. Rev. 25, 2 (2010), 137–166. Google Scholar
Digital Library
- Andras Markus and Attila Kertesz. 2020. A survey and taxonomy of simulation environments modelling fog computing. Simul. Model. Pract. Theory 101 (2020), 102042.Google Scholar
Cross Ref
- Charafeddine Mechalikh, Hajer Taktak, and Faouzi Moussa. 2019. PureEdgeSim: A simulation toolkit for performance evaluation of cloud, fog, and pure edge computing environments. In Proceedings of the 2019 International Conference on High Performance Computing & Simulation. 700–707. DOI:https://doi.org/10.5281/zenodo.2648161Google Scholar
Cross Ref
- Boris Milovic and Milan Milovic. 2012. Prediction and decision making in health care using data mining. Arab. J. Bus. Manage. Rev. (Kuwait Chap.) 1, 12 (2012), 126.Google Scholar
- Decebal Constantin Mocanu, Elena Mocanu, Phuong H. Nguyen, Madeleine Gibescu, and Antonio Liotta. 2016. Big IoT data mining for real-time energy disaggregation in buildings. In Proceedings of the 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC’16). IEEE, 003765–003769.Google Scholar
Cross Ref
- Ruxandra-Stefania Petre et al. 2012. Data mining in cloud computing. Database Syst. J. 3, 3 (2012), 67–71.Google Scholar
- Gregory J. Pottie and William J. Kaiser. 2000. Wireless integrated network sensors. Commun. ACM 43, 5 (2000), 51–58. Google Scholar
Digital Library
- Claudio Savaglio, Giuseppe Campisano, Giuseppe Di Fatta, and Giancarlo Fortino. 2019. IoT services deployment over edge vs cloud systems: A simulation-based analysis. In Proceedings of the IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS’19). IEEE, 554–559.Google Scholar
Cross Ref
- Claudio Savaglio, Pietro Gerace, Giuseppe Di Fatta, and Giancarlo Fortino. 2019. Data mining at the IoT edge. In Proceedings of the 2019 28th International Conference on Computer Communication and Networks (ICCCN’19). IEEE, 1–6.Google Scholar
Cross Ref
- Claudio Savaglio, Clara Isabel Valero, Andreu Belsa, Carlos Palau, and Giancarlo Fortino. 2020. Interoperability in cloud IoT platforms. In Springer Handbook of Internet of Things (unpublished).Google Scholar
- Shabnam Shadroo and Amir Masoud Rahmani. 2018. Systematic survey of big data and data mining in internet of things. Comput. Netw. 139 (2018), 19–47.Google Scholar
Digital Library
- Weisong Shi, Jie Cao, Quan Zhang, Youhuizi Li, and Lanyu Xu. 2016. Edge computing: Vision and challenges. IEEE IoT J. 3, 5 (2016), 637–646.Google Scholar
- A. Shobanadevi and G. Maragatham. 2017. Data mining techniques for IoT and big data—A survey. In Proceedings of the 2017 International Conference on Intelligent Sustainable Systems (ICISS’17). IEEE, 607–610.Google Scholar
- Cagatay Sonmez, Atay Ozgovde, and Cem Ersoy. 2018. Edgecloudsim: An environment for performance evaluation of edge computing systems. Trans. Emerg. Telecommun. Technol. 29, 11 (2018), e3493.Google Scholar
Digital Library
- Nagender Kumar Suryadevara, Subhas C. Mukhopadhyay, Ruili Wang, and R. K. Rayudu. 2013. Forecasting the behavior of an elderly using wireless sensors data in a smart home. Eng. Appl. Artif. Intell. 26, 10 (2013), 2641–2652. Google Scholar
Digital Library
- Shikhar Suryavansh, Chandan Bothra, Mung Chiang, Chunyi Peng, and Saurabh Bagchi. 2019. Tango of edge and cloud execution for reliability. In Proceedings of the 4th Workshop on Middleware for Edge Clouds & Cloudlets. 10–15. Google Scholar
Digital Library
- Chun-Wei Tsai, Chin-Feng Lai, Ming-Chao Chiang, and Laurence T. Yang. 2013. Data mining for internet of things: A survey. IEEE Commun. Surv. Tutor. 16, 1 (2013), 77–97.Google Scholar
Cross Ref
- M. Ammad Uddin, Ali Mansour, Denis Le Jeune, and El Hadi M. Aggoune. 2017. Agriculture internet of things: AG-IoT. In Proceedings of the 2017 27th International Telecommunication Networks and Applications Conference (ITNAC’17). IEEE, 1–6.Google Scholar
Cross Ref
- Hajo Wiemer, Lucas Drowatzky, and Steffen Ihlenfeldt. 2019. Data mining methodology for engineering applications (DMME)—A holistic extension to the CRISP-DM model. Appl. Sci. 9, 12 (2019), 2407.Google Scholar
Cross Ref
- Peter Wlodarczak, Mustafa Ally, and Jeffrey Soar. 2017. Data mining in IoT: Data analysis for a new paradigm on the internet. In Proceedings of the International Conference on Web Intelligence. ACM, 1100–1103. Google Scholar
Digital Library
- Xuyun Zhang, Julian Jang-Jaccard, Lianyong Qi, Md Z. A. Bhuiyan, and Chang Liu. 2018. Privacy issues in big data mining infrastructure, platforms, and applications. Secur. Commun. Netw. 2018, Article 6238607 (2018), 3 pages. https://doi.org/10.1155/2018/6238607 Google Scholar
Digital Library
Index Terms
A Simulation-driven Methodology for IoT Data Mining Based on Edge Computing
Recommendations
Right-Provisioned IoT Edge Computing: An Overview
GLSVLSI '19: Proceedings of the 2019 on Great Lakes Symposium on VLSIEdge computing on the Internet of Things (IoT) is an increasingly popular paradigm in which computation is moved closer to the data source (i.e., edge devices). Edge computing mitigates the overheads of cloud-based computing arising from increased ...
Untangling the Cloud from Edge Computing for IoT
SenSys '21: Proceedings of the 19th ACM Conference on Embedded Networked Sensor SystemsEdge computing improves latency and scalability in IoT by shifting applications from cloud to edge servers. However, it requires a control plane in the cloud, which hinders reliability and scalability, is impractical for remote deployments, incurs ...
Edge and fog computing for IoT: A survey on current research activities & future directions
AbstractThe Internet of Things (IoT) allows communication between devices, things, and any digital assets that send and receive data over a network without requiring interaction with a human. The main characteristic of IoT is the enormous ...
Highlights- We comprehensively review edge computing technology in the IoT environment.
- We ...






Comments