Abstract
This article addresses the research problem of how to autonomously control Pan/Tilt/Zoom (PTZ) cameras in a manner that seeks to optimize the face recognition accuracy or the overall threat detection and proposes an overall system. The article presents two alternative schemes for camera scheduling: Grid-Based Grouping (GBG) and Elevator-Based Planning (EBP). The camera control works with realistic 3D environments and considers many factors, including the direction of the subject’s movement and its location, distances from the cameras, occlusion, overall recognition probability so far, and the expected time to leave the site, as well as the movements of cameras and their capabilities and limitations. In addition, the article utilizes clustering to group subjects, thereby enabling the system to focus on the areas that are more densely populated. Moreover, it proposes a dynamic mechanism for controlling the pre-recording time spent on running the solution. Furthermore, it develops a parallel algorithm, allowing the most time-consuming phases to be parallelized, and thus run efficiently by the centralized parallel processing subsystem. We analyze through simulation the effectiveness of the overall solution, including the clustering approach, scheduling alternatives, dynamic mechanism, and parallel implementation in terms of overall recognition probability and the running time of the solution, considering the impacts of numerous parameters.
- [1] . 2018. A low-power vision system with adaptive background subtraction and image segmentation for unusual event detection. IEEE Transactions on Circuits and Systems I: Regular Papers 65, 11 (2018), 3842–3853.Google Scholar
Cross Ref
- [2] . 2020. YOLOv4: Optimal speed and accuracy of object detection. CoRR abs/2004.10934. arXiv: 2004.10934. Retrieved from https://arxiv.org/abs/2004.10934v1.Google Scholar
- [3] . 2006. The effect of image resolution on the performance of a face recognition system. In Proceedings of International Conference on Control, Automation, Robotics and Vision. 1–6.Google Scholar
Cross Ref
- [4] . 2019. ArcFace: Additive angular margin loss for deep face recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Google Scholar
Cross Ref
- [5] . 2009. Pedestrian detection: A benchmark. In Proceedings of Computer Vision and Pattern Recognition conference.Google Scholar
- [6] . 2017. Centralised, decentralised, and self-organised coverage maximisation in smart camera networks. In Proceedings of the 2017 IEEE 11th International Conference on Self-Adaptive and Self-Organizing Systems. 1–10.Google Scholar
Cross Ref
- [7] . 2011. Computer Vision: A Modern Approach (Second Edition). Pearson.Google Scholar
- [8] . 2011. Face recognition across pose and illumination. In Handbook of Face Recognition, (Eds.). Springer London, London, 197–221.
DOI: Google ScholarCross Ref
- [9] . 2020. QRMODA and BRMODA: Novel analytical models of face recognition accuracy in terms of video capturing and encoding parameters. In Proceedings of the IEEE International Conference on Multimedia and Expo (
July 2020). 1–6.Google Scholar - [10] . 2015. Performance evaluation of a novel hybrid clustering algorithm using birch and k-means. In Proceeding of the Annual IEEE India Conference. 1–6.Google Scholar
Cross Ref
- [11] . 2019. Automatic PTZ camera control based on deep-q network in video surveillance system. In Proceeding of the International Conference on Electronics, Information, and Communication. 1–3.Google Scholar
Cross Ref
- [12] . 2008. Collaborative real-time control of active cameras in large scale surveillance systems. In Proceedings of the Workshop on Multi-camera and Multi-modal Sensor Fusion Algorithms and Applications.Google Scholar
- [13] . 2015. Static, dynamic, and adaptive heterogeneity in distributed smart camera networks. ACM Transactions on Autonomous and Adaptive Systems 10, 2, (
June 2015), 30 pages.Google ScholarDigital Library
- [14] . 2015. Large-area, multilayered, and high-resolution visual monitoring using a dual-camera system. In Proceeding of the ACM Transactions on Multimedia Computing, Communications, and Applications. 1–23.Google Scholar
- [15] . 2017. SphereFace: Deep hypersphere embedding for face recognition. In Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition, Vol. 1. 1.Google Scholar
Cross Ref
- [16] . 2005. Data Mining and Knowledge Discovery Handbook. Springer.Google Scholar
Cross Ref
- [17] . 2017. Topology management-based distributed camera actuation in wireless multimedia sensor networks. ACM Transactions on Autonomous and Adaptive Systems 12, 1, (
April 2017), 33 pages.Google ScholarDigital Library
- [18] . 2016. Performance analysis of disk scheduling algorithms. International Journal of Computer Sciences and Engineering 4, 5 (2016), 180–184.Google Scholar
- [19] . 2015. Dynamic camera scheduling for visual surveillance in crowded scenes using Markov random fields. In Proceeding of the IEEE International Conference on Advanced Video and Signal Based Surveillance. 1–6.Google Scholar
Cross Ref
- [20] . 2015. Dynamic reconfiguration in camera networks: A short survey. IEEE Transactions on Circuits and Systems for Video Technology 26, 5 (2015), 965–977.
DOI: Google ScholarCross Ref
- [21] . 2010. Occlusion-aware multiple camera reconfiguration. In Proceedings of the IEEE International Conference on Distributed Smart Cameras. 88–94.Google Scholar
Digital Library
- [22] . 2013. Modeling pedestrian travel time and the design of facilities: A queuing approach. PloS One 8, 5 (2013), e63503.Google Scholar
Cross Ref
- [23] . 2015. FaceNet: A unified embedding for face recognition and clustering. In Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition. 815–823.Google Scholar
Cross Ref
- [24] . 2017. Smart surveillance based on video summarization. In Proceeding of IEEE Region 10 Symposium. 1–5.Google Scholar
Cross Ref
- [25] . 2010. Systems and algorithms for autonomous and scalable crowd surveillance using robotic PTZ cameras assisted by a wide-angle camera. Autonomous Robots 29, 1 (2010), 53–66.Google Scholar
Digital Library
- [26] . 2008. Improving long range and high magnification face recognition: Database acquisition, evaluation, and enhancement. Computer Vision and Image Understanding 111, 2 (
August 2008), 111–125.Google ScholarDigital Library
- [27] . 2017. Deep learning based forensic face verification in videos. In Proceeding of the International Conference on Progress in Informatics and Computing. 77–80.Google Scholar
Cross Ref
- [28] . 2003. A master-slave system to acquire biometric imagery of humans at distance. In Proceedings of the ACM SIGMM International Workshop on Video Surveillance. ACM, New York, NY, 113–120.Google Scholar
Digital Library
- [29] . 2009. Face recognition with contiguous occlusion using markov random fields. In Proceedings of the IEEE International Conference on Computer Vision. 1050 –1057.Google Scholar
Index Terms
An Autonomous System for Efficient Control of PTZ Cameras
Recommendations
Efficient Control of PTZ Cameras in Automated Video Surveillance Systems
ISM '12: Proceedings of the 2012 IEEE International Symposium on MultimediaThis paper deals with the camera control problem in automated video surveillance. We develop a solution that seeks to optimize the overall subject recognition probability by controlling the pan, tilt, and zoom of various deployed Pan/Tilt/Zoom (PTZ) ...
Stereo vision using two PTZ cameras
The research of traditional stereo vision is mainly based on static cameras. As PTZ (Pan-Tilt-Zoom) cameras are able to obtain multi-view-angle and multi-resolution information, they have received more and more concern in both research and real ...
Efficient Vision-Based Calibration for Visual Surveillance Systems with Multiple PTZ Cameras
ICVS '06: Proceedings of the Fourth IEEE International Conference on Computer Vision SystemsSurveillance systems with multiple cameras have become increasingly important. The space relationship between cameras offers useful information for surveillance applications, such as object tracking or 3- D positioning. In this paper, we propose a ...






Comments