Abstract
The audio Beamforming (BF) technique utilizes microphone arrays to extract acoustic sources recorded in a noisy environment. In this article, we propose a new approach for rapid development of multicore BF systems. Research on literature reveals that the majority of such experimental and commercial audio systems are based on desktop PCs, due to their high-level programming support and potential of rapid system development. However, these approaches introduce performance bottlenecks, excessive power consumption, and increased overall cost. Systems based on DSPs require very low power, but their performance is still limited. Custom hardware solutions alleviate the aforementioned drawbacks, however, designers primarily focus on performance optimization without providing a high-level interface for system control and test. In order to address the aforementioned problems, we propose a custom platform-independent architecture for reconfigurable audio BF systems. To evaluate our proposal, we implement our architecture as a heterogeneous multicore reconfigurable processor and map it onto FPGAs. Our approach combines the software flexibility of General-Purpose Processors (GPPs) with the computational power of multicore platforms. In order to evaluate our system we compare it against a BF software application implemented to a low-power Atom 330, a middle-ranged Core2 Duo, and a high-end Core i3. Experimental results suggest that our proposed solution can extract up to 16 audio sources in real time under a 16-microphone setup. In contrast, under the same setup, the Atom 330 cannot extract any audio sources in real time, while the Core2 Duo and the Core i3 can process in real time only up to 4 and 6 sources respectively. Furthermore, a Virtex4-based BF system consumes more than an order less energy compared to the aforementioned GPP-based approaches.
- Acoustic Camera. 2013. http://www.acoustic-camera.com.Google Scholar
- Analog Devices Inc. 2004. SHARC processor adsp-21262. http://www.analog.com/en/evaluation/21262-ezlite/eb.html.Google Scholar
- Beracoechea, J., Torres-Guijarro, S., Garcia, L., and Casajus-Quiros, F. 2006. On building immersive audio applications using robust adaptive beamforming and joint audio-video source localization. EURASIP J. Appl. Signal Process. 2006, 196. Google Scholar
Digital Library
- Berkhout, A., De Vries, D., and Vogel, P. 1993. Acoustic control by wave field synthesis. J. Acoust. Soc. Amer. 93, 2764--2778.Google Scholar
Cross Ref
- Buchner, H., Spors, S., Kellermann, W., and Rabenstein, R. 2002. Full-duplex communication systems using loudspeaker arrays and microphone arrays. In Proceedings of the IEEE International Conference on Multimedia and Expo. 509--512.Google Scholar
- Cedrick, R. 2005. Documentation of the microphone array mark III. http://www.nist.gov/smartspace/downloads/Microphone_Array_Mark_III.pdf.Google Scholar
- Cutler, R., Rui, Y., Gupta, A., Cadiz, J., Tashev, I., He, L., Colburn, A., Zhang Z., Liu, Z., and Silverberg, S. 2002. Distributed meetings: A meeting capture and broadcasting system. In Proceedings of the International Conference on Multimedia. 503--512. Google Scholar
Digital Library
- Farina, A., Glasgal, R., Armelloni, E., and Torger, A. 2001. Ambiophonic principles for the recording and reproduction of surround. In Proceedings of the 19th AES International Conference.Google Scholar
- Fiala, M., Green, D., and Roth, G. 2004. A panoramic video and acoustic beamforming sensor for videoconferencing. In Proceedings of the IEEE International Conference on Haptic, Audio and Visual Environments and their Applications. 47--52.Google Scholar
- Fillinger, A., Diduch, L., Hamchi, I., Degre, S., and Stanford, V. 2007. NIST smart data flow system II: Speaker localization. In Proceedings of the 6th International Conference on Information Processing in Sensor Networks. 549--550. Google Scholar
Digital Library
- Theile, G. 2001. Multichannel natural music recording based on pychoacoustics principles. In Proceedings of the 19th AES International Conference.Google Scholar
- Intel Corporation. 2013. http://ark.intel.com/products/35641.Google Scholar
- Intel Corporation. 2013. http://ark.intel.com/products/36500.Google Scholar
- Intel Corporation. 2013. http://ark.intel.com/products/49020.Google Scholar
- Kapralos, B., Jenkin, M., and Milios, E. 2003. Audio-visual localization of multiple speakers in a video teleconferencing setting. Int. J. Imaging Syst. Technol. 13, 1, 95--105.Google Scholar
Cross Ref
- Kyriakakis, C. 1998. Fundamental and technological limitations of immersive audio systems. Proc. IEEE 86, 941--951.Google Scholar
Cross Ref
- Mei, G., Xu, R., Lao, D., and Kwan, C. 2006. Real-time speaker verification with a microphone array. In Proceedings of the International Conference on Pervasive Systems and Computing.Google Scholar
- Mihov, S.G., Gleghorn, T., and Tashev, I. 2008. Enhanced sound capture system for small devices. In Proceedings of the International Conference of Information, Communication and Energy Systems.Google Scholar
- Mit Csail: Mit Project Oxygen. 2004. http://oxygen.lcs.mit.edu/.Google Scholar
- Mouchtaris, A., Reveliotis, P., and Kyriakakis, C. 2000. Inverse of filter design for immersive audio rendering over loudspeakers. IEEE Trans. Multimedia 2, 77--87. Google Scholar
Digital Library
- Nilsen, C.-I. C. and Hafizovic, I. 2009. Digital beamforming using a GPU. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. 609--612. Google Scholar
Digital Library
- Polycom Inc. 2009. Polycom CX5000 unified conference station. http://www.polycom.com/products-services/products-for-microsoft/lync-optimized/cx5000-unified-conference-station.html.Google Scholar
- Sallberg, B., Swartling, M., Grbic, N., and Ingvar C. 2006. Real-time implementation of a blind beamformer for subband speech enhancement using kurtosis maximization. In Proceedings of the International Workshop on Acoustic Echo and Noise Control. 485--489.Google Scholar
- Snow, W. 1955. Basic principles of stereophonic sound. IRE Trans. Audio 2, 42--53.Google Scholar
Cross Ref
- Squarehead Technology. 2013. Audio scope zoom audio. http://www.sqhead.com/.Google Scholar
- Teutsch, H., Spors, S., Herbordt, W., Kellermann, W., and Rabenstein, R. 2003. An integrated real-time system for immersive audio applications. In Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 67--70.Google Scholar
- Texas Instruments Inc. 2002. TMS320C62x/C67x power consumption summary. http://www.ti.com/lit/an/spra486c/spra486c.pdf.Google Scholar
- Theodoropoulos, D., Kuzmanov, G., and Gaydadjiev, G. 2009. A reconfigurable beamformer for audio applications. In Proceedings of the IEEE Symposium on Application Specific Processors. 80--87.Google Scholar
- Theodoropoulos, D., Kuzmanov, G., and Gaydadjiev, G. 2010. Minimalistic architecture for reconfigurable audio beamforming. In Proceedings of the International Conference on Field-Programmable Technology. 503--506.Google Scholar
- Veen, B. V. and Buckley, K. 1988. Beamforming: A versatile approach to spatial filtering. IEEE ASSP Mag. 5, 4--24.Google Scholar
Cross Ref
- Wall, K. and Lockwood, G. R. 2005. Modern implementation of a realtime 3d beamformer and scan converter system. In Proceedings of the IEEE Ultrasonics Symposium. 1400--1403.Google Scholar
- Weinstein, E., Steele, K., Agarwal, A., and Glass, J. 2004. LOUD: A 1020-node modular microphone array and beamformer for intelligent computing spaces. MIT/LCS Tech. memo MIT-LCS-TM-642.Google Scholar
- Xilinx Inc. 2010a. The simple microblaze microcontroller concept. http://xilinx.eetrend.com/files-eetrend-xilinx/article/201105/1839-3354-xapp1141.pdf.Google Scholar
- Xilinx Inc. 2010b. XPower estimator user guide. http://www.xilinx.com/support/documentation/sw_manuals/xilinx12_4/ise_n_power_user_guide.htm.Google Scholar
- Xilinx Inc. 2007. Implementing a real-time beamformer on an fpga platform. XCell J. 36--40. http://www.techonline.com/electrical-engineers/education-training/tech-papers/4137944/Implementing-a-Real-Time-Beamformer-on-an-FPGA-Platform.Google Scholar
- Yermeche, Z., Sallberg, B., Grbic, N., and Claesson, I. 2007. Real-time implementation of a subband beamforming algorithm for dual microphone speech enhancement. In Proceedings of the IEEE International Symposium on Circuits and Systems. 353--356.Google Scholar
- Yiu, C. K., Ho, C. H., Lu, Y., Shi, X., and Luk, W. 2008. Reconfigurable acceleration of microphone array algorithms for speech enhancement. In Proceedings of the International Conference on Application-Specific Systems, Architectures and Processors. 203--208. Google Scholar
Digital Library
Index Terms
Custom architecture for multicore audio beamforming systems
Recommendations
Wave field synthesis for 3D audio: architectural prospectives
CF '09: Proceedings of the 6th ACM conference on Computing frontiersIn this paper, we compare the architectural perspectives of the Wave Field Synthesis (WFS) 3D-audio algorithm mapped on three different platforms: a General Purpose Processor (GPP), a Graphics Processor Unit (GPU) and a Field Programmable Gate Array (...
Implementation and Performance Analysis of SEAL Encryption on FPGA, GPU and Multi-core Processors
FCCM '11: Proceedings of the 2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing MachinesAccelerators, such as field programmable gate arrays (FPGAs) and graphics processing units (GPUs), are special purpose processors designed to speed up compute-intensive sections of applications. FPGAs are highly customizable, while GPUs provide massive ...
A Many-Core Co-Processor for Embedded Parallel Computing on FPGA
DSD '15: Proceedings of the 2015 Euromicro Conference on Digital System DesignSingle processor architectures are unable to provide the required performance of high performance embedded systems. Parallel processing based on general-purpose processors can achieve these performances with a considerable increase of required ...






Comments