Abstract
This paper describes electromagnetical field simulation using the 3D-FDTD method for antenna designing on a CUDAcompatible GPU. We use the Split Perfectly Matched Layer as an absorbing boundary condition. As is well known, the 3D-FDTD method is a kind of stencil computation and is considered better at GPU implementation. In order to find the best blocking size for the target GPU architecture, we empirically explore a design space of blocking size. We also propose a kernel fusing method as one of the efficient optimization methods, which improves the total performance about 10% at the cost of a small increase in memory usage. As a result of evaluation, our implementation of the 3D-FDTD method on a GeForce GTX295 platform achieves about 130 times performance improvement compared to a simple CPU implementation, which is expected to be faster than an ideally parallelized CPU implementation using multicore and SIMD instructions.
- K. Yee, "Numerical solution of initial boundary value problems involving Maxwell's equations in isotropic media," IEEE Trans. on antennas and propagation, vol. 14, no. 3, pp. 302--307, 1966.Google Scholar
- Taflove, A. and Brodwin, ME, "Numerical solution of steady-state electromagnetic scattering problems using the time-dependent Maxwell's equations," IEEE Trans. on Microwave Theory and Techniques, vol. 23, no. 8, pp. 623--630, 1975.Google Scholar
Cross Ref
- S. Gedney, "An anisotropic perfectly matched layer-absorbing medium for the truncation of FDTD lattices," IEEE Trans. on Antennas and Propagation, vol. 44, no. 12, pp. 1630--1639, 1996.Google Scholar
Cross Ref
- J. Roden and S. Gedney, "Convolution PML (CPML): An efficient FDTD implementation of the CFS-PML for arbitrary media," Microwave and optical technology lett., vol. 27, no. 5, pp. 334--339, 2000.Google Scholar
- S. Cummer, "A simple, nearly perfectly matched layer for general electromagnetic media," Microwave and Wireless Components Lett., IEEE, vol. 13, no. 3, pp. 128--130, 2003.Google Scholar
Cross Ref
- J. Bérenger, "A perfectly matched layer for the absorption of electromagnetic waves," J. Comput. Phys., vol. 114, no. 2, pp. 185--200, 1994. Google Scholar
Digital Library
- T. Nagaoka and S. Watanabe, "A GPU-based calculation using the three-dimensional FDTD method for electromagnetic field analysis," in Annu. Int'l Conf. of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 327--330, IEEE, 2010.Google Scholar
- M. Inman, A. Elsherbeni, J. Maloney, and B. Baker, "GPU based FDTD solver with CPML boundaries," in IEEE Antennas and Propagation Society Int'l Symp., pp. 5255--5258, IEEE, 2007.Google Scholar
- L. Kang and K. Fanmin, "Parallel 3D Finite Difference Time Domain Simulations on Graphics Processors with Cuda," in Int'l Conf. on Computational Intelligence and Software Engineering, pp. 1--4, IEEE, 2009.Google Scholar
- T. Chu, J. Dai, D. Qian, W. Fang, and Y. Liu, "A Novel Scheme for High Performance Finite-Difference Time-Domain (FDTD) Computations Based on GPU," Algorithms and Architectures for Parallel Processing, pp. 441--453, 2010. Google Scholar
Digital Library
- C. Railton and J. McGeehan, "Analysis of microstrip discontinuities using the finite difference time domain technique," in IEEE MTT-S Int'l Microwave Symp. Digest, pp. 1009--1012, IEEE, 1989.Google Scholar
- J. Huiling and A. Hiroyuki, "Analysis of Computation Error in Antenna's Simulation by Using Non-Uniform Mesh FDTD," IEICE Trans. on communications, vol. 83, no. 7, pp. 1544--1553, 2000-07-25.Google Scholar
Recommendations
Speeding up NEC electromagnetic simulation using GPU technology for antenna design problems
Graphics processing units GPUs have increasingly become an attractive alternative to central processing unit CPU for some of compute-intensive and parallel tasks. This paper presents application of GPU technology to speed up numerical electromagnetic ...
Optimization and Implementation of LBM Benchmark on Multithreaded GPU
DSDE '10: Proceedings of the 2010 International Conference on Data Storage and Data EngineeringWith fast development of transistor technology, Graphic Processing Unit(GPU) is increasingly used in the non-graphics applications, and major GPU hardware vendors have introduced software stacks for their own GPUs, such as Brook+ for AMD GPU. Compared ...
Accelerating the discontinuous Galerkin method for seismic wave propagation simulations using the graphic processing unit (GPU)-single-GPU implementation
We have successfully ported an arbitrary high-order discontinuous Galerkin (ADER-DG) method for solving the three-dimensional elastic seismic wave equation on unstructured tetrahedral meshes to an Nvidia Tesla C2075 GPU using the Nvidia CUDA programming ...






Comments