Abstract
The core component of most modern trackers is a discriminative classifier, tasked with distinguishing between the target and the surrounding environment. To cope with natural image changes, this classifier is typically trained with translated and scaled sample patches. Such sets of samples are riddled with redundancies—any overlapping pixels are constrained to be the same. Based on this simple observation, we propose an analytic model for datasets of thousands of translated patches. By showing that the resulting data matrix is circulant, we can diagonalize it with the discrete Fourier transform, reducing both storage and computation by several orders of magnitude. Interestingly, for linear regression our formulation is equivalent to a correlation filter, used by some of the fastest competitive trackers. For kernel regression, however, we derive a new kernelized correlation filter (KCF), that unlike other kernel algorithms has the exact same complexity as its linear counterpart. Building on it, we also propose a fast multi-channel extension of linear correlation filters, via a linear kernel, which we call dual correlation filter (DCF). Both KCF and DCF outperform top-ranking trackers such as Struck or TLD on a 50 videos benchmark, despite running at hundreds of frames-per-second, and being implemented in a few lines of code (Algorithm 1). To encourage further developments, our tracking framework was made open-source.
- [1]
,
“Visual tracking: An experimental survey,” IEEE Trans. Pattern
Anal. Mach. Intell., vol. 36, no. 7, pp. 1442–
1468, Jul. 2014.Google Scholar
- [2]
, “Recent advances and trends in visual tracking: A review,
” Neurocomput., vol. 74, no. 18, pp. 3823
–3831, Nov. 2011.Google Scholar
- [3]
, “Real-time
compressive tracking,” in Proc. Eur. Conf. Comput. Vis., 2012, pp.
864–877.Google Scholar
- [4]
, “
Tracking-learning-detection,” IEEE Trans. Pattern Anal. Mach. Intell., vol.
34, no. 7, pp. 1409–1422, Jul.
2012.Google Scholar
- [5]
, “Robust
object tracking with online multiple instance learning,” IEEE Trans. Pattern Anal. Mach.
Intell., vol. 33, no. 8, pp. 1619–1632,
Aug. 2011.Google Scholar
- [6]
, “On-line random forests,” in Proc.
3rd IEEE Int. Conf. Comput. Vis. Workshop On-line Comput. Vis., 2009, pp. 1393
–1400.Google Scholar
- [7]
, “Struck:
Structured output tracking with kernels,” in Proc. Int. Conf. Comput. Vis.,
2011, pp. 263–270.Google Scholar
- [8]
, “Regularized
least-squares classification,” Nato Sci. Ser. Sub Ser. III, vol. 190, pp.
131–154, 2003.Google Scholar
- [9]
, “Visual object tracking using
adaptive correlation filters,” in Proc. Comput. Vis. Pattern Recognit., 2010,
pp. 2544–2550. Google Scholar
- [10]
, “
Average of synthetic exact filters,” in Proc. Comput. Vis. Pattern Recognit.,
2009, pp. 2105–2112.Google Scholar
- [11]
, “Online object tracking: A
benchmark,” in Proc. Comput. Vis. Pattern Recognit., 2013, pp.
2411–2418.Google Scholar
- [12]
, “Support vector tracking,” IEEE
Trans. Pattern Anal. Mach. Intell., vol. 26, no. 8, pp. 1064
–1072, Aug. 2004.Google Scholar
- [13]
, “
Semi-supervised on-line boosting for robust tracking,” in Proc. Eur. Conf. Comput. Vis.
, 2008, pp. 234–247.Google Scholar
- [14]
, “Online robust image
alignment via iterative convex optimization,” in Proc. Comput. Vis. Pattern Recognit.
, 2012, pp. 1808–1814.Google Scholar
- [15]
, “Distribution fields for tracking,” in Proc.
Comput. Vis. Pattern Recognit., 2012, pp. 1910–1917.
Google Scholar
- [16]
, “Beyond
sliding windows: Object localization by efficient subwindow search,” in Proc. Comput.
Vis. Pattern Recognit., 2008, pp. 1–8.Google Scholar
- [17]
, “Exploiting
spatial overlap to efficiently compute appearance distances between image windows,” in
Proc. Adv. Neural Inf. Processing Syst., 2011, pp. 2735–2743.
Google Scholar
- [18]
, “Combining
efficient object localization and image classification,” in Proc. Int. Conf. Comput.
Vis., 2009, pp. 237–244.Google Scholar
- [19]
, “Multiple kernels for object detection,
” in Proc. Int. Conf. Comput. Vis., 2009, pp. 606
–613.Google Scholar
- [20]
, “Object detection with
discriminatively trained part-based models,” IEEE Trans. Pattern Anal. Mach. Intell.
, vol. 32, no. 9, pp. 1627–1645,
Sep. 2010.Google Scholar
- [21]
,
Digital Image Processing. Upper Saddle River, NJ, USA: Prentice Hall, 2008.Google Scholar
- [22]
, “Fast feature pyramids for object
detection,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 36,
no. 8, pp. 1532–1545, Aug. 2014.
Google Scholar
- [23]
,
Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. Cambridge,
MA, USA: MIT Press, 2002.Google Scholar
- [24]
,
“Analysis of kernel distortion-invariant filters,” Proc. SPIE
, vol. 6764, p. 1, 2007.Google Scholar
- [25]
,
“Fast FFT-based distortion-invariant kernel filters for general object recognition,
” Proc. SPIE, vol. 7252, 2009, p. 1.Google Scholar
- [26]
, “Kernel based synthetic discriminant function for object
recognition,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Process.,
2006, pp. 1–5.Google Scholar
- [27]
, “Kernel correlation
filter based redundant class-dependence feature analysis (KCFA) on FRGC2.0 data,” in
Proc. 2nd Int. Conf. Anal. Model. Faces Gestures, 2005, pp. 32–43
.Google Scholar
- [28]
, “Minimum
average correlation energy filters,” Appl. Optics, vol. 26,
pp. 3633–3640, 1987.Google Scholar
- [29]
, “Exploiting the circulant structure of
tracking-by-detection with kernels,” in Proc. Eur. Conf. Comput. Vis., vol.
7575, pp. 702–715, 2012.Google Scholar
- [30]
“Event retrieval in large video
collections with circulant temporal encoding,” in Proc. Comput. Vis. Pattern Recognit.
, 2013, pp. 2459–2466.Google Scholar
- [31]
, “Beyond hard negative mining: Efficient
detector learning via block-circulant decomposition,” in Proc. Int. Conf. Comput. Vis.
, 2013, pp. 2760–2767.Google Scholar
- [32]
, “
Multi-channel correlation filters,” in Proc. Int. Conf. Comput. Vis., 2013
, pp. 3072–3079.Google Scholar
- [33]
, “
Correlation filters for object alignment,” in Proc. Comput. Vis. Pattern Recognit.
, 2013, pp. 2291–2298.Google Scholar
- [34]
, Toeplitz and Circulant Matrices: A Review. Boston, MA, USA: Now
Publishers, 2006.Google Scholar
- [35]
, Circulant Matrices. Providence, RI, USA: Amer. Math. Society, 1994.
Google Scholar
- [36]
, “Efficient additive kernels via explicit feature maps,” IEEE
Trans. Pattern Anal. Mach. Intell., vol. 34, no. 3, pp. 480
–492, Mar. 2011.Google Scholar
Index Terms
High-Speed Tracking with Kernelized Correlation Filters
Recommendations
Robust visual tracking via co-trained Kernelized correlation filters
We train a pool of discriminative classifiers jointly in a closed-form fashion for visual tracking.We propose analytic model for datasets of thousands of translated patches.It is able to outperform the baseline by a larger margin. Recent advances in ...
Coupled-layer based visual tracking via adaptive kernelized correlation filters
Part-based visual model is particularly useful when the target appearance undergoes partial occlusion or deformation. The existing reliable patches tracking (RPT) method has achieved better result by identifying and exploiting the reliable patches that ...
Visual tracking tracker via object proposals and co-trained kernelized correlation filters
AbstractVisual tracking is a challenging task in the field of computer vision with wide applications in intelligent and surveillance systems. Recently, correlation trackers have shown great achievement in visual tracking due to its high efficiency. ...




Comments