TY - JOUR
T1 - Tensor decomposition and application in image classification with histogram of oriented gradients
AU - TRAN, Dat
AU - MA, Wanli
AU - VO, Tan
PY - 2015/10/1
Y1 - 2015/10/1
N2 - In the field of visual data mining, Histogram of Oriented Gradients (HOG) and its variants have been widely used. HOG is fast and extracts image features that are robust against many types of distortion, such as changes in scale, orientation, affine transformation and illumination, which has made it a popular choice for the task of detecting images in scenes for classification. However, HOG descriptors (features) are high-dimensional, usually on the order of several thousand per image, and this requires careful consideration to achieve accurate and timely categorization of objects within images. This work explores the possibility of processing HOG features as tensors, or multi-dimensional arrays. As a direct result, tensor decomposition techniques such as canonical polyadic (CP) decomposition can be applied to the high-order HOG tensors as a means of dimensionality reduction by filtering. This work focuses on the impact of this approach on both accuracy and efficiency, comparing it against the standard practice of processing HOG features. Validated on the Caltech-101 dataset, the results achieved with artificial neural network (ANN) classification indicate that the proposed method not only improves overall system performance but also achieves an edge in accuracy by a notable margin.
KW - image classification
KW - histogram of oriented gradients
KW - tensor decomposition
KW - ANN
KW - HOG
KW - Tensor
KW - CP decomposition
UR - http://www.scopus.com/inward/record.url?scp=84929948935&partnerID=8YFLogxK
UR - http://www.mendeley.com/research/tensor-decomposition-application-image-classification-histogram-oriented-gradients
U2 - 10.1016/j.neucom.2014.06.093
DO - 10.1016/j.neucom.2014.06.093
M3 - Article
SN - 0925-2312
VL - 165
SP - 38
EP - 45
JO - Neurocomputing
JF - Neurocomputing
IS - 5
ER -