TY - GEN
T1 - Efficient video object classifier using locality-enhanced support vector machines
AU - Jan, Tony
AU - Tsai, Po Hsiang
AU - Piccardi, Massimo
AU - Hintz, Thomas
PY - 2004
Y1 - 2004
N2 - In multimedia applications such as MPEG-4, an efficient model is required to encode and classify video objects such as human, car and building. Recently, Support Vector Machine (SVM) has been shown to be a good classifier; however, its large computational requirement prohibited its use in real time video processing applications. In this paper, a model is proposed that enables use of SVM in video applications. This paper aims to merge multi-scale based selective encoding/classification technique and locality-enhanced Support Vector Machine (SVM). The proposed model allows selected image scales (of interest) to be encoded and classified more accurately by complex classifier such as SVM, whilst other image scales of less significance to be encoded and classified by simpler encoder/classifier. Image scales of interest are readily selected from multi-scale image processing paradigm. SVM is used to encode visual object information of significant image scale only; hence its use is efficient. Experiment with MPEG-4 video object encoding and classification shows that the performance of the proposed model is comparable with other models, however with significantly reduced computational requirements.
AB - In multimedia applications such as MPEG-4, an efficient model is required to encode and classify video objects such as human, car and building. Recently, Support Vector Machine (SVM) has been shown to be a good classifier; however, its large computational requirement prohibited its use in real time video processing applications. In this paper, a model is proposed that enables use of SVM in video applications. This paper aims to merge multi-scale based selective encoding/classification technique and locality-enhanced Support Vector Machine (SVM). The proposed model allows selected image scales (of interest) to be encoded and classified more accurately by complex classifier such as SVM, whilst other image scales of less significance to be encoded and classified by simpler encoder/classifier. Image scales of interest are readily selected from multi-scale image processing paradigm. SVM is used to encode visual object information of significant image scale only; hence its use is efficient. Experiment with MPEG-4 video object encoding and classification shows that the performance of the proposed model is comparable with other models, however with significantly reduced computational requirements.
KW - Locality-enhanced support vector machines
KW - Multiscale image processing
KW - Support vector machines
KW - Video object encoder
UR - http://www.scopus.com/inward/record.url?scp=15744377169&partnerID=8YFLogxK
U2 - 10.1109/ICSMC.2004.1401401
DO - 10.1109/ICSMC.2004.1401401
M3 - Conference contribution
AN - SCOPUS:15744377169
SN - 0780385667
T3 - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
SP - 6373
EP - 6377
BT - 2004 IEEE International Conference on Systems, Man and Cybernetics, SMC 2004
T2 - 2004 IEEE International Conference on Systems, Man and Cybernetics, SMC 2004
Y2 - 10 October 2004 through 13 October 2004
ER -