×

A fuzzy logic approach for detection of video shot boundaries. (English) Zbl 1100.68611

Summary: Video temporal segmentation is normally the first and important step for content-based video applications. Many features including the pixel difference, colour histogram, motion, and edge information etc. have been widely used and reported in the literature to detect shot cuts inside videos. Although existing research on shot cut detection is active and extensive, it still remains a challenge to achieve accurate detection of all types of shot boundaries with one single algorithm. In this paper, we propose a fuzzy logic approach to integrate hybrid features for detecting shot boundaries inside general videos. The fuzzy logic approach contains two processing modes, where one is dedicated to detection of abrupt shot cuts including those short dissolved shots, and the other for detection of gradual shot cuts. These two modes are unified by a mode-selector to decide which mode the scheme should work on in order to achieve the best possible detection performances. By using the publicly available test data set from Carleton University, extensive experiments were carried out and the test results illustrate that the proposed algorithm outperforms the representative existing algorithms in terms of the precision and recall rates.

MSC:

68T10 Pattern recognition, speech recognition
68U10 Computing methodologies for image processing
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Jiang, J.; Weng, Y., Video extraction for fast content access to MPEG compressed videos, IEEE Trans. Circuits, Systems Video Technol., 14, 5, 595-605 (2004)
[2] Porter, S.; Mirmehdi, M.; Thosmas, B., Temporal video segmentation and classification of edit effects, Image Vision Comput., 21, 1098-1106 (2003)
[3] Lo, C.; Wang, S., Video segmentation using a histogram-based fuzzy c-means clustering algorithm, Comput. Standards & Interfaces, 23, 429-438 (2001)
[4] Jadon, R. S.; Chaudhury, S.; Biswas, K. K., A fuzzy theoretic approach for video segmentation using syntactic features, Pattern Recognition Lett., 22, 1359-1369 (2001) · Zbl 0984.68680
[5] Hanjalic, A., Shot-boundary detection: unraveled and resolved?, IEEE Trans. Circuits Syst. Video Technol., 12, 2 (2002)
[6] Huang, C.; Liao, B., A robust scene-change detection method for video segmentation, IEEE Trans. Circuits Syst. Video Technol., 11, 12 (2001)
[7] Guimaraes, S. J.F.; Couprie, M.; Araujo, A. D.A.; Leite, N. J., Video segmentation based on 2D image analysis, Pattern Recognition Lett., 24, 947-957 (2003)
[8] Liu, T. Y.; Lo, K. T.; Zhang, X. D.; Feng, J., A new cut detection algorithm with constant false-alarm ratio for video segmentation, J. Vis. Commun. Image R., 15, 132-144 (2004)
[9] A. Whitehead, P. Bose, R. Laganiere, Feature based cut detection with automatic threshold selection, Proceedings, Content Based Image and Video Retrieval CIVR, 2004.; A. Whitehead, P. Bose, R. Laganiere, Feature based cut detection with automatic threshold selection, Proceedings, Content Based Image and Video Retrieval CIVR, 2004.
[10] Gunsel, B.; Ferman, A. M.; Tekalp, A. M., Temporal video segmentation using unsupervised clustering and semantic object tracking, J. Electron. Imaging, 7, 592-603 (1998)
[11] Bescos, J., Real-time shot change detection over online MPEG-2 video, IEEE Trans. Circuits Syst. Video Technol., 14, 4 (2004)
[12] A. Hanjalic, H.J. Zhang, Optimal shot boundary detection based on robust statistical models, Proceedings of the IEEE International Conference on Multimedia Computing and System, vol. 2, 1999, pp. 710-714.; A. Hanjalic, H.J. Zhang, Optimal shot boundary detection based on robust statistical models, Proceedings of the IEEE International Conference on Multimedia Computing and System, vol. 2, 1999, pp. 710-714.
[13] H. Fang, J. Jiang, Predictive-based cross line for fast motion estimation in MPEG-4 videos, Proceedings of IS&T/SPIE’s Symposium on Electronic Imaging: Science & Technology, San Jose, USA, January 2004, pp. 175-183.; H. Fang, J. Jiang, Predictive-based cross line for fast motion estimation in MPEG-4 videos, Proceedings of IS&T/SPIE’s Symposium on Electronic Imaging: Science & Technology, San Jose, USA, January 2004, pp. 175-183.
[14] R. Jain, R. Kasturi, B.G. Schunck, Machine Vision, McGraw-Hill, New York, ISBN:0-07-032018-7.; R. Jain, R. Kasturi, B.G. Schunck, Machine Vision, McGraw-Hill, New York, ISBN:0-07-032018-7.
[15] Ross, Timothy J., Fuzzy Logic with Engineering Applications (2004), Wiley: Wiley New York · Zbl 1060.93007
[16] Quinlan, J. R., C45: Programs for Machine Learning (1993), Morgan Kaufmann: Morgan Kaufmann San Mateo, CA
[17] A. Hampapur, R. Jain, T. Weymouth, Digital video segmentation, Proceedings of the Second ACM International Conference on Multimedia, San Francisco, CA, USA, 1994, pp. 357-364.; A. Hampapur, R. Jain, T. Weymouth, Digital video segmentation, Proceedings of the Second ACM International Conference on Multimedia, San Francisco, CA, USA, 1994, pp. 357-364.
[18] J. Canny, A variational approach to edge detection, Proceedings of the AAAI, Washington, 1983, pp. 54-58.; J. Canny, A variational approach to edge detection, Proceedings of the AAAI, Washington, 1983, pp. 54-58.
[19] S. Pfeiffer, R. Lienhart, G. Kuhne, W. Effelsberg, The MoCA Project-Movie Content Analysis Research at the University of Mannheim, Informatik’98, 1998, pp. 329-338.; S. Pfeiffer, R. Lienhart, G. Kuhne, W. Effelsberg, The MoCA Project-Movie Content Analysis Research at the University of Mannheim, Informatik’98, 1998, pp. 329-338.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.