Times series discretization using evolutionary programming. (English)
Batyrshin, Ildar (ed.) et al., Advances in soft computing. 10th Mexican international conference on artificial intelligence, MICAI 2011, Puebla, Mexico, November 26‒December 4, 2011. Proceedings, Part II. Berlin: Springer (ISBN 978-3-642-25329-4/pbk). Lecture Notes in Computer Science 7095. Lecture Notes in Artificial Intelligence, 225-234 (2011).
Summary: In this work, we present a novel algorithm for time series discretization. Our approach includes the optimization of the word size and the alphabet as one parameter. Using evolutionary programming, the search for a good discretization scheme is guided by a cost function which considers three criteria: the entropy regarding the classification, the complexity measured as the number of different strings needed to represent the complete data set, and the compression rate assessed as the length of the discrete representation. Our proposal is compared with some of the most representative algorithms found in the specialized literature, tested in a well-known benchmark of time series data sets. The statistical analysis of the classification accuracy shows that the overall performance of our algorithm is highly competitive.