Abstract
Time series data is a sequence of values measured at equally spaced time intervals. One of the most recent approaches to classifying such data is to find shapelets within a data set and to classify the data based on those shapelets. Shapelet-based classification uses the Euclidean distance to measure the dissimilarity between two time series sequences. Although the Euclidean distance is simple to compute, it has several disadvantages: it requires the data to be standardized, it requires the two data objects being compared to be of the same length, and it is sensitive to noise. To overcome these problems, the Mahalanobis distance can be used. In the proposed work, time series data is classified using time series shapelets together with the Mahalanobis distance, which measures the distance between a point and a distribution. It takes the correlations within the data set into account and does not depend on the scale of the measurements. Cost-complexity pruning is applied to the decision tree classifier. The Mahalanobis distance improves the accuracy of the algorithm, and cost-complexity pruning reduces the time complexity of testing and classifying unseen data. The experimental results show that the Mahalanobis distance yields higher accuracy and that, owing to decision tree pruning, the algorithm is faster than the existing method.
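To make the contrast between the two distance measures concrete, the following is a minimal sketch of how a shapelet-to-series distance could be computed with the Mahalanobis distance instead of the Euclidean distance. It is not the authors' implementation. The function names (`sliding_windows`, `inverse_covariance`, `mahalanobis`, `subsequence_distance`) are illustrative, and two choices are assumptions made here: the covariance matrix is estimated from all sliding windows of a training series, and a pseudo-inverse is used in case that matrix is singular.

```python
import numpy as np


def sliding_windows(series, m):
    """Return every contiguous subsequence of length m as a row of a 2-D array."""
    series = np.asarray(series, dtype=float)
    if len(series) < m:
        raise ValueError("series is shorter than the window length")
    return np.array([series[i:i + m] for i in range(len(series) - m + 1)])


def inverse_covariance(windows):
    """Estimate the m x m covariance of the windows and return its pseudo-inverse.

    Assumption: the pseudo-inverse stands in for the inverse when the
    estimated covariance matrix is singular or badly conditioned.
    """
    cov = np.atleast_2d(np.cov(windows, rowvar=False))
    return np.linalg.pinv(cov)


def euclidean(u, v):
    """Plain Euclidean distance between two equal-length vectors."""
    return float(np.linalg.norm(np.asarray(u, dtype=float) - np.asarray(v, dtype=float)))


def mahalanobis(u, v, inv_cov):
    """Mahalanobis distance sqrt((u - v)^T S^-1 (u - v)) for a given S^-1."""
    d = np.asarray(u, dtype=float) - np.asarray(v, dtype=float)
    # Clamp at zero to guard against tiny negative values from rounding.
    return float(np.sqrt(max(d @ inv_cov @ d, 0.0)))


def subsequence_distance(series, shapelet, inv_cov):
    """Smallest Mahalanobis distance between the shapelet and any window of the series."""
    windows = sliding_windows(series, len(shapelet))
    return min(mahalanobis(w, shapelet, inv_cov) for w in windows)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    train = rng.normal(size=200)          # synthetic stand-in for a training series
    inv_cov = inverse_covariance(sliding_windows(train, 4))
    shapelet = train[10:14]               # hypothetical shapelet candidate
    query = rng.normal(size=50)           # synthetic unseen series
    print("Mahalanobis:", subsequence_distance(query, shapelet, inv_cov))
    print("Euclidean:  ", min(euclidean(w, shapelet) for w in sliding_windows(query, 4)))
```

Under these assumptions, rescaling every series by the same constant leaves the Mahalanobis distance unchanged once the covariance is re-estimated on the rescaled data, whereas the Euclidean distance grows by that constant. This is the scale independence the abstract refers to.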
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Arathi, M., Govardhan, A. (2015). Effect of Mahalanobis Distance on Time Series Classification Using Shapelets. In: Satapathy, S., Govardhan, A., Raju, K., Mandal, J. (eds) Emerging ICT for Bridging the Future - Proceedings of the 49th Annual Convention of the Computer Society of India CSI Volume 2. Advances in Intelligent Systems and Computing, vol 338. Springer, Cham. https://doi.org/10.1007/978-3-319-13731-5_57
DOI: https://doi.org/10.1007/978-3-319-13731-5_57
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13730-8
Online ISBN: 978-3-319-13731-5
eBook Packages: Engineering, Engineering (R0)