Clustering stationary and non-stationary time series based on autocorrelation distance of hierarchical and k-means algorithms

(1) * Mohammad Alfan Alfian Riyadi Mail (Departement of Statistics, Institut Teknologi Sepuluh Nopember, Indonesia)
(2) Dian Sukma Pratiwi Mail (Departement of Actuarial Science, Bandung, Indonesia)
(3) Aldho Riski Irawan Mail (Departement of Statistics, Institut Teknologi Sepuluh Nopember, Indonesia)
(4) Kartika Fithriasari Mail (Departement of Statistics, Institut Teknologi Sepuluh Nopember, Indonesia)
*corresponding author

Abstract


Observing large dimension time series could be time-consuming. One identification and classification approach is a time series clustering. This study aimed to compare the accuracy of two algorithms, hierarchical cluster and K-Means cluster, using ACF’s distance for clustering stationary and non-stationary time series data. This research uses both simulation and real datasets. The simulation generates 7 stationary data models and another 7 of non-stationary data models. On the other hands, the real dataset is the daily temperature data in 34 cities in Indonesia. As a result, K-Means algorithm has the highest accuracy for both data models.

Keywords


Autocorrelation Distance; Hierarchical Algorithm; K-Means Algorithm; Non Stationary Time Series; Stationary Time Series

   

DOI

https://doi.org/10.26555/ijain.v3i3.98
      

Article metrics

Abstract views : 2524 | PDF views : 492

   

Cite

   

Full Text

Download

References


A.-H. Homaie-Shandizi, V. P. Nia, M. Gamache, and B. Agard, “Flight deck crew reserve: From data to forecasting,” Eng. Appl. Artif. Intell., vol. 50, pp. 106–114, Apr. 2016.

S. Makridakis, S. C. Wheelwright, and R. J. Hyndman, Forecasting methods and applications. John wiley & sons, 2008.

P. Manso, “M., A Package for Stationary Time Series Clustering,” Master thesis, Universidade da Coruna, 2013.

P. D’Urso and E. A. Maharaj, “Autocorrelation-based fuzzy clustering of time series,” Fuzzy Sets Syst., vol. 160, no. 24, pp. 3565–3589, 2009.

U. Habib, K. Hayat, and G. Zucker, “Complex building’s energy system operation patterns analysis using bag of words representation with hierarchical clustering,” Complex Adapt. Syst. Model., vol. 4, no. 1, pp. 1–20, 2016.

S. G. Khawaja, M. U. Akram, S. A. Khan, and A. Ajmal, “A novel multiprocessor architecture for k-means clustering algorithm based on network-on-chip,” in Multi-Topic Conference (INMIC), 2016 19th International, 2016, pp. 1–5.

D. Ismi, S. Panchoo, and M. Murinto, “K-means clustering based filter feature selection on high dimensional data,” Int. J. Adv. Intell. Informatics, vol. 2, no. 1, pp. 38–45, 2016.

A. Azhari and L. Hernandez, “Brainwaves feature classification by applying K-Means clustering using single-sensor EEG,” Int. J. Adv. Intell. Informatics, vol. 2, no. 3, pp. 167–173, 2016.

P. Novianti, D. Setyorini, and U. Rafflesia, “K-Means cluster analysis in earthquake epicenter clustering,” Int. J. Adv. Intell. Informatics, vol. 3, no. 2, pp. 81–89, 2017.

J. D. Cryer and K.-S. Chan, Time Series Analysis with applicaitons in R, 2nd ed. New York: Springer-Verlag New York, 2008.

S. Aghabozorgi, A. S. Shirkhorshidi, and T. Y. Wah, “Time-series clustering--A decade review,” Inf. Syst., vol. 53, pp. 16–38, 2015.

W. W. S. Wei and others, Time series analysis: univariate and multivariate methods. Pearson Addison Wesley, 2006.

S. Aminikhanghahi and D. J. Cook, “A survey of methods for time series change point detection,” Knowl. Inf. Syst., vol. 51, no. 2, pp. 339–367, 2017.

M. Längkvist, L. Karlsson, and A. Loutfi, “A review of unsupervised feature learning and deep learning for time-series modeling,” Pattern Recognit. Lett., vol. 42, pp. 11–24, 2014.

P. Montero and J. A. Vilar, “Tsclust: An r package for time series clustering,” J. Stat. Softw., vol. 62, no. 1, pp. 1–43, 2014.

J. Gunnarsson, “Portfolio-based segmentation and consumer behavior empirical evidence and methodological issues,” Stockholm School of Economics, 1999.

M. J. Norusis, PASW Statistics 18 Statistical Procedures Companion. Prentice Hall, 2010.

B. D. Fulcher and N. S. Jones, “Highly comparative feature-based time-series classification,” IEEE Trans. Knowl. Data Eng., vol. 26, no. 12, pp. 3026–3037, 2014.

R. A. Johnson, D. W. Wichern, and others, Applied multivariate statistical analysis, vol. 4. Prentice-Hall New Jersey, 2014.

W. K. Härdle and L. Simar, Applied Multivariate Statistical Analysis, 2nd ed. Berlin Heidelberg: Springer-Verlag Berlin Heidelberg, 2007.

R. R. Sokal and P. H. A. Sneath, Principles of Numerical Taxonomy, 1st ed. USA: W. H. Freeman and Company, 1973.

A. M. Paul, The cult of personality testing: How personality tests are leading us to miseducate our children, mismanage our companies, and misunderstand ourselves. Simon and Schuster, 2010.




Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

___________________________________________________________
International Journal of Advances in Intelligent Informatics
ISSN 2442-6571  (print) | 2548-3161 (online)
Organized by UAD and ASCEE Computer Society
Published by Universitas Ahmad Dahlan
W: http://ijain.org
E: info@ijain.org (paper handling issues)
   andri.pranolo.id@ieee.org (publication issues)

View IJAIN Stats

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0