Time-Series Classification with COTE: The Collective of Transformation-Based Ensembles
Open Access
- 26 March 2015
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering
- Vol. 27 (9), 2522-2535
- https://doi.org/10.1109/tkde.2015.2416723
Abstract
Recently, two ideas have been explored that lead to more accurate algorithms for time-series classification (TSC). First, it has been shown that the simplest way to gain improvement on TSC problems is to transform into an alternative data space where discriminatory features are more easily detected. Second, it was demonstrated that with a single data representation, improved accuracy can be achieved through simple ensemble schemes. We combine these two principles to test the hypothesis that forming a collective of ensembles of classifiers on different data transformations improves the accuracy of time-series classification. The collective contains classifiers constructed in the time, frequency, change, and shapelet transformation domains. For the time domain, we use a set of elastic distance measures. For the other domains, we use a range of standard classifiers. Through extensive experimentation on 72 datasets, including all of the 46 UCR datasets, we demonstrate that the simple collective formed by including all classifiers in one ensemble is significantly more accurate than any of its components and any other previously published TSC algorithm. We investigate alternative hierarchical collective structures and demonstrate the utility of the approach on a new problem involving classifying Caenorhabditis elegans mutant types.Keywords
Funding Information
- Engineering and Physical Sciences Research Council (EPSRC) (EP/M015087/1)
This publication has 31 references indexed in Scilit:
- Learning time-series shapeletsPublished by Association for Computing Machinery (ACM) ,2014
- Time series classification with ensembles of elastic distance measuresData Mining and Knowledge Discovery, 2014
- A database of Caenorhabditis elegans behavioral phenotypesNature Methods, 2013
- Classification of time series by shapelet transformationData Mining and Knowledge Discovery, 2013
- A dictionary of behavioral motifs reveals clusters of genes affecting Caenorhabditis elegans locomotionProceedings of the National Academy of Sciences, 2012
- Transformation Based Ensembles for Time Series ClassificationPublished by Society for Industrial & Applied Mathematics (SIAM) ,2012
- Logical-shapeletsPublished by Association for Computing Machinery (ACM) ,2011
- Time series clustering and classification by the autoregressive metricComputational Statistics & Data Analysis, 2008
- Clustering time series from ARMA models with clipped dataPublished by Association for Computing Machinery (ACM) ,2004
- Distance measures for effective clustering of ARIMA time-seriesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002