Continually evaluating similarity-based pattern queries on a streaming time series
- 3 June 2002
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 370-381
- https://doi.org/10.1145/564691.564734
Abstract
In many applications, local or remote sensors send in streams of data, and the system needs to monitor the streams to discover relevant events/patterns and deliver instant reaction correspondingly. An important scenario is that the incoming stream is a continually appended time series, and the patterns are time series in a database. At each time when a new value arrives (called a time position), the system needs to find, from the database, the nearest or near neighbors of the incoming time series up to the time position. This paper attacks the problem by using Fast Fourier Transform (FFT) to efficiently find the cross correlations of time series, which yields, in a batch mode, the nearest and near neighbors of the incoming time series at many time positions. To take advantage of this batch processing in achieving fast response time, this paper uses prediction methods to predict future values. FFT is used to compute the cross correlations of the predicted series (with the values that have already arrived) and the database patterns, and to obtain predicted distances between the incoming time series at many future time positions and the database patterns. When the actual data value arrives, the prediction error together with the predicted distances is used to filter out patterns that are not possible to be the nearest or near neighbors, which provides fast responses. Experiments show that with reasonable prediction errors, the performance gain is significant.Keywords
This publication has 15 references indexed in Scilit:
- Continuous queries over data streamsACM SIGMOD Record, 2001
- Financial time series prediction using least squares support vector machines within the evidence frameworkIEEE Transactions on Neural Networks, 2001
- Locally adaptive dimensionality reduction for indexing large time series databasesPublished by Association for Computing Machinery (ACM) ,2001
- Querying time series data based on similarityIEEE Transactions on Knowledge and Data Engineering, 2000
- A simple randomized algorithm for sequential prediction of ergodic time seriesIEEE Transactions on Information Theory, 1999
- Adaptive query processing for time-series dataPublished by Association for Computing Machinery (ACM) ,1999
- Fast time-series searching with scaling and shiftingPublished by Association for Computing Machinery (ACM) ,1999
- Continual queries for Internet scale event-driven information deliveryIEEE Transactions on Knowledge and Data Engineering, 1999
- High-dimensional index structures database support for next decade's applications (tutorial)Published by Association for Computing Machinery (ACM) ,1998
- Some bilinear forms whose multiplicative complexity depends on the field of constantsTheory of Computing Systems, 1976