Article

A dimensionality reduction technique for efficient similarity analysis of time series databases

Authors:
Vasileios Megalooikonomou

Temple University, Philadelphia, PA

Temple University, Philadelphia, PA
View Profile

,
Guo Li

Temple University, Philadelphia, PA

Temple University, Philadelphia, PA
View Profile

,
Qiang Wang

Temple University, Philadelphia, PA

Temple University, Philadelphia, PA
View Profile

CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge managementNovember 2004Pages 160–161https://doi.org/10.1145/1031171.1031203

Published:13 November 2004Publication History

CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge management

Pages 160–161

ABSTRACT

Efficiently searching for similarities among time series and discovering interesting patterns is an important and non-trivial problem with applications in many domains. The high dimensionality of the data makes the analysis very challenging. To solve this problem, many dimensionality reduction methods have been proposed. PCA (Piecewise Constant Approximation) and its variant have been shown efficient in time series indexing and similarity retrieval. However, in certain applications, too many false alarms introduced by the approximation may reduce the overall performance dramatically. In this paper, we introduce a new piecewise dimensionality reduction technique that is based on Vector Quantization. The new technique, PVQA (Piecewise Vector Quantized Approximation), partitions each sequence into equi-length segments and uses vector quantization to represent each segment by the closest (based on a distance metric) codeword from a codebook of key-sequences. The efficiency of calculations is improved due to the significantly lower dimensionality of the new representation. We demonstrate the utility and efficiency of the proposed technique on real and simulated datasets. By exploiting prior knowledge about the data, the proposed technique generally outperforms PCA and its variants in similarity searches.

References

Gersho, A. & Gray R. M. (1992). Vector Quantization and Signal Compression. Kluwer Academic, Boston. Google ScholarDigital Library
Keogh, E., Chakrabarti, K., Pazzani, M. & Mehrotra, S. (2000). "Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases", Knowledge and Information Systems 3(3): 263--286.Google ScholarCross Ref
Lin, J., Keogh, E., Patel, P. & Lonardi, S. (2002). "Finding motifs in time series", 2nd Workshop on Temporal Data Mining at the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. July 23-26. Edmonton, Alberta, Canada.Google Scholar
Lloyd, S. P. (1982). "Least squares quantization in PCM", IEEE Transactions on Information Theory, IT(28), pp. 127--135.Google Scholar
Stanford Genomic Resources. http://genome-www.stanford.edu/nci60Google Scholar
UCI KDD Archive. http://kdd.ics.uci.eduGoogle Scholar
Yi, B-K & Faloutsos, C. (2000). "Fast Time Sequence Indexing for Arbitrary Lp Norms", in Proceedings of the VLDB, Cairo, Egypt, pp. 385--394. Google ScholarDigital Library

Index Terms

A dimensionality reduction technique for efficient similarity analysis of time series databases
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing
    2. Retrieval tasks and goals
      1. Clustering and classification
  2. Information systems applications
    1. Data mining
      1. Clustering

Recommendations

A dimensionality reduction technique for efficient time series similarity analysis

We propose a dimensionality reduction technique for time series analysis that significantly improves the efficiency and accuracy of similarity searches. In contrast to piecewise constant approximation (PCA) techniques that approximate each time series ...
Read More
Dimensionality reduction-based spoken emotion recognition

To improve effectively the performance on spoken emotion recognition, it is needed to perform nonlinear dimensionality reduction for speech data lying on a nonlinear manifold embedded in a high-dimensional acoustic space. In this paper, a new supervised ...
Read More
Dimensionality Reduction and Similarity Computation by Inner-Product Approximations

As databases increasingly integrate different types of information such as multimedia, spatial, time-series, and scientific data, it becomes necessary to support efficient retrieval of multidimensional data. Both the dimensionality and the amount of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge management
November 2004
678 pages
ISBN:1581138741
DOI:10.1145/1031171
General Chair:
David Grossman
Illinois Institute of Technology
,
Program Chairs:
Luis Gravano
Columbia University
,
ChengXiang Zhai
University of Illinois at Urbana-Champaign
,
Otthein Herzog
University of Bremen, Germany
,
David A. Evans
Clairvoyance Corporation
Copyright © 2004 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 November 2004
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
data mining
dimensionality reduction
time series
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate1,861of8,427submissions,22%
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 24
  Total Citations
  View Citations
- 803
  Total Downloads
- Downloads (Last 12 months)7
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

A dimensionality reduction technique for efficient similarity analysis of time series databases

CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge management

ABSTRACT

References

Cited By

Index Terms

Recommendations

A dimensionality reduction technique for efficient time series similarity analysis

Dimensionality reduction-based spoken emotion recognition

Dimensionality Reduction and Similarity Computation by Inner-Product Approximations

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

A dimensionality reduction technique for efficient similarity analysis of time series databases

CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge management

ABSTRACT

References

Cited By

Index Terms

Recommendations

A dimensionality reduction technique for efficient time series similarity analysis

Dimensionality reduction-based spoken emotion recognition

Dimensionality Reduction and Similarity Computation by Inner-Product Approximations

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media