• English
    • svenska
  • svenska 
    • English
    • svenska
  • Logga in
Redigera dokument 
  •   Startsida
  • Student essays / Studentuppsatser
  • Department of Computer Science and Engineering / Institutionen för data- och informationsteknik
  • Masteruppsatser
  • Redigera dokument
  •   Startsida
  • Student essays / Studentuppsatser
  • Department of Computer Science and Engineering / Institutionen för data- och informationsteknik
  • Masteruppsatser
  • Redigera dokument
JavaScript is disabled for your browser. Some features of this site may not work without it.

Compressed Machine Learning on Time Series Data

Efficient compression through clustering using candidate selection and the application of machine learning on compressed data

Sammanfattning
The extent of time related data across many fields has led to substantial interest in the analysis of time series. This interest meets growing challenges to store and process data. While the data is collected at an exponential rate, advancements in processing units are slowing down. Therefore, active research is practiced to find more efficient means of storing and processing data. This can be especially difficult for time series due to their various shapes and scales. In this thesis, we present two variants for optimising a Greedy Clustering algorithm used for lossy time series compression. This study investigates, whether the efficient but lossy compression sufficiently preserves the characteristics of the time series to allow time series prediction and anomaly detection. We suggest two variants for a performance optimization, Greedy SF and Greedy SAX. These algorithms are based on novel lookup methods for cluster candidate selection based on statistical features of time series and extracted SAX substrings. Furthermore, we enabled the clustering to allow processing time series with different value ranges, which allows the compression of time series with various scales. To validate the endto- end pipeline including compression and prediction, a performance evaluation is applied. To further analyse the applicability, a comprehensive benchmark against a pipeline with an autoencoder for compression and a stacked LSTM for prediction is performed.
Examinationsnivå
Student essay
URL:
http://hdl.handle.net/2077/65591
Samlingar
  • Masteruppsatser
Fil(er)
Master thesis (5.578Mb)
Datum
2020-07-08
Författare
Finger, Felix
Gocht, Nathalie
Nyckelord
time series clustering
large scale data
machine learning
prediction
anomaly detection
compression
Serie/rapportnr.
CSE 20-13
Språk
eng
Metadata
Visa fullständig post

DSpace software copyright © 2002-2016  DuraSpace
gup@ub.gu.se | Teknisk hjälp
Theme by 
Atmire NV
 

 

Visa

VisaSamlingarI datumordningFörfattareTitlarNyckelordDenna samlingI datumordningFörfattareTitlarNyckelord

Mitt konto

Logga inRegistrera dig

DSpace software copyright © 2002-2016  DuraSpace
gup@ub.gu.se | Teknisk hjälp
Theme by 
Atmire NV