dc.contributor.author | Finger, Felix | |
dc.contributor.author | Gocht, Nathalie | |
dc.date.accessioned | 2020-07-08T11:43:47Z | |
dc.date.available | 2020-07-08T11:43:47Z | |
dc.date.issued | 2020-07-08 | |
dc.identifier.uri | http://hdl.handle.net/2077/65591 | |
dc.description.abstract | The extent of time related data across many fields has led to substantial interest
in the analysis of time series. This interest meets growing challenges to store and
process data. While the data is collected at an exponential rate, advancements in
processing units are slowing down. Therefore, active research is practiced to find
more efficient means of storing and processing data. This can be especially difficult
for time series due to their various shapes and scales.
In this thesis, we present two variants for optimising a Greedy Clustering algorithm
used for lossy time series compression. This study investigates, whether the efficient
but lossy compression sufficiently preserves the characteristics of the time series
to allow time series prediction and anomaly detection. We suggest two variants
for a performance optimization, Greedy SF and Greedy SAX. These algorithms are
based on novel lookup methods for cluster candidate selection based on statistical
features of time series and extracted SAX substrings. Furthermore, we enabled
the clustering to allow processing time series with different value ranges, which
allows the compression of time series with various scales. To validate the endto-
end pipeline including compression and prediction, a performance evaluation is
applied. To further analyse the applicability, a comprehensive benchmark against a
pipeline with an autoencoder for compression and a stacked LSTM for prediction is
performed. | sv |
dc.language.iso | eng | sv |
dc.relation.ispartofseries | CSE 20-13 | sv |
dc.subject | time series clustering | sv |
dc.subject | large scale data | sv |
dc.subject | machine learning | sv |
dc.subject | prediction | sv |
dc.subject | anomaly detection | sv |
dc.subject | compression | sv |
dc.title | Compressed Machine Learning on Time Series Data | sv |
dc.title.alternative | Efficient compression through clustering using candidate selection and the application of machine learning on compressed data | sv |
dc.type | text | |
dc.setspec.uppsok | Technology | |
dc.type.uppsok | H2 | |
dc.contributor.department | Göteborgs universitet/Institutionen för data- och informationsteknik | swe |
dc.contributor.department | University of Gothenburg/Department of Computer Science and Engineering | eng |
dc.type.degree | Student essay | |