Clustering Non-Stationary Data Streams with Online Deep Learning

dc.contributor.author: Hontabat, Aurélien
dc.contributor.author: Rising, Magnus
dc.contributor.department: Göteborgs universitet/Institutionen för data- och informationsteknik (swe)
dc.contributor.department: University of Gothenburg/Department of Computer Science and Engineering (eng)
dc.date.accessioned: 2016-06-30T08:07:01Z
dc.date.available: 2016-06-30T08:07:01Z
dc.date.issued: 2016-06-30
dc.description.abstract: With more devices connected, more sensor data logged, and more people active in social networks, the trend towards working with dynamic data is clear. The number of applications in which real-time analysis of data streams is essential grows accordingly, each with its own challenges. Within data stream analysis, we benchmark the performance of current state-of-the-art clustering algorithms: CluStream, DenStream and ClusTree. We also adapt a Variational Autoencoder to operate on non-stationary data streams and assess its generative capabilities for dimensionality reduction. From this limited lab experiment, we show that while dimensionality reduction with a Variational Autoencoder significantly improves clustering accuracy on high-dimensional datasets, not all clustering algorithms benefit from it in the same way. Additionally, we show that, regardless of the clustering algorithm, the dimensionality reduction yielded no relevant improvement in the purity of the clusters. (sv)
dc.identifier.uri: http://hdl.handle.net/2077/44782
dc.language.iso: eng (sv)
dc.setspec.uppsok: Technology
dc.subject: Clustering (sv)
dc.subject: Data Streams (sv)
dc.subject: Deep Learning (sv)
dc.subject: Dimensionality Reduction (sv)
dc.title: Clustering Non-Stationary Data Streams with Online Deep Learning (sv)
dc.type: text
dc.type.degree: Student essay
dc.type.uppsok: M2

Files

Original bundle

Name: gupea_2077_44782_6.pdf
Size: 4.45 MB
Format: Adobe Portable Document Format

License bundle

Name: gupea_2077_44782_2.txt
Size: 876 B
Format: Item-specific license agreed upon to submission