• English
    • svenska
  • English 
    • English
    • svenska
  • Login
View Item 
  •   Home
  • Student essays / Studentuppsatser
  • Department of Computer Science and Engineering / Institutionen för data- och informationsteknik
  • Masteruppsatser
  • View Item
  •   Home
  • Student essays / Studentuppsatser
  • Department of Computer Science and Engineering / Institutionen för data- och informationsteknik
  • Masteruppsatser
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Continuous Parallel Approximate Frequent Elements Queries on Data Streams

Abstract
The frequent elements problem involves processing a stream of elements and finding all elements that occur more than a given fraction of the time. A relaxed version of this problem is the -approximate elements problem which allows some false positives. This thesis aims to solve this problem in a parallel context, where multiple threads work together to speed up computation. Previous research has been successful in producing algorithms that can process large streams of data very quickly, however they divide the input stream equally among the threads in the system, which results in excessive memory usage. The algorithm presented in this thesis, the Delegation Space-Saving algorithm, logically assigns ownership of certain elements to certain threads. This decreases space consumption and increases accuracy. The Delegation Space-Saving algorithm was evaluated on the metrics of throughput, accuracy, and memory consumption. The algorithm was evaluated using both synthetic data with varying skew and real-world network packet data from a backbone router. The Delegation Space-Saving algorithm uses as little as almost the same amount of memory as the single-threaded version, while also having several times higher query and update throughput and equivalent accuracy.
Degree
Student essay
URI
http://hdl.handle.net/2077/69761
Collections
  • Masteruppsatser
View/Open
Master Thesis (4.250Mb)
Date
2021-10-06
Author
Jarlow, Victor
Keywords
computer science
big data
Space-Saving
Misra-Gries summary
frequent items
frequent elements
concurrent programming
Delegation Sketch
domain splitting
Count-Min Sketch
Majority algorithm
pproximate frequent-elements algorithm
approximate top-k elements algorithm
Language
eng
Metadata
Show full item record

DSpace software copyright © 2002-2016  DuraSpace
Contact Us | Send Feedback
Theme by 
Atmire NV
 

 

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

LoginRegister

DSpace software copyright © 2002-2016  DuraSpace
Contact Us | Send Feedback
Theme by 
Atmire NV