• English
    • svenska
  • svenska 
    • English
    • svenska
  • Logga in
Redigera dokument 
  •   Startsida
  • Student essays / Studentuppsatser
  • Department of Computer Science and Engineering / Institutionen för data- och informationsteknik
  • Masteruppsatser
  • Redigera dokument
  •   Startsida
  • Student essays / Studentuppsatser
  • Department of Computer Science and Engineering / Institutionen för data- och informationsteknik
  • Masteruppsatser
  • Redigera dokument
JavaScript is disabled for your browser. Some features of this site may not work without it.

Continuous Parallel Approximate Frequent Elements Queries on Data Streams

Sammanfattning
The frequent elements problem involves processing a stream of elements and finding all elements that occur more than a given fraction of the time. A relaxed version of this problem is the -approximate elements problem which allows some false positives. This thesis aims to solve this problem in a parallel context, where multiple threads work together to speed up computation. Previous research has been successful in producing algorithms that can process large streams of data very quickly, however they divide the input stream equally among the threads in the system, which results in excessive memory usage. The algorithm presented in this thesis, the Delegation Space-Saving algorithm, logically assigns ownership of certain elements to certain threads. This decreases space consumption and increases accuracy. The Delegation Space-Saving algorithm was evaluated on the metrics of throughput, accuracy, and memory consumption. The algorithm was evaluated using both synthetic data with varying skew and real-world network packet data from a backbone router. The Delegation Space-Saving algorithm uses as little as almost the same amount of memory as the single-threaded version, while also having several times higher query and update throughput and equivalent accuracy.
Examinationsnivå
Student essay
URL:
http://hdl.handle.net/2077/69761
Samlingar
  • Masteruppsatser
Fil(er)
Master Thesis (4.250Mb)
Datum
2021-10-06
Författare
Jarlow, Victor
Nyckelord
computer science
big data
Space-Saving
Misra-Gries summary
frequent items
frequent elements
concurrent programming
Delegation Sketch
domain splitting
Count-Min Sketch
Majority algorithm
pproximate frequent-elements algorithm
approximate top-k elements algorithm
Språk
eng
Metadata
Visa fullständig post

DSpace software copyright © 2002-2016  DuraSpace
gup@ub.gu.se | Teknisk hjälp
Theme by 
Atmire NV
 

 

Visa

VisaSamlingarI datumordningFörfattareTitlarNyckelordDenna samlingI datumordningFörfattareTitlarNyckelord

Mitt konto

Logga inRegistrera dig

DSpace software copyright © 2002-2016  DuraSpace
gup@ub.gu.se | Teknisk hjälp
Theme by 
Atmire NV