dc.contributor.author | Jarlow, Victor | |
dc.date.accessioned | 2021-10-06T07:25:09Z | |
dc.date.available | 2021-10-06T07:25:09Z | |
dc.date.issued | 2021-10-06 | |
dc.identifier.uri | http://hdl.handle.net/2077/69761 | |
dc.description.abstract | The frequent elements problem involves processing a stream of elements and finding all elements that occur more than a given fraction of the time. A relaxed version
of this problem is the -approximate elements problem which allows some false positives.
This thesis aims to solve this problem in a parallel context, where multiple
threads work together to speed up computation. Previous research has been successful
in producing algorithms that can process large streams of data very quickly,
however they divide the input stream equally among the threads in the system,
which results in excessive memory usage. The algorithm presented in this thesis, the Delegation Space-Saving algorithm, logically assigns ownership of certain elements to certain threads. This decreases space consumption and increases accuracy.
The Delegation Space-Saving algorithm was evaluated on the metrics of throughput, accuracy, and memory consumption. The algorithm was evaluated using both synthetic data with varying skew and real-world network packet data from a backbone
router. The Delegation Space-Saving algorithm uses as little as almost the same amount of memory as the single-threaded version, while also having several times higher query and update throughput and equivalent accuracy. | sv |
dc.language.iso | eng | sv |
dc.subject | computer science | sv |
dc.subject | big data | sv |
dc.subject | Space-Saving | sv |
dc.subject | Misra-Gries summary | sv |
dc.subject | frequent items | sv |
dc.subject | frequent elements | sv |
dc.subject | concurrent programming | sv |
dc.subject | Delegation Sketch | sv |
dc.subject | domain splitting | sv |
dc.subject | Count-Min Sketch | sv |
dc.subject | Majority algorithm | sv |
dc.subject | pproximate frequent-elements algorithm | sv |
dc.subject | approximate top-k elements algorithm | sv |
dc.title | Continuous Parallel Approximate Frequent Elements Queries on Data Streams | sv |
dc.type | text | |
dc.setspec.uppsok | Technology | |
dc.type.uppsok | H2 | |
dc.contributor.department | Göteborgs universitet/Institutionen för data- och informationsteknik | swe |
dc.contributor.department | University of Gothenburg/Department of Computer Science and Engineering | eng |
dc.type.degree | Student essay | |