An Empirical Survey of Bandits in an Industrial Recommender System Setting

dc.contributor.author: Schwarz, Tobias
dc.contributor.author: Brandby, Johan
dc.contributor.department: Göteborgs universitet/Institutionen för data- och informationsteknik (swe)
dc.contributor.department: University of Gothenburg/Department of Computer Science and Engineering (eng)
dc.date.accessioned: 2023-09-21T12:53:52Z
dc.date.available: 2023-09-21T12:53:52Z
dc.date.issued: 2023-09-21
dc.description.abstract: This thesis investigates the effect of incorporating unstructured data, in the form of images in the wild, into contextual multi-armed bandits used in a recommender system for picture-based content suggestion. Image features are extracted by a pre-trained convolutional neural network, and bandit behavior is studied with and without these features in the context, which normally relies on structured data sources such as metadata. The evaluation is carried out both online, through A/B testing enabled by the industrial partner YouPic AB, and offline, through a simulation pipeline that models the online counterpart. The results are compiled as a survey covering a selection of contextual bandit algorithms and highlighting the differences brought by the unstructured data. The offline results indicate that, if the contextual bandit uses a joint or hybrid action-value function with respect to the parameterization, adding the image vectors can significantly outperform the instances without them; if a disjoint model is employed instead, no noticeable change is observed. The online trials can be interpreted as supporting the inclusion of convolutional features, but because of meager and unbalanced sample sizes, the outcomes are deemed inconclusive. In summary, although there is support for incorporating unstructured data when the action-value function is joint or hybrid, the online experiments gave too little evidence for trustworthy findings, so the question remains partially open. (en)
dc.identifier.uri: https://hdl.handle.net/2077/78577
dc.language.iso: eng (en)
dc.setspec.uppsok: Technology
dc.subject: computer science (en)
dc.subject: industrial application (en)
dc.subject: machine learning (en)
dc.subject: reinforcement learning (en)
dc.subject: multi-armed bandits (en)
dc.subject: MAB (en)
dc.subject: contextual multi-armed bandits (en)
dc.subject: survey (en)
dc.subject: batch learning (en)
dc.title: An Empirical Survey of Bandits in an Industrial Recommender System Setting (en)
dc.type: text
dc.type.degree: Student essay
dc.type.uppsok: H2

Files

Original bundle

Name: CSE 23-02 TS JB.pdf
Size: 9.76 MB
Format: Adobe Portable Document Format
