Elaborate Operational Requirements to Address Reward Hacking in Reinforcement Learning Agents

dc.contributor.author	Yaghoobzadehtari, Sina
dc.contributor.author	Owusu Adomako, Colin
dc.contributor.author	Paidar, Siavash
dc.date.accessioned	2019-11-18T13:45:22Z
dc.date.available	2019-11-18T13:45:22Z
dc.date.issued	2019-11-18
dc.identifier.uri	http://hdl.handle.net/2077/62526
dc.description.abstract	Autonomous agents, in recent times have been used to address several problems, but these agents in their course of achieving their task also emit side effects to the environment in which they operate. Paramount of these side effects is reward hacking. In this report, we try to address reward hacking using elaborate operational requirements. The results is evaluated on the unity machine learning platform using multi agents, a goalkeeper and a striker where the elaborate operational requirements helped address these agents from hacking or gaming their results.	sv
dc.language.iso	eng	sv
dc.title	Elaborate Operational Requirements to Address Reward Hacking in Reinforcement Learning Agents	sv
dc.type	text
dc.setspec.uppsok	Technology
dc.type.uppsok	M2
dc.contributor.department	Göteborgs universitet/Institutionen för data- och informationsteknik	swe
dc.contributor.department	University of Gothenburg/Department of Computer Science and Engineering	eng
dc.type.degree	Student essay