dc.contributor.author | Yaghoobzadehtari, Sina | |
dc.contributor.author | Owusu Adomako, Colin | |
dc.contributor.author | Paidar, Siavash | |
dc.date.accessioned | 2019-11-18T13:45:22Z | |
dc.date.available | 2019-11-18T13:45:22Z | |
dc.date.issued | 2019-11-18 | |
dc.identifier.uri | http://hdl.handle.net/2077/62526 | |
dc.description.abstract | Autonomous agents, in recent times have been used to address
several problems, but these agents in their course of achieving
their task also emit side effects to the environment in which they
operate. Paramount of these side effects is reward hacking. In
this report, we try to address reward hacking using elaborate
operational requirements. The results is evaluated on the unity
machine learning platform using multi agents, a goalkeeper and
a striker where the elaborate operational requirements helped
address these agents from hacking or gaming their results. | sv |
dc.language.iso | eng | sv |
dc.title | Elaborate Operational Requirements to Address Reward Hacking in Reinforcement Learning Agents | sv |
dc.type | text | |
dc.setspec.uppsok | Technology | |
dc.type.uppsok | M2 | |
dc.contributor.department | Göteborgs universitet/Institutionen för data- och informationsteknik | swe |
dc.contributor.department | University of Gothenburg/Department of Computer Science and Engineering | eng |
dc.type.degree | Student essay | |