Show simple item record

dc.contributor.authorKoivisto, Marco
dc.contributor.authorCrockett, Philip
dc.contributor.authorSpångberg, Axel
dc.date.accessioned2019-11-12T10:40:01Z
dc.date.available2019-11-12T10:40:01Z
dc.date.issued2019-11-12
dc.identifier.urihttp://hdl.handle.net/2077/62440
dc.description.abstractArtificial intelligence can be trained with a trial and error based approach. In an environment where a catastrophe can not be accepted a human overseer can be used, but this might lower the efficiency of the learning. The study includes implementation of an artifact meant to replace the human overseer when training an AI in simulated unsafe environments. The results of testing the implemented blocker shows that it can be used for avoiding catastrophes and finding a path to reach the goal in 17 out of 18 runs. The single failed execution shows that the implemented blocker is in need of improvement in terms of data efficiency. Shaping rewards solely to reduce number of steps and catastrophes for a reinforcement learning agent has been done successfully to some degree, but further steps can be taken to lower the number of catastrophes and steps.sv
dc.language.isoengsv
dc.subjectArtificial Intelligencesv
dc.subjectReinforcement learningsv
dc.subjectSafe explorationsv
dc.subjectBlockersv
dc.subjectMachine Learningsv
dc.subjectBaby AI Gamesv
dc.subjectGym Mini Gridsv
dc.titleAI Safe Exploration: Reinforced learning with a blocker in unsafe environmentssv
dc.typetext
dc.setspec.uppsokTechnology
dc.type.uppsokM2
dc.contributor.departmentGöteborgs universitet/Institutionen för data- och informationsteknikswe
dc.contributor.departmentUniversity of Gothenburg/Department of Computer Science and Engineeringeng
dc.type.degreeStudent essay


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record