AI Safe Exploration: Reinforced learning with a blocker in unsafe environments

dc.contributor.authorKoivisto, Marco
dc.contributor.authorCrockett, Philip
dc.contributor.authorSpångberg, Axel
dc.contributor.departmentGöteborgs universitet/Institutionen för data- och informationsteknikswe
dc.contributor.departmentUniversity of Gothenburg/Department of Computer Science and Engineeringeng
dc.date.accessioned2019-11-12T10:40:01Z
dc.date.available2019-11-12T10:40:01Z
dc.date.issued2019-11-12
dc.description.abstractArtificial intelligence can be trained with a trial and error based approach. In an environment where a catastrophe can not be accepted a human overseer can be used, but this might lower the efficiency of the learning. The study includes implementation of an artifact meant to replace the human overseer when training an AI in simulated unsafe environments. The results of testing the implemented blocker shows that it can be used for avoiding catastrophes and finding a path to reach the goal in 17 out of 18 runs. The single failed execution shows that the implemented blocker is in need of improvement in terms of data efficiency. Shaping rewards solely to reduce number of steps and catastrophes for a reinforcement learning agent has been done successfully to some degree, but further steps can be taken to lower the number of catastrophes and steps.sv
dc.identifier.urihttp://hdl.handle.net/2077/62440
dc.language.isoengsv
dc.setspec.uppsokTechnology
dc.subjectArtificial Intelligencesv
dc.subjectReinforcement learningsv
dc.subjectSafe explorationsv
dc.subjectBlockersv
dc.subjectMachine Learningsv
dc.subjectBaby AI Gamesv
dc.subjectGym Mini Gridsv
dc.titleAI Safe Exploration: Reinforced learning with a blocker in unsafe environmentssv
dc.typetext
dc.type.degreeStudent essay
dc.type.uppsokM2

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
gupea_2077_62440_1.pdf
Size:
469.91 KB
Format:
Adobe Portable Document Format
Description:
CSE Group 8 - Koivisto, Crockett & Spångberg

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
876 B
Format:
Item-specific license agreed upon to submission
Description: