Show simple item record

dc.contributor.authorTammen, Johann Henri
dc.date.accessioned2022-12-22T08:12:20Z
dc.date.available2022-12-22T08:12:20Z
dc.date.issued2022-12-22
dc.identifier.urihttps://hdl.handle.net/2077/74490
dc.description.abstractAs technical debt in software engineering projects continues to negatively impact the development process, this study focuses on technical debt in form of code smells in machine learning pipelines and in code written by data scientists. This study contributes to the body of knowledge on technical debt as it tries to quantify the assumption in the literature that scientists without a software engineering background struggle with software engineering’s best practices when writing code. Furthermore, as machine learning continues to evolve in software engineering, it makes sense to minimize technical debt in machine learning pipelines. Therefore, the source code from repositories in the version control system GitHub was analyzed. The results show that indeed data scientists produce more code smells than soft ware engineers. In addition, the study fails to demonstrate that data pipelines yield more code smells than non-data pipelines.en_US
dc.language.isoengen_US
dc.subjecttechnical debten_US
dc.subjectcode smellsen_US
dc.subjectmachine learningen_US
dc.subjectdata scienceen_US
dc.titleCode smells in machine learning pipelines: an MSR sample studyen_US
dc.typetext
dc.setspec.uppsokTechnology
dc.type.uppsokM2
dc.contributor.departmentGöteborgs universitet/Institutionen för data- och informationsteknikswe
dc.contributor.departmentUniversity of Gothenburg/Department of Computer Science and Engineeringeng
dc.type.degreeStudent essay


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record