Code smells in machine learning pipelines: an MSR sample study

dc.contributor.author: Tammen, Johann Henri
dc.contributor.department: Göteborgs universitet/Institutionen för data- och informationsteknik [swe]
dc.contributor.department: University of Gothenburg/Department of Computer Science and Engineering [eng]
dc.date.accessioned: 2022-12-22T08:12:20Z
dc.date.available: 2022-12-22T08:12:20Z
dc.date.issued: 2022-12-22
dc.description.abstract: As technical debt in software engineering projects continues to negatively impact the development process, this study focuses on technical debt in the form of code smells in machine learning pipelines and in code written by data scientists. This study contributes to the body of knowledge on technical debt as it tries to quantify the assumption in the literature that scientists without a software engineering background struggle with software engineering best practices when writing code. Furthermore, as machine learning continues to evolve in software engineering, it makes sense to minimize technical debt in machine learning pipelines. Therefore, source code from repositories hosted on GitHub was analyzed. The results show that data scientists do indeed produce more code smells than software engineers. In addition, the study fails to demonstrate that data pipelines yield more code smells than non-data pipelines. [en]
dc.identifier.uri: https://hdl.handle.net/2077/74490
dc.language.iso: eng [en]
dc.setspec.uppsok: Technology
dc.subject: technical debt [en]
dc.subject: code smells [en]
dc.subject: machine learning [en]
dc.subject: data science [en]
dc.title: Code smells in machine learning pipelines: an MSR sample study [en]
dc.type: text
dc.type.degree: Student essay
dc.type.uppsok: M2

Files

Original bundle

Name: CSE 22-11 GUPEA 73822 Tammen.pdf
Size: 721.33 KB
Format: Adobe Portable Document Format
Description: Thesis

License bundle

Name: license.txt
Size: 876 B
Format: Item-specific license agreed upon to submission