Quality Attributes of Data in Distributed Deep Learning Architectures
Abstract
Large volume of data is generated by different systems. Intelligent systems such as
autonomous driving uses such large volume of data to train their artificial intelligence models. However, good quality data is one of the foremost needs of any system to function in an effective and safe manner. Especially in critical systems such as those related with autonomous driving, quality data becomes sacrosanct as fault in
such systems could result in fatal accidents. In this thesis, a Design Science Research is conducted to identify challenges related with data quality of a distributed deep learning system. The challenges are identified by conducing interviews with five experts from autonomous driving domain as well as through literature review. The challenges and their severity are validated using a survey. After identification of the challenges, five artifact components are developed that relate with assessing and improving data quality. The artifact components include Data Quality Workflow,
List of Challenges, List of Data Quality Attributes, List of Data Quality Attribute Metrics, and Potential Solutions. The abstract artifact components and concrete implementation of those components are devised and validated using second round of interviews. In the third iteration of this study, the final artifact components are validated through a focus group session with experts and survey. Furthermore, the artifact also presents the information regarding which challenges affect which data quality attributes. This association between challenges and attributes are also validated in the focus group session. The results depict that most of the challenge - attribute association presumed by the researchers of this thesis are valid. Similarly, the templates developed for the artifact components are regarded as appropriate
as well. A contribution of this thesis study towards the body of software engineering and requirements engineering research is the comprehensive and unified "Data
Quality Assessment and Maintenance Framework" developed as a series of artifact components in this thesis. This framework can be used by researchers and practitioners to improve processes related with data quality as well as enhance data quality of the systems they develop.
Degree
Student essay
Collections
View/ Open
Date
2021-09-28Author
PRADHAN, SHAMEER KUMAR
TUNGAL, SAGAR
Keywords
Data quality
Data
Data quality attributes
Data quality challenges
Data quality workflow
Data quality assessment
Data quality maintenance
Design science research
Artifacts
Template
Deep learning
Distributed architecture
Distributed deep learning architecture
Advanced driver assistance systems
Language
eng
Metadata
Show full item recordRelated items
Showing items related by title, author, creator and subject.
-
Experience of adjuvant treatment among postmenopausal women with breast cancer - Health-Related Quality of Life, symptom experience, stressful events and coping strategies.
Browall, Maria (2008-02-22)In Sweden, breast cancer is today the most common type of cancer among women. Of the approximately 7,059 women who developed the disease in Sweden during 2006, about 73% were postmenopausal and aged 55 or older at time of ... -
Pedagogical quality in preschool : an issue of perspectives
Sheridan, Sonja (Göteborg : Acta Universitatis Gothoburgensis, 2001)The main aims of this thesis on the pedagogical quality in preschool are: to define and describe a pedagogical concept of quality; to explore how quality is experienced and valued from different perspectives; to find out ... -
EDUCATIONAL QUALITY AND EQUITY IN SOUTH AFRICA: EVIDENCE FROM TIMSS 2015
Mensah, Ernest (2020-09-09)Aim: This study aims to investigate the relationship between teacher qualification and characteristics, teacher instructional quality, students’ family socioeconomic background, and student mathematics achievement with ...