Natural Language Processing Model for Maltese Syntax

Attard, Greta

dc.contributor.author	Attard, Greta
dc.date.accessioned	2021-10-08T08:10:43Z
dc.date.available	2021-10-08T08:10:43Z
dc.date.issued	2021-10-08
dc.identifier.uri	http://hdl.handle.net/2077/69768
dc.description.abstract	The objective of this thesis is to create a Natural Language Processing Model for the Maltese Language. The ultimate goal is that the model would be able to recognise syntactical features, that is the linguistic features and the relationship of a sequence of words, in Maltese. The performance and accuracy of the Maltese model is compared with the models of languages that have great influence on the Maltese language. The results outputted by the dependency parser were linguistically analysed to provide in depth analysis of the results outputted during training and testing. The model is tested on unseen text to provide a further understanding of the level of accuracy of the machine learning algorithm. For this syntax annotator, the model created is trained on manually annotated data and then the output is syntax data that is processed by the dependency parser and part-of- speech tagger. This model is made using the Python package spaCy. Since every language is unique, the linguistic rules are evaluated, to teach the model the rules of the language being researched. The MUDTv1 corpus developed by Slavomír Céplö for his Phd Thesis is used to train this model. The results show that the Maltese syntax model had a 91% part-of-speech tag accuracy, 74% unlabelled attachment score and 66% labelled attachment score. The model is further tested on unseen non-annotated text, the tag accuracy is 75% and the tokeniser accuracy is 99%.	sv
dc.language.iso	eng	sv
dc.subject	natural language processing	sv
dc.subject	syntax	sv
dc.subject	spaCy	sv
dc.subject	universal dependency	sv
dc.subject	dependency parser	sv
dc.subject	part-of-speech tagger	sv
dc.subject	maltese nlp pipeline	sv
dc.title	Natural Language Processing Model for Maltese Syntax	sv
dc.title.alternative	Natural Language Processing-modell för Maltesisk Syntax	sv
dc.type	Text
dc.setspec.uppsok	HumanitiesTheology
dc.type.uppsok	H1
dc.contributor.department	Göteborgs universitet/Institutionen för filosofi, lingvistik och vetenskapsteori	swe
dc.contributor.department	Göteborg University/Department of Philosophy, Linguistics and Theory of Science	eng
dc.type.degree	Student essay

Files in this item

Name:: gupea_2077_69768_1.pdf
Size:: 1.300Mb
Format:: PDF
Description:: thesis

View/Open

This item appears in the following Collection(s)

Magisteruppsatser/ Institutionen för filosofi, lingvistik och vetenskapsteori

Show simple item record