Show simple item record

dc.contributor.authorHeimann Mühlenbock, Katarina
dc.date.accessioned2013-04-03T10:51:22Z
dc.date.available2013-04-03T10:51:22Z
dc.date.issued2013-04-03
dc.identifier.isbn978-91-87850-50-9
dc.identifier.urihttp://hdl.handle.net/2077/32472
dc.description.abstractThis thesis aims to identify linguistic factors that affect readability and text comprehension, viewed as a function of text complexity. Features at various linguistic levels suggested in existing literature are evaluated, including the Swedish readability formula LIX. Natural language processing methods and resources are employed to investigate characteristics that go beyond traditional superficial measures. A comparable corpus of eay-to-read and ordinary texts from three genres is investigated, and it is shown how features present at various levels of representation differ quantitatively across text types and genres. The findings are confirmed in significance tests as well as principal component analysis. Three machine learning algorithms are employed and evaluated in order to build a statistical model for text classification. The results demonstrate that a proposed language model for Swedish (SVIT), utilizing a combination of linguistic features, actually predicts text complexity and genre with a higher accuracy than LIX. It is suggested that the SVIT language model should be adopted to assess surface language properties, vocabulary load, sentence structure, idea density levels as well as the personal interests of different texts. Specific target groups of readers may then be provided with materials tailored to their level of proficiency.sv
dc.language.isoengsv
dc.relation.ispartofseriesData Linguisticasv
dc.relation.ispartofseries24sv
dc.subjectreadabilitysv
dc.subjecttext complexitysv
dc.subjectcomputational linguisticssv
dc.subjectlanguage resourcessv
dc.subjectlanguage technologysv
dc.subjectlinguistic featuressv
dc.subjectLIXsv
dc.subjectSVITsv
dc.subjectcorpus linguisticssv
dc.subjecttext classificationsv
dc.subjectquantitative methodssv
dc.subjectnatural language processingsv
dc.subjectmultilevel text analysissv
dc.titleI see what you meansv
dc.title.alternativeAssessing readability for specific target groupssv
dc.typeText
dc.type.svepDoctoral thesiseng
dc.gup.mailkatarina.heimann.muhlenbock@gu.sesv
dc.type.degreeDoctor of Philosophysv
dc.gup.originGöteborgs universitet. Humanistiska fakultetenswe
dc.gup.originUniversity of Gothenburg. Faculty of Artseng
dc.gup.departmentDepartment of Swedish ; Institutionen för svenska språketsv
dc.gup.defenceplaceFredagen den 26 april 2013, kl. 10.15, Stora hörsalen, Humanistensv
dc.gup.defencedate2013-04-26
dc.gup.dissdb-fakultetHF


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record