Show simple item record

dc.contributor.authorRibeck, Judy
dc.date.accessioned2015-11-13T07:48:08Z
dc.date.available2015-11-13T07:48:08Z
dc.date.issued2015-11-13
dc.identifier.isbn978-91-87850-59-2
dc.identifier.issn0347-948X
dc.identifier.urihttp://hdl.handle.net/2077/40506
dc.description.abstractIn this work, I present a linguistic investigation of the language of Swedish textbooks in the natural sciences, i.e., biology, physics and chemistry. The textbooks, which are used in secondary and upper secondary school, are examined with respect to traditional readability measures, e.g., LIX, OVIX and nominal ratio. I also extract typical linguistic features of the texts, typicality being determined using a proposed quantitative method, labelled the index principle. This empirical, corpus-based method relies on automatic linguistic annotations produced by language technology tools to calculate what I call index lists, rank-ordered lists of characteristic linguistic features of specific text corpora as compared to reference texts. I produce index lists for typical vocabulary, noun phrase structures and syntactic structures, extracted from a 5.2 million word textbook corpus, compiled as a part of the work presented. As well as being frequent and well dispersed, the linguistic variables selected for the index lists are also characteristic of the text type in question, as is evident when they are compared to a reference corpus, comprising textbooks in the social sciences and mathematics, as well as narrative and academic (university-level) texts. The results show that textbooks in natural science contain a lot of content-specific, technical vocabulary. This characteristic not only distinguishes natural scientific language from everyday language, but also from social scientific language, which on the lexical level has more in common with narrative texts. On the other hand, the textbook language as a whole is structurally distinguishable from narrative texts, as clearly seen, e.g., in its noun phrase complexity. In the transition between secondary and upper secondary school, the scores of almost every readability measure go up, indicating an increase in linguistic demands on the readers. In the upper secondary textbooks the words are longer, the vocabulary more varied, the noun phrases longer and more elaborate, and the most typical syntactic structures more complex. Notably, the linguistic development between the form levels is more marked in the natural-science textbooks, compared to social sciences and mathematics. Nevertheless, the textbook language overall shows a relatively low complexity in comparison to academic language.sv
dc.language.isoswesv
dc.relation.ispartofseriesData linguisticasv
dc.relation.ispartofseries28sv
dc.subjectacademic languagesv
dc.subjectcomputational linguisticssv
dc.subjectcorpus linguisticssv
dc.subjectlanguage technologysv
dc.subjectnatural language processingsv
dc.subjectscientific languagesv
dc.subjectsubject-specific languagesv
dc.subjectSwedish textbookssv
dc.subjectquantitative stylisticssv
dc.titleSteg för steg. Naturvetenskapligt ämnesspråk som räknassv
dc.title.alternativeStep by step. A computational analysis of Swedish textbook languagesv
dc.typeText
dc.type.svepDoctoral thesiseng
dc.type.degreeDoctor of Philosophysv
dc.gup.originGöteborgs universitet. Humanistiska fakultetenswe
dc.gup.originUniversity of Gothenburg. Faculty of Artseng
dc.gup.departmentDepartment of Swedish ; Institutionen för svenska språketsv
dc.gup.defenceplaceFredagen den 4 december 2015, kl. 13:15, Lilla Hörsalen, Humanistensv
dc.gup.defencedate2015-12-04
dc.gup.dissdb-fakultetHF


Files in this item

Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record