Visa enkel post

dc.contributor.authorFornmark, Filip
dc.date.accessioned2023-01-19T10:57:30Z
dc.date.available2023-01-19T10:57:30Z
dc.date.issued2023-01-19
dc.identifier.urihttps://hdl.handle.net/2077/74597
dc.description.abstractThis thesis presents an empirical study connected to historical cryptography and especially within the framework of the research project DECRYPT. One of the research questions in the DECRYPT project relates to the use of language models for automatic cryptanalysis. In particular, whether historical language data result in more performant models than large scale models generated from contemporary language corpora. The present thesis aims to explore this question for the English language applied to the classical cipher known as homophonic substitution. Key complexity and message lengths are also taken into consideration. A shorter survey of real historic cryptological keys is also performed in order to gain insights into key design. Statistical n-gram models are generated from the HistCORP collection of historical language and corpora. Test data is generated from the same dataset and encrypted with keys of different complexity. Each sample of test data is then cryptanalysed with a publicly available algorithm for cryptanalysis, and the results from different models are evaluated and compared. The results of the experiments show that there are tendencies that historical texts are better analysed with models based on historical language data. In particular, the performance seems to correlate with the evolution of orthography. Key complexity and message length influence the results, where a less complex key and longer message length generally lead to better accuracy of the cryptanalysis. The results can be viewed as a stepping stone into the broader question of automatic cryptanalysis of historical ciphers, and how suitable language models could or should be assembled.en_US
dc.language.isoengen_US
dc.subjectstatistical language models, cryptanalysis, historical cryptology, homophonic substitutionen_US
dc.titleModels, Keys, and Cryptanalysis: Evaluating historical statistical language models in cryptanalysis of homophonic substitution ciphersen_US
dc.typeText
dc.setspec.uppsokHumanitiesTheology
dc.type.uppsokM2
dc.contributor.departmentGöteborgs universitet/Institutionen för filosofi, lingvistik och vetenskapsteoriswe
dc.contributor.departmentGöteborg University/Department of Philosophy, Linguistics and Theory of Scienceeng
dc.type.degreeStudent essay


Filer under denna titel

Thumbnail

Dokumentet tillhör följande samling(ar)

Visa enkel post