dc.contributor.author | Makashova, Liliia | |
dc.date.accessioned | 2021-09-23T06:37:00Z | |
dc.date.available | 2021-09-23T06:37:00Z | |
dc.date.issued | 2021-09-23 | |
dc.identifier.uri | http://hdl.handle.net/2077/69692 | |
dc.description.abstract | Speech synthesis (text-to-speech, TTS) and speech recognition (automatic speech recognition, ASR) are the NLP technologies that are the least available for low-resource and indigenous languages. Lack of computational and data resources is the major obstacle when it comes to the development of linguistic tools for these languages.
We present a framework that does not require enormous GPU and target data resources, as well as guarantees reasonably good results in performance for the end-product. In this work we perform dual connection between TTS and ASR models and make them learn from each other in a low-resource setup. This project, being the first open-source implementation of such a bidirectional algorithm, leverages the power of open-source projects for the benefit of indigenous languages. We release the first ever functioning ASR tool for the North Sámi language along with a competitive TTS technology, which fulfills the demand of the North Sámi community and globally contributes to the further development of AI tools for low-resource languages. | sv |
dc.language.iso | eng | sv |
dc.subject | Speech synthesis | sv |
dc.subject | automatic speech recognition | sv |
dc.subject | low-resource language | sv |
dc.subject | machine learning | sv |
dc.subject | transfer learning | sv |
dc.title | SPEECH SYNTHESIS AND RECOGNITION FOR A LOW-RESOURCE LANGUAGE Connecting TTS and ASR for mutual benefit | sv |
dc.type | Text | |
dc.setspec.uppsok | HumanitiesTheology | |
dc.type.uppsok | H2 | |
dc.contributor.department | University of Gothenburg / Department of Philosophy,Lingustics and Theory of Science | eng |
dc.contributor.department | Göteborgs universitet / Institutionen för filosofi, lingvistik och vetenskapsteori | swe |
dc.type.degree | Student essay | |