Who is laughing now? Laughter-infused dialogue systems

Maraev, Vladislav

Who is laughing now? Laughter-infused dialogue systems

Files

Thesis final_8aug.pdf (2.35 MB)

spikblad final.pdf (81.59 KB)

Date

2022-08-16

Authors

Maraev, Vladislav

Abstract

This thesis paves the way for including laughter in spoken dialogue systems in a domain-general and linguistically valid way using computational linguistics tools and methods. The thesis is concerned with three main areas. The first area concerns the placement of laughter in relation to speech and other behaviours. We show that convolutional and recurrent neural networks can effectively predict laughs from dialogue transcripts, whereas human perforance in this task is significantly worse. Such models allow dialogue systems to predict user laughter and, if needed, put system laughter in an appropriate place. Further, we look at laughter placement in relation to gaze and show that laughter performing different pragmatic functions is related to different gaze patterns. These findings provide important implications for embodied conversational agents and social robots in regard to multimodal behaviour realisation and coordination. The second area is concerned with the interaction between laughter and the communicative intent of a user and system, as well as with the context in which the given intent occurs. We lay the groundwork for the central component of a spoken dialogue system by implementing a dialogue manager in a theoretically informed way using a proof-theoretic model based on linear logic. Our dialogue manager is then extended to support laughter functioning as feedback or as a signal accompanying system feedback, and an answer to polar questions. Additionally, we look at how laughter can modify or shape a dialogue act, and how the inclusion of laughter can improve Transformer-based deep learning models in the task of dialogue act recognition. The third area is humour. Even though humour is not necessary for laughter, they are closely related. We look at how humour is related to reasoning about social conventions and other learned and accommodated implicit assumptions. We show how these can be modelled informally, suggesting one way in which the goal of situational and conversational creativity can be realised in artificial agents.

Description

Denna avhandling banar väg för att inkludera skratt i dialogsystem på ett domänoberoende och språkvetenskapligt giltigt sätt, med hjälp av verktyg och metoder från datalingvistiken. Avhandlingen behandlar tre huvudsakliga områden. Det första området rör placeringen av skratt i relation till tal och andra beteenden. Vi visar att neurala nätverk kan förutsäga skratt från transkriberad dialog, medan människor är sämre på detta. Sådana modeller gör det möjligt för dialogsystem att förutsäga mänskligt skratt och, vid behov, placera skratt från systemet på lämpliga ställen. Vi undersöker också placeringen av skratt i förhållande till ögonrörelser och visar att skrattets funktion är relaterad till olika ögonrörelsemönster. Dessa resultat har viktiga följder för konversationella agenter och sociala robotar med avseende på generering och samordning av multimodalt beteende. Det andra området rör interaktionen mellan skratt, den kommunikativa avsikten hos användare och system, och den kontext i vilken avsikten realiseras. Vi lägger grunden till den centrala komponenten i ett dialogsystem genom att implementera en teoretiskt motiverad dialoghanterare baserad på linjär logik. Vår dialoghanterare utvidgas sedan till att hantera skratt som utgör eller ackompanjerar återkoppling, och skratt som utgör svar på ja/nej-frågor. Därtill undersöker vi hur skratt kan modifiera eller forma en dialoghandling, samt hur skratt kan förbättra neurala modeller för igenkänning av dialoghandlingar. Det tredje området är humor. Även om humor inte är en förutsättning för skratt så är de nära sammankopplade. Vi undersöker hur humor är relaterad till resonerande om sociala konventioner och andra inlärda och implicita antaganden. Vi visar hur dessa kan modelleras informellt och påvisar därigenom ett sätt på vilket situationsbunden kreativitet i samtal kan realiseras i konversationella agenter.

Keywords

laughter, dialogue systems, humour, machine learning, reasoning, natural language processing, artificial agents

URI

https://hdl.handle.net/2077/72030

Collections

Doctoral Theses / Doktorsavhandlingar Institutionen för filosofi, lingvistik och vetenskapsteori

Full item page

Who is laughing now? Laughter-infused dialogue systems

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections