GENERATING ROUTE DESCRIPTIONS Automatic generation of route descriptions with visual elements from graph and salient landmarks
No Thumbnail Available
Date
2024-10-25
Authors
Akhavan, Kamaneh
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Can machines find the shortest route and guide us using intuitive, human-like instructions, such as ”move
towards the big black piano” or ”head past the green armchair”? This thesis investigates the potential of
machines to generate navigation instructions that combine the efficiency of graph-based systems with the
clarity provided by salient landmarks.
Our research focuses on creating a system that determines the shortest path between two points and enriches
navigation with human-like, landmark-based descriptions. Integrating allocentric 1 and egocentric 2
perspectives, we aim to improve the quality and naturalness of the generated instructions.
To extract salient landmarks, we utilized data from Bérénice Le Glouanec’s project(Le Glouanec (2024)3),
which employed object recognition techniques, including a Faster R-CNN model. This approach allowed
us to identify significant visual attributes such as spatial location, shape, and color of landmarks within
the environment. We integrated these visually salient elements into our system to improve the clarity and
relatability of the generated navigation instructions.
The system was evaluated through measures of similarity between machine-generated and human-written
instructions, yielding a mean cosine similarity score of 54 % and a Jaccard similarity score of 13 %. These
results indicate a reasonable resemblance to human-generated navigation instructions, demonstrating the
system’s potential.
Future work will focus on expanding the dataset to include more diverse environments, such as outdoor
spaces, and exploring customizable multimodal systems to enhance user experience and accessibility.
Description
Keywords
salience, graph, route description