A multimodal speech interface for dynamic creation and retrieval of geographical landmarks on a mobile device
Institution: | MIT |
---|---|
Department: | Electrical Engineering and Computer Science |
Degree: | M. Eng. |
Year: | 2010 |
Keywords: | Electrical Engineering and Computer Science. |
Record ID: | 1887829 |
Full text PDF: | http://hdl.handle.net/1721.1/62638 |
As mobile devices become more powerful, researchers look to develop innovative applications that use new and effective means of input. Furthermore, developers must exploit the device's many capabilities (GPS, camera, touch screen, etc.) in order to make equally powerful applications. This thesis presents the development of a multimodal system that allows users to create and share informative geographical landmarks using Android-powered smartphones. The content associated with each landmark is dynamically integrated into the system's vocabulary, so users can retrieve a landmark by speaking the information related to it. The initial results of releasing the application on the Android Market have been encouraging, but they also suggest that the system needs further improvement.
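To make the idea of a dynamically growing vocabulary concrete, here is a minimal sketch. It is not the thesis's implementation: the `Landmark` and `LandmarkIndex` classes, the `tokenize` helper, the word-overlap ranking, and the sample landmarks with their coordinates are all hypothetical and chosen only for illustration. It assumes a speech recognizer has already produced a text transcript, and shows one simple way that words from each new landmark could be added to a vocabulary and then used for retrieval.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Landmark:
    # Hypothetical record; the fields are assumptions, not the thesis's schema.
    name: str
    latitude: float
    longitude: float
    description: str = ""


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


@dataclass
class LandmarkIndex:
    # vocabulary maps each word to the names of landmarks whose content contains it.
    vocabulary: dict = field(default_factory=dict)
    landmarks: dict = field(default_factory=dict)

    def add(self, landmark):
        """Store the landmark and add every word of its content to the vocabulary."""
        self.landmarks[landmark.name] = landmark
        for word in tokenize(landmark.name + " " + landmark.description):
            self.vocabulary.setdefault(word, set()).add(landmark.name)

    def retrieve(self, transcript):
        """Rank landmark names by how many distinct transcript words they match."""
        scores = {}
        for word in set(tokenize(transcript)):
            for name in self.vocabulary.get(word, ()):
                scores[name] = scores.get(name, 0) + 1
        # Highest score first; ties are broken alphabetically for determinism.
        return sorted(scores, key=lambda n: (-scores[n], n))


index = LandmarkIndex()
# Sample data with illustrative coordinates only.
index.add(Landmark("Stata Center", 42.36, -71.09, "robotics lab and lecture halls"))
index.add(Landmark("Killian Court", 42.36, -71.09, "graduation lawn"))

# The transcript stands in for recognizer output.
print(index.retrieve("show me the robotics lab"))  # prints ['Stata Center']
```

In this sketch, a word becomes searchable as soon as a landmark containing it is added, which is the property the abstract describes. A real system would also need to pass the updated vocabulary to the speech recognizer itself, which is outside the scope of this example.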