Voice Gene
TABLE OF CONTENTS
Executive Summary
Introduction
Analysis
2.1 Reusability of code and developers' skills
2.2 Suitability for VoiceGenie's platform
2.3 Style
2.4 Industry Standard
Conclusions
Recommendations
References
Appendix A – An Example of an X+V Application
Appendix B – An Example of a SALT Application
Executive Summary
VoiceGenie Technologies Inc., a VoiceXML Gateway solutions company, is striving to support multimodal applications. The company must decide which multimodal markup language to support. The markup languages being considered are:

X+V (XHTML+Voice), a combination of XHTML and VoiceXML.
SALT (Speech Application Language Tags), an extension of HTML/XHTML using SALT tags.
VoiceGenie's method, a set of HTML pages and a set of VoiceXML pages synchronized by sending messages to each other.
This report assumes that an X+V or SALT browser is to be implemented by VoiceGenie. The benefits and drawbacks of these markup languages are analyzed using the following criteria:

Reusability of code and developers' skills: Both Web and voice application developers would find X+V easy to learn. An existing Web application can be reused if it is converted to SALT. Both Web and voice applications can be easily converted to X+V.

Suitability for VoiceGenie's platform: X+V is more suitable for VoiceGenie, since the company's platform already supports VoiceXML.
Style: The layout of elements in an X+V application is more elegant and intuitive than in a SALT application.
Industry Standard: It is not clear whether X+V or SALT will become the standard.
This report recommends that VoiceGenie's method be used until a standard multimodal markup language emerges.
1.0 Introduction
The development of multimodal technology has become increasingly significant over the past several years. A multimodal application accepts multiple modes of input and output. Possible inputs include speech, keystrokes, and mouse clicks; possible outputs include synthesized speech, text, graphics, and video. Multimodality is most useful in a mobile environment, where keyboard input is difficult because the user is moving or the keyboard is small.

Since multimodality is a relatively new technology, there is not yet a single standard markup language accepted by the industry for developing multimodal applications. There are currently two markup languages submitted to the W3C for review: X+V and SALT.

X+V stands for XHTML+Voice, a language that combines XHTML for the visual content with VoiceXML for the audio component. This multimodal markup language is an initiative of IBM, Motorola, and Opera Software. Version 1.0 was submitted to the W3C for review at the end of 2001, and version 1.1 was submitted on March 11, 2003 (Multimodal, 2003).
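To make the split between visual and audio content concrete, the following is a minimal sketch and not taken from VoiceGenie's materials. It embeds a small, hypothetical X+V page and uses Python's standard XML parser to separate the XHTML elements from the VoiceXML elements. The page content, the `say_hello` identifier, and the helper function names are illustrative assumptions; the three namespace URIs are the ones commonly used for XHTML, VoiceXML, and XML Events.

```python
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"
VXML_NS = "http://www.w3.org/2001/vxml"
EV_NS = "http://www.w3.org/2001/xml-events"

# Hypothetical X+V page: the voice dialog lives in the head, and an
# XML Events attribute on a visual element triggers it.
XV_PAGE = """<?xml version="1.0"?>
<html xmlns="http://www.w3.org/1999/xhtml"
      xmlns:vxml="http://www.w3.org/2001/vxml"
      xmlns:ev="http://www.w3.org/2001/xml-events">
  <head>
    <title>Greeting</title>
    <vxml:form id="say_hello">
      <vxml:block>Hello, welcome to the multimodal demo.</vxml:block>
    </vxml:form>
  </head>
  <body>
    <p ev:event="click" ev:handler="#say_hello">Click here to hear a greeting.</p>
  </body>
</html>
"""


def split_by_namespace(markup):
    """Return (visual, voice): local element names grouped by namespace."""
    root = ET.fromstring(markup)
    visual, voice = [], []
    for el in root.iter():
        # ElementTree stores qualified tags as "{namespace}localname".
        ns, _, local = el.tag[1:].partition("}")
        if ns == XHTML_NS:
            visual.append(local)
        elif ns == VXML_NS:
            voice.append(local)
    return visual, voice


def event_bindings(markup):
    """Return (event, handler id) pairs that link visual elements to voice dialogs."""
    root = ET.fromstring(markup)
    bindings = []
    for el in root.iter():
        event = el.get(f"{{{EV_NS}}}event")
        handler = el.get(f"{{{EV_NS}}}handler")
        if event and handler:
            bindings.append((event, handler.lstrip("#")))
    return bindings


if __name__ == "__main__":
    print(split_by_namespace(XV_PAGE))
    print(event_bindings(XV_PAGE))
```

In this sketch the visual markup and the voice dialog stay in separate namespaces, and the only link between them is the event attribute that names the voice form to run.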

SALT (Speech Application Language Tags) is a language proposed by Microsoft and submitted to the W3C for review in July 2002. It extends XHTML with SALT tags, which handle the audio component of the application (Multimodal, 2003).
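As a rough illustration of how SALT tags sit alongside ordinary page markup, the sketch below embeds a small, hypothetical SALT page and lists which visual element each speech recognition result is bound to. The page content, the element identifiers, the grammar file name `city.grxml`, and the helper function are assumptions made for illustration. The namespace URI and the `prompt`, `listen`, `grammar`, and `bind` elements follow the SALT 1.0 specification as best understood here and should be checked against it.

```python
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"
SALT_NS = "http://www.saltforum.org/2002/SALT"  # assumed SALT 1.0 namespace

# Hypothetical SALT page: speech tags are mixed into the page body, and
# a script call on the visual element starts recognition.
SALT_PAGE = """<html xmlns="http://www.w3.org/1999/xhtml"
      xmlns:salt="http://www.saltforum.org/2002/SALT">
  <body>
    <input type="text" id="city" onclick="recoCity.Start()" />
    <salt:prompt id="askCity">Which city are you travelling to?</salt:prompt>
    <salt:listen id="recoCity">
      <salt:grammar src="city.grxml" />
      <salt:bind targetelement="city" value="//city" />
    </salt:listen>
  </body>
</html>
"""


def listen_bindings(markup):
    """Return (listen id, target id, target exists) for each salt:bind."""
    root = ET.fromstring(markup)
    # Collect the ids of visual (XHTML) elements so each bind target can be checked.
    visual_ids = {
        el.get("id")
        for el in root.iter()
        if el.tag.startswith(f"{{{XHTML_NS}}}") and el.get("id")
    }
    bindings = []
    for listen in root.iter(f"{{{SALT_NS}}}listen"):
        for bind in listen.iter(f"{{{SALT_NS}}}bind"):
            target = bind.get("targetelement")
            bindings.append((listen.get("id"), target, target in visual_ids))
    return bindings


if __name__ == "__main__":
    print(listen_bindings(SALT_PAGE))
```

Here the speech tags refer to the visual element by its identifier, and the page's own script starts recognition, which is why an existing Web page can be extended with SALT tags without restructuring it.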

Due to the lack of an X+V interpreter or a SALT browser, VoiceGenie is currently developing its own multimodal “language” that the

Multimodal Markup Language And V Application. (June 12, 2021). Retrieved from https://www.freeessays.education/multimodal-markup-language-and-v-application-essay/