Semantic Interpretation for Speech Recognition


Semantic Interpretation for Speech Recognition defines the syntax and semantics of annotations to grammar rules in the Speech Recognition Grammar Specification. Since 5 April 2007, it is a World Wide Web Consortium recommendation.
By building upon SRGS grammars, it allows voice browsers via ECMAScript to semantically interpret complex grammars and provide the information back to the application. For example, it allows utterances like "I would like a Coca-cola and three large pizzas with pepperoni and mushrooms." to be interpreted into an object that can be understood by an application. For example, the utterance could produce the following object named order:


If used against this grammar that includes SISR markup in addition to the standard SRGS grammar in XML format:


GRAMMAR 1.0//EN"
"http://www.w3.org/TR/speech-grammar/grammar.dtd">
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.w3.org/2001/06/grammar
http://www.w3.org/TR/speech-grammar/grammar.xsd"
version="1.0" mode="voice" tag-format="semantics/1.0" root="order">

I would like a

out.drink = new Object; out.drink.liquid=rules.drink.type;
out.drink.drinksize=rules.drink.drinksize;

and

out.pizza=rules.pizza;



coke
pepsi
coca colaout="coke";



out="medium";


smallout="small";
medium
largeout="large";
regularout="medium";





out=new Array;

out.push;

and

out.push;




anchovies
pepperoni
mushroomout="mushrooms";
mushrooms






out.drinksize=rules.foodsize; out.type=rules.kindofdrink;





out.pizzasize=rules.foodsize; out.number=rules.number;
pizzas with

out.topping=rules.tops;




out=1;

a
one


twoout=2;
threeout=3;