Copyright © 1999 W3C® (MIT, INRIA, Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply.
The W3C Voice Browser working group aims to develop specifications to enable access to the Web using spoken interaction. This document is part of a set of requirements studies for voice browsers, and provides details of the requirements for natural language processing.
This document describes the requirements for natural language processing for voice browsers, as a precursor to starting work on specifications. Related requirement drafts are linked from the introduction. The requirements are being released as working drafts but are not intended to become proposed recommendations.
This specification is a Working Draft of the Voice Browser working group for review by W3C members and other interested parties. This is the first public version of this document. It is a draft document and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use W3C Working Drafts as reference material or to cite them as other than "work in progress".
Publication as a Working Draft does not imply endorsement by the W3C membership, nor of members of the Voice Browser working groups. This is still a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite W3C Working Drafts as other than "work in progress."
This document has been produced as part of the W3C Voice Browser Activity, following the procedures set out for the W3C Process. The authors of this document are members of the Voice Browser Working Group. This document is for public review. Comments should be sent to the public mailing list <[email protected]> (archive) by 14th January 2000.
A list of current W3C Recommendations and other technical documents can be found at http://www.w3.org/TR.
The main goal of this subgroup is to establish a prioritized list of requirements for natural language processing in a voice browser environment.
The process will consist of the following steps:
This document specifies requirements that define the capabilities of any component of a voice browser system which performs natural language interpretation, that is, the task of determining and representing the content of a natural language input from a user. Interpretation components include both stand-alone natural language understanding (NLU) components which receive text string results from a speech recognizer or keyboard as well as speech recognizers that incorporate natural language understanding functionality by returning interpretations rather than, or in addition to, text strings.
The activities of the Natural Language Requirements Subgroup will be coordinated with the activities of the Grammar Representation Subgroup, the Synthesis Markup Subgroup, and the Dialog Subgroup.
The NLU system should be able to:
A standalone (i.e., not integrated with a speech recognizer) NLU system should be able to:
Any NLU system should be able to:
These requirements are intended to insure that the natural language component is capable of representing results of processing task-specific utterances.
An NLU system should be able to:
Represent task information:
Represent meta-task information (all nice to specify)
An NLU system should be able to represent meta-dialog information having to do with the communication process.(all nice to specify except as noted)
The NLU system should be able to represent: (all must specify except as noted)
This is an example of a banking application with the user's utterances annotated with an example of a possible NLU representation, based on the following task model.
Identification: Name: Address: Street: City: Zip Code: Phone: Action: Transfer: Source_account: Destination_account: Amount: Value: Currency: Balance: Account:
Dialog |
Dialog Comments |
NLU output |
1. System: "Welcome to Ajax Bank. Please tell me your name |
|
|
2. User: I'm Jack Jones |
Name: Jack Jones |
|
3. System: I know three people with the name Jack Jones, will you tell my your address, please? |
||
4. User: I live at 18773 Union Street in Carlton |
Address: Street: 18773 Union St City: Carlton |
|
5. System: Then your phone number is 773-8427? |
|
|
6. User: Yep, that's me. |
Yes |
|
7. System: What would you like me to do? |
|
|
8. User: Transfer $200 from my savings account to my checking account. |
Action: transfer Source_account: savings Destination_account: checking Amount: Value: 200 Currency: USD |
|
9. System: You have two savings accounts, your individual savings account and a joint savings account with June Jones. From which savings account should I transfer the $200 from? |
||
10. User: From my individual account, not the joint account |
User clarifies by answering the question twice (once identifying the correct account, and again by identifying the incorrect account) |
Correction: Source_account: individual |
11. System: I'm sorry, I can't do that. You have only $150 in your savings account. |
System enforces a business rule that says a user can't transfer more money that what's in his/her account. |
|
12. User: OK, just transfer the $150. |
User modifies the request by editing the amount to be transferred. |
Correction: Amount: Value: 150 Currency: USD |
13. System: OK, I have transferred $150 from your individual saving account to your checking account. Is there anything else I can do for you? |
System confirms that the transaction is completed. Invite the user to begin another transaction |
|
14. User: Yes, tell me how much money is in our joint saving account |
User asks a new question |
balance: joint account |
15. System: You have $600 in your joint savings account with June Jones. What else can I do for you? |
System correctly identifies the source because of the phrase "our joint" savings account. The system prompt message changes so that the user does not get tired of hearing the same prompt message. |
|
16. User: Yes, transfer $50 from that account. |
|
Action: transfer Source_account: savings Amount: Value: 50 Currency: USD |
17. System: Do you want me to transfer $50 from your joint account with June Jones to your checking account? |
||
18. No, to my individual savings account. |
|
Destination_account: individual savings |
19. System: Do you want me to transfer $50 from your joint savings account with June Jones to your individual savings account? |
System assumes that the source is still joint savings account and that the amount is still $50 |
|
20: User: Yes |
|
Yes |
21.System: I've transferred $50 from your joint savings account with June Jones to your individual savings account. |
System confirms transaction is completed. |
|
22.System: Is there anything else I can do for you? |
System prompts the user for a new transaction |
|
23. User: Yes, can you pay my electric bill? |
User isn't actually asking to have the bill paid, just whether the system can do it. |
Capability_query: Action: transfer Destination_account: electric bill |
24. System: Yes, I can transfer money to General Utilities |
|
|
25: User: OK, pay it |
Action: transfer Destination_account: electric bill |
|
26: System, How much money should I transfer to General Utilities? |
System solicits a value for the amount parameter from the user |
|
27: User: transfer $80 from my joint checking account |
User responds with the value for the requested parameter, and volunteers a parameter for another parameter, the source. |
Operation: transfer Source_account: joint checking Amount: Value: 80 Currency: USD |
28: System: You want to transfer $80 from your joint checking account to General Utilities? |
System asks user to confirm entire transaction |
|
29: User: Make it so |
The user watches Captain Picard on Star Trek and copies his command phrasing. |
Yes |
30: System: Do you want me to do anything else? |
System prompts user for another transaction |
|
31: User: Yes, transfer $800 from the IRS to my savings account |
User tries to do the impossible. |
Operation: transfer Source_account: IRS Destination_account: joint checking account |
32: System: I am not allowed to transfer money from the IRS account |
system enforces business rules about source accounts. |
|
33: User: What can I do to the IRS? |
While the user may be joking, the system treats this as a help request |
Capability_query: action actor: user theme: IRS If the request is not representable within the task model, it may be useful for the system to fall back to a less fully analyzed representation of the utterance, rather than simply to fail to understand it. |
34: System: you may pay your taxes by transferring money to the IRS |
System explains what operations the user can perform with the IRS account |
|
35: User: I don't want to do that. |
User cancels a transaction in progress |
Cancel |
36. System: OK. What else can I do for you? |
|
|
37: User: You're cute, can I call you? |
User tries to make fun of the system |
did not understand |
38: System: I'm sorry, I didn't understand your request. Can you please rephrase it? |
|
|
29: No. Goodbye |
|
Goodbye |
30: Thanks for using Ajax bank, Good bye. |
User is logged off. |
|
Mike Brown, Lucent
Carolina Di Cristo, Telecom Italia
Deborah Dahl, Unisys
Linda Dorrian, Productivity Works
Robert Keiller, Canon
Bill Ledingham, SpeechWorks
Stephen Potter, Entropic
Dave Raggett, HP and W3C
Ramesh Sarukkai, Lernout and Hauspie
Volker Steinbliss, Philips