Automated phrase generation

- Lucent Technologies Inc.

A methodology for automated task selection is provided, where the selected task is identified in natural speech of a user making such a selection. A set of meaningful phrases are determined by a grammatical inference algorithm which operates on a predetermined corpus of speech utterances, each such utterance being associated with a specific task objective, and wherein each utterance is marked with its associated task objective. Each meaningful phrase developed by the grammatical inference algorithm can be characterized as having both a Mutual Information value and a Salience value (relative to an associated task objective) above a predetermined threshold.

Skip to:  ·  Claims  ·  References Cited  · Patent History  ·  Patent History

Claims

1. A method for automated determination of a classification parameter for a selected task, where said selected task is expressed in natural speech of a user, comprising the steps of:

providing a database of speech utterances, each said utterances being characterized as directed to ones of a predetermined set of tasks;
forming from ones of a plurality of said speech utterances a set of speech phrases, each said phrase containing at least one word;
determining a likelihood measure for co-occurrence of constituent words in each of said phrases;
selecting from said set of phrases a subset thereof having a value of said likelihood measure exceeding a predetermined threshold;
determining a significance measure relative to a specified one of said predetermined set of tasks for phrases in said selected subset of said phrases;
selecting from said selected subset of said phrases a set of meaningful phrases having a value of said significance measure exceeding a predetermined threshold, said meaningful phrases constituting said classification parameter.

2. The method for automated determination of a classification parameter of claim 1 wherein said likelihood measure is manifested as a mutual information measure.

3. The method for automated determination of a classification parameter of claim 1 wherein said significance measure is manifested as a salience measure.

4. The method for automated determination of a classification parameter of claim 3 wherein said salience measure is represented as a conditional probability of said task being selected given said speech phrase, said conditional probability being a highest value in a distribution of said conditional probabilities over said set of predetermined tasks.

5. The method for automated determination of a classification parameter of claim 1 wherein said speech utterances are generated in response to a query of a form "How may I help you?".

6. The method for automated determination of a classification parameter of claim 1 wherein each of said speech utterances is labelled with a one of said predetermined set of tasks to which each said utterance is directed.

7. The method for automated determination of a classification parameter of claim 1 wherein said first selecting step and all subsequent steps are iteratively repeated for speech phase of size n words, where n is an integer in a range 1.ltoreq.n.ltoreq.N, N being selected by an implementer of said method.

8. A method for establishing a classification relationship between at least one speech phrase and a one of a predetermined set of task objectives, wherein each said speech phrase is formed from a one of a known corpus of speech utterances and each said speech utterance in said corpus is related to a one of said predetermined set of task objectives, said method comprising the steps of:

generating a plurality of said speech phrases, each having a predetermined number of words, from said corpus;
evaluating a mutual information measure and a salience measure for each said generated speech phrase relative to a given one of said task objectives;
selecting a portion of said plurality of said speech phrases having said mutual information measure and said salience measure above predetermined thresholds;
generating from said corpus a second plurality of said speech phrases using said selected portion as a base, wherein each said speech phrase in said second plurality contains at least one additional word relative to said number of words comprising said speech phrases generated by the initial generating step;
evaluating a mutual information measure and a salience measure for each said speech phrase in said second plurality of said speech phrases relative to said given one of said task objectives;
selecting a portion of said second plurality of said speech phrases having said mutual information measure and said salience measure above predetermined thresholds;
iteratively repeating the immediately preceding generating, evaluating and selecting steps until a set of speech phrases having a desired relationship with said given one of said task objectives is selected.

9. The method for establishing a classification relationship of claim 8 wherein said salience measure is represented as a conditional probability of said task being selected given said speech phrase, said conditional probability being a highest value in a distribution of said conditional probabilities over said set of predetermined tasks.

10. The method for establishing a classification relationship of claim 8 wherein each of said speech utterances is labeled with a one of said predetermined set of tasks to which each said utterance is directed.

11. The method for establishing a classification relationship of claim 8 wherein said speech utterances are generated in response to a query of the form "How may I help you?".

Referenced Cited
U.S. Patent Documents
4866778 September 12, 1989 Baker
5033088 July 16, 1991 Shipman
5384892 January 24, 1995 Strong
5390279 February 14, 1995 Strong
5434906 July 18, 1995 Robinson et al.
Other references
  • Gorin et al., ("Automated Call Routing in a Telecommunications Network", 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications, Kyoto Research park, Kyoto, Japan, Sep. 26-27, 1994, pp. 137-140). Gorin et al., ("An Experiment in Spoken Language Acquisition", IEEE Transactions on Speech and Audio Processing, vol. 2, No. 1, Part II, Jan. 1994, pp. 224-240). Gorin et al., ("On Adaptive Acquisition of Spoken Language", Neural Networks for Signal processing, Aug. 1991, pp. 422-431). Miller et al., ("A Structured Network Architecture for Adaptive Language Acquisition", ICASSP'92: Acoustics, Speech & Signal Processing Conference, vol. 1, 1992, pp. 201-204). Cole et al., ("The Challenge of Spoken Language Systems: Research Directions for the Nineties", IEEE Transactions on Speech and audio processing, Jan. 1995, vol. 3, Issue 1, pp. 1-21).
Patent History
Patent number: 5794193
Type: Grant
Filed: Sep 15, 1995
Date of Patent: Aug 11, 1998
Assignee: Lucent Technologies Inc. (Murray Hill, NJ)
Inventor: Allen Louis Gorin (Berkeley Heights, NJ)
Primary Examiner: David R. Hudspeth
Assistant Examiner: Vijay B. Chawan
Application Number: 8/528,577
Classifications
Current U.S. Class: Specialized Models (704/250); Subportions (704/254); Word Recognition (704/251); Natural Language (704/257)
International Classification: G10L 506;