Patents by Inventor Gokhan Tur

Gokhan Tur has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method of spoken language understanding using word confusion networks

Patent number: 7571098

Abstract: Word lattices that are generated by an automatic speech recognition system are used to generate a modified word lattice that is usable by a spoken language understanding module. In one embodiment, the spoken language understanding module determines a set of salient phrases by calculating an intersection of the modified word lattice, which is optionally preprocessed, and a finite state machine that includes a plurality of salient grammar fragments.

Type: Grant

Filed: May 29, 2003

Date of Patent: August 4, 2009

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Allen Louis Gorin, Dilek Z. Hakkani-Tur, Giuseppe Riccardi, Gokhan Tur, Jeremy Huntley Wright
Active learning process for spoken dialog systems

Patent number: 7562014

Abstract: A large amount of human labor is required to transcribe and annotate a training corpus that is needed to create and update models for automatic speech recognition (ASR) and spoken language understanding (SLU). Active learning enables a reduction in the amount of transcribed and annotated data required to train ASR and SLU models. In one aspect of the present invention, an active learning ASR process and active learning SLU process are coupled, thereby enabling further efficiencies to be gained relative to a process that maintains an isolation of data in both the ASR and SLU domains.

Type: Grant

Filed: September 26, 2007

Date of Patent: July 14, 2009

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Dilek Z Hakkani-Tur, Mazin G Rahim, Giuseppe Riccardi, Gokhan Tur
Active labeling for spoken language understanding

Patent number: 7562017

Abstract: An active labeling process is provided that aims to minimize the number of utterances to be checked again by automatically selecting the ones that are likely to be erroneous or inconsistent with the previously labeled examples. In one embodiment, the errors and inconsistencies are identified based on the confidences obtained from a previously trained classifier model. In a second embodiment, the errors and inconsistencies are identified based on an unsupervised learning process. In both embodiments, the active labeling process is not dependent upon the particular classifier model.

Type: Grant

Filed: September 27, 2007

Date of Patent: July 14, 2009

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Dilek Z. Hakkani-Tur, Mazin G. Rahim, Gokhan Tur
SYSTEMS AND METHODS FOR REDUCING ANNOTATION TIME

Publication number: 20080270130

Abstract: Systems and methods for annotating speech data. The present invention reduces the time required to annotate speech data by selecting utterances for annotation that will be of greatest benefit. A selection module uses speech models, including speech recognition models and spoken language understanding models, to identify utterances that should be annotated based on criteria such as confidence scores generated by the models. These utterances are placed in an annotation list along with a type of annotation to be performed for the utterances and an order in which the annotation should proceed. The utterances in the annotation list can be annotated for speech recognition purposes, spoken language understanding purposes, labeling purposes, etc. The selection module can also select utterances for annotation based on previously annotated speech data and deficiencies in the various models.

Type: Application

Filed: July 1, 2008

Publication date: October 30, 2008

Applicant: AT&T Corp.

Inventors: Tirso M. Alonso, Ilana Bromberg, Dilek Z. Hakkani-Tur, Barbara B. Hollister, Mazin G. Rahim, Giuseppe Riccardi, Lawrence Lyon Rose, Daniel Leon Stern, Gokhan Tur, James M. Wilson
Reducing time for annotating speech data to develop a dialog application

Patent number: 7412383

Abstract: Systems and methods for annotating speech data. The present invention reduces the time required to annotate speech data by selecting utterances for annotation that will be of greatest benefit. A selection module uses speech models, including speech recognition models and spoken language understanding models, to identify utterances that should be annotated based on criteria such as confidence scores generated by the models. These utterances are placed in an annotation list along with a type of annotation to be performed for the utterances and an order in which the annotation should proceed. The utterances in the annotation list can be annotated for speech recognition purposes, spoken language understanding purposes, labeling purposes, etc. The selection module can also select utterances for annotation based on previously annotated speech data and deficiencies in the various models.

Type: Grant

Filed: April 4, 2003

Date of Patent: August 12, 2008

Assignee: AT&T Corp

Inventors: Tirso M. Alonso, Ilana Bromberg, Dilek Z. Hakkani-Tur, Barbara B. Hollister, Mazin G. Rahim, Giuseppe Riccardi, Lawrence Lyon Rose, Daniel Leon Stern, Gokhan Tur, James M. Wilson
METHOD AND APPARATUS FOR SPEAKER RECOGNITION

Publication number: 20080010065

Abstract: A method and apparatus for speaker recognition is provided. One embodiment of a method for determining whether a given speech signal is produced by an alleged speaker, where a plurality of statistical models (including at least one support vector machine) have been produced for the alleged speaker based on a previous speech signal received from the alleged speaker, includes receiving the given speech signal, the speech signal representing an utterance made by a speaker claiming to be the alleged speaker, scoring the given speech signal using at least two modeling systems, where at least one of the modeling systems is a support vector machine, combining scores produced by the modeling systems, with equal weights, to produce a final score, and determining, in accordance with the final score, whether the speaker is likely the alleged speaker.

Type: Application

Filed: June 5, 2007

Publication date: January 10, 2008

Inventors: Harry BRATT, Luciana Ferrer, Martin Graciarena, Sachin Kajarekar, Elizabeth Shriberg, Mustafa Sonmez, Andreas Stolcke, Gokhan Tur, Anand Venkataraman
Method for building a natural language understanding model for a spoken dialog system

Patent number: 7295981

Abstract: A method of generating a natural language model for use in a spoken dialog system is disclosed. The method comprises using sample utterances and creating a number of hand crafted rules for each call-type defined in a labeling guide. A first NLU model is generated and tested using the hand crafted rules and sample utterances. A second NLU model is built using the sample utterances as new training data and using the hand crafted rules. The second NLU model is tested for performance using a first batch of labeled data. A series of NLU models are built by adding a previous batch of labeled data to training data and using a new batch of labeling data as test data to generate the series of NLU models with training data that increases constantly. If not all the labeling data is received, the method comprises repeating the step of building a series of NLU models until all labeling data is received.

Type: Grant

Filed: January 9, 2004

Date of Patent: November 13, 2007

Assignee: AT&T Corp.

Inventors: Narendra K. Gupta, Mazin G. Rahim, Gokhan Tur, Antony Van der Mude
Active labeling for spoken language understanding

Patent number: 7292982

Abstract: An active labeling process is provided that aims to minimize the number of utterances to be checked again by automatically selecting the ones that are likely to be erroneous or inconsistent with the previously labeled examples. In one embodiment, the errors and inconsistencies are identified based on the confidences obtained from a previously trained classifier model. In a second embodiment, the errors and inconsistencies are identified based on an unsupervised learning process. In both embodiments, the active labeling process is not dependent upon the particular classifier model.

Type: Grant

Filed: May 29, 2003

Date of Patent: November 6, 2007

Assignee: AT&T Corp.

Inventors: Dilek Z. Hakkani-Tur, Mazin G. Rahim, Gokhan Tur
Active learning process for spoken dialog systems

Patent number: 7292976

Abstract: A large amount of human labor is required to transcribe and annotate a training corpus that is needed to create and update models for automatic speech recognition (ASR) and spoken language understanding (SLU). Active learning enables a reduction in the amount of transcribed and annotated data required to train ASR and SLU models. In one aspect of the present invention, an active learning ASR process and active learning SLU process are coupled, thereby enabling further efficiencies to be gained relative to a process that maintains an isolation of data in both the ASR and SLU domains.

Type: Grant

Filed: May 29, 2003

Date of Patent: November 6, 2007

Assignee: AT&T Corp.

Inventors: Dilek Z. Hakkani-Tur, Mazin G. Rahim, Giuseppe Riccardi, Gokhan Tur
Active learning for spoken language understanding

Patent number: 7263486

Abstract: Disclosed is a system and method of training a spoken language understanding module. Such a module may be utilized in a spoken dialog system. The method of training a spoken language understanding module comprises training acoustic and language models using a small set of transcribed data ST, recognizing utterances in a set Su that are candidates for transcription using the acoustic and language models, computing confidence scores of the utterances, selecting k utterances that have the smallest confidence scores from Su and transcribing them into a new set Si, redefining St as the union of St and Si, redefining Su as Su minus Si, and returning to the step of training acoustic and language models if word accuracy has not converged.

Type: Grant

Filed: April 1, 2003

Date of Patent: August 28, 2007

Assignee: AT&T Corp.

Inventors: Dilek Z. Hakkani-Tur, Robert Elias Schapire, Gokhan Tur
Answer determination for natural language questioning

Publication number: 20070136246

Abstract: Open-domain question answering is the task of finding a concise answer to a natural language question using a large domain, such as the Internet. The use of a semantic role labeling approach to the extraction of the answers to an open domain factoid (Who/When/What/Where) natural language question that contains a predicate is described. Semantic role labeling identities predicates and semantic argument phrases in the natural language question and the candidate sentences. When searching for an answer to a natural language question, the missing argument in the question is matched using semantic parses of the candidate answers. Such a technique may improve the accuracy of a question answering system and may decrease the length of answers for enabling voice interface to a question answering system.

Type: Application

Filed: December 28, 2005

Publication date: June 14, 2007

Applicant: AT&T Corp.

Inventors: Svetlana Stenchikova, Gokhan Tur, Dilek Tur
System and method for integrating and managing E-mail, voicemail, and telephone conversations using speech processing techniques

Publication number: 20070041522

Abstract: A system and a method are provided. A textual transcript of a recorded voice communication is received. Text from a non-voice communication is received. Information about the textual transcript of the recorded voice communication and the text from the non-voice communication is provided such that a user can manage a group of communications, based at least in part on the textual transcript of the recorded voice communication and the text from the non-voice communication.

Type: Application

Filed: August 19, 2005

Publication date: February 22, 2007

Applicant: AT&T Corp.

Inventors: Alicia Abella, Brian Amento, Dilek Hakkani-Tur, Larry Stead, Gokhan Tur, Jay Wilpon, Jeremy Wright
Apparatus and method for model adaptation for spoken language understanding

Publication number: 20060212293

Abstract: An apparatus and a method are provided for building a spoken language understanding model. Labeled data may be obtained for a target application. A new classification model may be formed for use with the target application by using the labeled data for adaptation of an existing classification model. In some implementations, the existing classification model may be used to determine the most informative examples to label.

Type: Application

Filed: March 21, 2005

Publication date: September 21, 2006

Applicant: AT&T Corp.

Inventor: Gokhan Tur
Unsupervised and active learning in automatic speech recognition for call classification

Publication number: 20060190253

Abstract: Utterance data that includes at least a small amount of manually transcribed data is provided. Automatic speech recognition is performed on ones of the utterance data not having a corresponding manual transcription to produce automatically transcribed utterances. A model is trained using all of the manually transcribed data and the automatically transcribed utterances. A predetermined number of utterances not having a corresponding manual transcription are intelligently selected and manually transcribed. Ones of the automatically transcribed data as well as ones having a corresponding manual transcription are labeled. In another aspect of the invention, audio data is mined from at least one source, and a language model is trained for call classification from the mined audio data to produce a language model.

Type: Application

Filed: February 23, 2005

Publication date: August 24, 2006

Applicant: AT&T Corp.

Inventors: Dilek Hakkani-Tur, Mazin Rahim, Giuseppe Riccardi, Gokhan Tur
System and method of providing an automated data-collection in spoken dialog systems

Publication number: 20060149555

Abstract: The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.

Type: Application

Filed: January 5, 2005

Publication date: July 6, 2006

Applicant: AT&T Corp.

Inventors: Giuseppe Fabbrizio, Dilek Hakkani-Tur, Mazin Rahim, Bernard Renger, Gokhan Tur
Library of existing spoken dialog data for use in generating new natural language spoken dialog systems

Publication number: 20060149554

Abstract: A machine-readable medium may include a group of reusable components for building a spoken dialog system. The reusable components may include a group of previously collected audible utterances. A machine-implemented method to build a library of reusable components for use in building a natural language spoken dialog system may include storing a dataset in a database. The dataset may include a group of reusable components for building a spoken dialog system. The reusable components may further include a group of previously collected audible utterances. A second method may include storing at least one set of data. Each one of the at least one set of data may include ones of the reusable components associated with audible data collected during a different collection phase.

Type: Application

Filed: January 5, 2005

Publication date: July 6, 2006

Applicant: AT&T Corp.

Inventors: Lee Begeja, Giuseppe Fabbrizio, David Gibbon, Dilek Hakkani-Tur, Zhu Liu, Bernard Renger, Behzad Shahraray, Gokhan Tur
Error prediction in spoken dialog systems

Publication number: 20060149544

Abstract: A spoken dialog system configured to use a combined confidence score. A first confidence score, indicating a confidence level in a speech recognition result of recognizing an utterance, is provided. A second confidence level, indicating a confidence level of mapping the speech recognition result to an intent, is provided. The first confidence score and the second confidence score are combined to form a combined confidence score. A determination is made, with respect to whether to accept the intent, based on the combined confidence score.

Type: Application

Filed: January 5, 2005

Publication date: July 6, 2006

Applicant: AT&T Corp.

Inventors: Dilek Hakkani-Tur, Giuseppe Riccardi, Gokhan Tur
Apparatus and method for spoken language understanding by using semantic role labeling

Publication number: 20060080101

Abstract: An apparatus and a method are provided for using semantic role labeling for spoken language understanding. A received utterance semantically parsed by semantic role labeling. A predicate or at least one argument is extracted from the semantically parsed utterance. An intent is estimated based on the predicate or the at least one argument. In another aspect, a method is provided for training a spoken language dialog system that uses semantic role labeling. An expert is provided with a group of predicate/argument pairs. Ones of the predicate/argument pairs are selected as intents. Ones of the arguments are selected as named entities. Mappings from the arguments to frame slots are designed.

Type: Application

Filed: March 31, 2005

Publication date: April 13, 2006

Applicant: AT&T Corp.

Inventors: Ananlada Chotimongkol, Dilek Hakkani-Tur, Gokhan Tur

prev … 4 5 6 7 8