Patents by Inventor Mazin G. Rahim

Mazin G. Rahim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Unsupervised and active learning in automatic speech recognition for call classification

Patent number: 8818808

Abstract: Utterance data that includes at least a small amount of manually transcribed data is provided. Automatic speech recognition is performed on ones of the utterance data not having a corresponding manual transcription to produce automatically transcribed utterances. A model is trained using all of the manually transcribed data and the automatically transcribed utterances. A predetermined number of utterances not having a corresponding manual transcription are intelligently selected and manually transcribed. Ones of the automatically transcribed data as well as ones having a corresponding manual transcription are labeled. In another aspect of the invention, audio data is mined from at least one source, and a language model is trained for call classification from the mined audio data to produce a language model.

Type: Grant

Filed: February 23, 2005

Date of Patent: August 26, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Dilek Z. Hakkani-Tur, Mazin G. Rahim, Giuseppe Riccardi, Gokhan Tur
System and Method of Providing an Automated Data-Collection in Spoken Dialog Systems

Publication number: 20140222426

Abstract: The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.

Type: Application

Filed: April 7, 2014

Publication date: August 7, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Giuseppe Di Fabbrizio, Dilek Z. Hakkani-Tur, Mazin G. Rahim, Bernard S. Renger, Gokhan Tur
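
As an illustration only, the two-threshold routing described in the abstract above can be sketched as follows. This is one possible reading of the abstract, not the patented method: the function name `route_utterance`, the `Action` enum, the numeric threshold values, and the assumption that the classifier emits a confidence score between 0 and 1 are all hypothetical.

```python
from enum import Enum


class Action(Enum):
    ACCEPT = "accept"                        # utterance is kept as classified
    REPROMPT = "reprompt"                    # ask the user again
    TRANSFER_TO_HUMAN = "transfer_to_human"  # hand the call to a person


def route_utterance(confidence, accept_threshold=0.7, reject_threshold=0.3):
    """Pick an action from a classifier confidence score.

    Assumed reading of the abstract: at or above the acceptance threshold
    the utterance is accepted; between the two thresholds the user is
    re-prompted; below the rejection threshold the user is transferred.
    The threshold values are placeholders, not figures from the patent.
    """
    if not 0.0 <= reject_threshold <= accept_threshold <= 1.0:
        raise ValueError("expected 0 <= reject_threshold <= accept_threshold <= 1")
    if confidence >= accept_threshold:
        return Action.ACCEPT
    if confidence >= reject_threshold:
        return Action.REPROMPT
    return Action.TRANSFER_TO_HUMAN
```

In a data-collection loop of the kind the abstract describes, the utterance and its assigned class would be logged in every branch so that they can later be used as training data.
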
System and Method of Providing a Spoken Dialog Interface to a Website

Publication number: 20140207462

Abstract: Disclosed is a method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes selecting anchor texts within a website based on a term density, weighting those anchor texts based on a percent of salient words to total words, and incorporating the weighted anchor texts into a live spoken dialog interface, the weights determining a level of incorporation into the live spoken dialog interface.

Type: Application

Filed: March 25, 2014

Publication date: July 24, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim
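
The weighting step in the abstract above ("a percent of salient words to total words") can be illustrated with a short sketch. It covers only that ratio; the term-density selection step is not modeled. The function name `anchor_weight`, the whitespace tokenization, and the way the salient-word set is supplied are assumptions made for illustration, not details from the patent.

```python
def anchor_weight(anchor_text, salient_words):
    """Return the fraction of words in an anchor text that are salient.

    `salient_words` is a hypothetical, caller-supplied set of lowercase
    words. Tokenization is a plain whitespace split, which is an
    assumption; a real system would likely normalize punctuation too.
    """
    tokens = anchor_text.lower().split()
    if not tokens:
        return 0.0  # an empty anchor carries no weight
    hits = sum(1 for token in tokens if token in salient_words)
    return hits / len(tokens)


# Usage: two of the four words are in the salient set.
weight = anchor_weight("Check your billing statement", {"billing", "statement"})
```

A higher weight would then translate into a stronger influence of that anchor text on the dialog interface, in line with the abstract's statement that the weights determine the level of incorporation.
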
System and Method for Generating Customized Text-to-Speech Voices

Publication number: 20140188480

Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.

Type: Application

Filed: March 4, 2014

Publication date: July 3, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim, Juergen Schroeter, Ann K. Syrdal, David Schulz
Recognizing the Numeric Language in Natural Spoken Dialogue

Publication number: 20140163988

Abstract: A system and a method are provided. A speech recognition processor receives unconstrained input speech and outputs a string of words. The speech recognition processor is based on a numeric language that represents a subset of a vocabulary. The subset includes a set of words identified as being for interpreting and understanding number strings. A numeric understanding processor contains classes of rules for converting the string of words into a sequence of digits. The speech recognition processor utilizes an acoustic model database. A validation database stores a set of valid sequences of digits. A string validation processor outputs validity information based on a comparison of a sequence of digits output by the numeric understanding processor with valid sequences of digits in the validation database.

Type: Application

Filed: February 17, 2014

Publication date: June 12, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Mazin G. Rahim, Giuseppe Riccardi, Jeremy Huntley Wright, Bruce Melvin Buntschuh, Allen Louis Gorin
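
To make the abstract above more concrete, the sketch below shows a toy version of two of its stages: converting a recognized word string into a digit sequence, and checking that sequence against a store of valid sequences. The word table, the "double"/"triple" rules, and the names `words_to_digits` and `is_valid_sequence` are illustrative assumptions; the patent's actual rule classes are not reproduced here.

```python
# Hypothetical mapping from spoken number words to digits.
DIGIT_WORDS = {
    "zero": "0", "oh": "0", "one": "1", "two": "2", "three": "3",
    "four": "4", "five": "5", "six": "6", "seven": "7",
    "eight": "8", "nine": "9",
}

# Hypothetical repetition words ("double two" is read as "22").
REPEAT_WORDS = {"double": 2, "triple": 3}


def words_to_digits(words):
    """Convert a list of recognized words into a string of digits.

    Non-numeric words are skipped, so the input can be unconstrained
    speech such as "my number is five five five".
    """
    digits = []
    repeat = 1
    for word in words:
        word = word.lower()
        if word in REPEAT_WORDS:
            repeat = REPEAT_WORDS[word]
        elif word in DIGIT_WORDS:
            digits.append(DIGIT_WORDS[word] * repeat)
            repeat = 1
        else:
            repeat = 1  # a non-numeric word cancels a pending repeat
    return "".join(digits)


def is_valid_sequence(digits, valid_sequences):
    """Check a digit string against a set standing in for the validation database."""
    return digits in valid_sequences


# Usage with a made-up utterance and a made-up set of valid numbers.
recognized = "my number is five five five oh one double two".split()
digits = words_to_digits(recognized)
ok = is_valid_sequence(digits, {"5550122", "5550199"})
```

Here a plain Python set plays the role of the validation database; in the system the abstract describes, that lookup is performed by a separate string validation processor.
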
Method of Handling Frequently Asked Questions in a Natural Language Dialog Service

Publication number: 20140149121

Abstract: A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer frequently asked questions.

Type: Application

Filed: February 3, 2014

Publication date: May 29, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Giuseppe Di Fabbrizio, Dawn L. Dutton, Narendra K. Gupta, Barbara B. Hollister, Mazin G. Rahim, Giuseppe Riccardi, Robert Elias Schapire, Juergen Schroeter
System and Method for Providing a Natural Language Interface to a Database

Publication number: 20140136507

Abstract: A system and method for providing a natural language interface to a database or the Internet. The method provides a response from a database to a natural language query. The method comprises receiving a user query, extracting key data from the user query, submitting the extracted key data to a data base search engine to retrieve a top n pages from the data base, processing of the top n pages through a natural language dialog engine and providing a response based on processing the top n pages.

Type: Application

Filed: January 23, 2014

Publication date: May 15, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Richard Vandervoort Cox, Hossein Eslambolchi, Behzad Nadji, Mazin G. Rahim
Method and apparatus for automatically building conversational systems

Patent number: 8718242

Abstract: A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a telephone interface can provide speech information, which is converted to text and used to automatically fill in input fields on a webpage form. The form is then submitted to a database search and a response is generated. Information contained on the responsive webpage is extracted and converted to speech via a text-to-speech engine and communicated to the person.

Type: Grant

Filed: June 11, 2013

Date of Patent: May 6, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Mazin G. Rahim, Junlan Feng
System and method of providing an automated data-collection in spoken dialog systems

Patent number: 8694324

Abstract: The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.

Type: Grant

Filed: May 21, 2012

Date of Patent: April 8, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Giuseppe Di Fabbrizio, Dilek Z. Hakkani-Tur, Mazin G. Rahim, Bernard S. Renger, Gokhan Tur
System and method of providing a spoken dialog interface to a website

Patent number: 8688456

Abstract: Disclosed is a method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes selecting anchor texts within a website based on a term density, weighting those anchor texts based on a percent of salient words to total words, and incorporating the weighted anchor texts into a live spoken dialog interface, the weights determining a level of incorporation into the live spoken dialog interface.

Type: Grant

Filed: May 10, 2013

Date of Patent: April 1, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim
System and Method of Spoken Language Understanding in Human Computer Dialogs

Publication number: 20140074477

Abstract: A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.

Type: Application

Filed: November 15, 2013

Publication date: March 13, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Narendra K. Gupta, Mazin G. Rahim
System and method for providing a natural language interface to a database

Patent number: 8671088

Abstract: A system and method for providing a natural language interface to a database or the Internet. The method provides a response from a database to a natural language query. The method comprises receiving a user query, extracting key data from the user query, submitting the extracted key data to a data base search engine to retrieve a top n pages from the data base, processing of the top n pages through a natural language dialog engine and providing a response based on processing the top n pages.

Type: Grant

Filed: April 1, 2013

Date of Patent: March 11, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Richard Vandervoort Cox, Hossein Eslambolchi, Behzad Nadji, Mazin G. Rahim
System and method for generating customized text-to-speech voices

Patent number: 8666746

Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.

Type: Grant

Filed: May 13, 2004

Date of Patent: March 4, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim, Juergen Schroeter, David Eugene Schulz, Ann K. Syrdal
Recognizing the numeric language in natural spoken dialogue

Patent number: 8655658

Abstract: A system and a method are provided. A speech recognition processor receives unconstrained input speech and outputs a string of words. The speech recognition processor is based on a numeric language that represents a subset of a vocabulary. The subset includes a set of words identified as being for interpreting and understanding number strings. A numeric understanding processor contains classes of rules for converting the string of words into a sequence of digits. The speech recognition processor utilizes an acoustic model database. A validation database stores a set of valid sequences of digits. A string validation processor outputs validity information based on a comparison of a sequence of digits output by the numeric understanding processor with valid sequences of digits in the validation database.

Type: Grant

Filed: October 25, 2011

Date of Patent: February 18, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mazin G. Rahim, Giuseppe Riccardi, Jeremy Huntley Wright, Bruce Melvin Buntschuh, Allen Louis Gorin
Method of handling frequently asked questions in a natural language dialog service

Patent number: 8645122

Abstract: A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer frequently asked questions.

Type: Grant

Filed: December 19, 2002

Date of Patent: February 4, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Giuseppe Di Fabbrizio, Dawn L. Dutton, Narendra K. Gupta, Barbara B. Hollister, Mazin G. Rahim, Giuseppe Riccardi, Robert Elias Schapire, Juergen Schroeter
System and method of spoken language understanding in human computer dialogs

Patent number: 8612232

Abstract: A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.

Type: Grant

Filed: February 25, 2013

Date of Patent: December 17, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Narendra K. Gupta, Mazin G. Rahim
Automatic detection, summarization and reporting of business intelligence highlights from automated dialog systems

Patent number: 8589172

Abstract: A method and system for reporting data from a spoken dialog service is disclosed. The method comprises extracting data regarding user dialogs using a dialog logging module in the spoken dialog service, analyzing the data to identify trends and reporting the trends. The data may be presented in a visual form for easier consumption. The method may also relate to identifying data within the control or outside the control of a service provider that is used to adjust the spoken dialog service to maximize customer retention.

Type: Grant

Filed: November 28, 2011

Date of Patent: November 19, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Tirso M. Alonso, Shona Douglas, Mazin G. Rahim, Benjamin J. Stern
System and method of automating a spoken dialogue service

Patent number: 8566102

Abstract: A system and method of generating and operating a spoken dialog service for a web-site are disclosed. The system parses web-site data and organizes the web-site data in a task knowledge data bank. The system receives text associated with a user query; processes the received text in a spoken language understanding (SLU) module, the SLU module using the web-site data from the task knowledge data bank; generates a ranked list of relevant responses to the user query; generates a hierarchical tree using the web-site data and the ranked list of relevant responses to the user query, generates a response to the user query using the hierarchical tree; and presents the response to the user.

Type: Grant

Filed: November 6, 2002

Date of Patent: October 22, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim
Method and Apparatus for Automatically Building Conversational Systems

Publication number: 20130275132

Abstract: A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a telephone interface can provide speech information, which is converted to text and used to automatically fill in input fields on a webpage form. The form is then submitted to a database search and a response is generated. Information contained on the responsive webpage is extracted and converted to speech via a text-to-speech engine and communicated to the person.

Type: Application

Filed: June 11, 2013

Publication date: October 17, 2013

Inventors: Srinivas Bangalore, Mazin G. Rahim, Junlan Feng
System and Method of Providing a Spoken Dialog Interface to a Website

Publication number: 20130246069

Abstract: Disclosed is a method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes selecting anchor texts within a website based on a term density, weighting those anchor texts based on a percent of salient words to total words, and incorporating the weighted anchor texts into a live spoken dialog interface, the weights determining a level of incorporation into the live spoken dialog interface.

Type: Application

Filed: May 10, 2013

Publication date: September 19, 2013

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim
