Patents by Inventor Mazin Rahim

Mazin Rahim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method of automatically generating building dialog services by exploiting the content and structure of websites

Patent number: 8090583

Abstract: A method and system are disclosed for providing a dialog interface for a website. The method comprises at each node in a website, computing a summary, a document description and an alias. A dialog manager within a spoken dialog service utilizes the summary, document description and alias for each website node to generate prompts to a user, wherein nodes in the website are matched with user requests. In this manner, a spoken dialog interface to the website content and navigation may be generated automatically.

Type: Grant

Filed: October 30, 2007

Date of Patent: January 3, 2012

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin Rahim
System and method of automatically building dialog services by exploiting the content and structure of websites

Patent number: 8065151

Abstract: A method and system are disclosed for providing a dialog interface for a website. The method comprises at each node in a website, computing a summary, a document description and an alias. A dialog manager within a spoken dialog service utilizes the summary, document description and alias for each website node to generate prompts to a user, wherein nodes in the website are matched with user requests. In this manner, a spoken dialog interface to the website content and navigation may be generated automatically.

Type: Grant

Filed: December 18, 2003

Date of Patent: November 22, 2011

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin Rahim
Unsupervised and active learning in automatic speech recognition for call classification

Publication number: 20060190253

Abstract: Utterance data that includes at least a small amount of manually transcribed data is provided. Automatic speech recognition is performed on ones of the utterance data not having a corresponding manual transcription to produce automatically transcribed utterances. A model is trained using all of the manually transcribed data and the automatically transcribed utterances. A predetermined number of utterances not having a corresponding manual transcription are intelligently selected and manually transcribed. Ones of the automatically transcribed data as well as ones having a corresponding manual transcription are labeled. In another aspect of the invention, audio data is mined from at least one source, and a language model is trained for call classification from the mined audio data to produce a language model.

Type: Application

Filed: February 23, 2005

Publication date: August 24, 2006

Applicant: AT&T Corp.

Inventors: Dilek Hakkani-Tur, Mazin Rahim, Giuseppe Riccardi, Gokhan Tur
System and method of providing an automated data-collection in spoken dialog systems

Publication number: 20060149555

Abstract: The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.

Type: Application

Filed: January 5, 2005

Publication date: July 6, 2006

Applicant: AT&T Corp.

Inventors: Giuseppe Fabbrizio, Dilek Hakkani-Tur, Mazin Rahim, Bernard Renger, Gokhan Tur
System and method for providing a natural language interface to a database

Publication number: 20060136375

Abstract: A system and method for providing a natural language interface to a database or the Internet. The method provides a response from a database to a natural language query. The method comprises receiving a user query, extracting key data from the user query, submitting the extracted key data to a data base search engine to retrieve a top n pages from the data base, processing of the top n pages through a natural language dialog engine and providing a response based on processing the top n pages.

Type: Application

Filed: December 16, 2004

Publication date: June 22, 2006

Applicant: AT&T Corp.

Inventors: Richard Cox, Hossein Eslambolchi, Behzad Nadji, Mazin Rahim
System and method for generating customized text-to-speech voices

Publication number: 20050256716

Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.

Type: Application

Filed: May 13, 2004

Publication date: November 17, 2005

Applicant: AT&T Corp.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin Rahim, Juergen Schroeter, David Schulz, Ann Syrdal
Method and apparatus for automatically building conversational systems

Publication number: 20050135571

Abstract: A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a telephone interface can provide speech information, which is converted to text and used to automatically fill in input fields on a webpage form. The form is then submitted to a database search and a response is generated. Information contained on the responsive webpage is extracted and converted to speech via a text-to-speech engine and communicated to the person.

Type: Application

Filed: December 19, 2003

Publication date: June 23, 2005

Applicant: AT&T Corp.

Inventors: Srinivas Bangalore, Mazin Rahim, Junlan Feng
Method and apparatus including microphone arrays and neural networks for speech/speaker recognition systems

Patent number: 5737485

Abstract: A neural network is trained to transform distant-talking cepstrum coefficients, derived from a microphone array receiving speech from a speaker distant therefrom, into a form substantially similar to close-talking cepstrum coefficients that would be derived from a microphone close to the speaker, for providing robust hands-free speech and speaker recognition in adverse practical environments with existing speech and speaker recognition systems which have been trained on close-talking speech.

Type: Grant

Filed: March 7, 1995

Date of Patent: April 7, 1998

Assignee: Rutgers The State University of New Jersey

Inventors: James L. Flanagan, Qiguang Lin, Mazin Rahim, Chiwei Che