Patents by Inventor Mazin Rahim

Mazin Rahim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8090583
    Abstract: A method and system are disclosed for providing a dialog interface for a website. The method comprises, at each node in a website, computing a summary, a document description and an alias. A dialog manager within a spoken dialog service utilizes the summary, document description and alias for each website node to generate prompts to a user, wherein nodes in the website are matched with user requests. In this manner, a spoken dialog interface to the website content and navigation may be generated automatically.
    Type: Grant
    Filed: October 30, 2007
    Date of Patent: January 3, 2012
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Srinivas Bangalore, Junlan Feng, Mazin Rahim
  • Patent number: 8065151
    Abstract: A method and system are disclosed for providing a dialog interface for a website. The method comprises, at each node in a website, computing a summary, a document description and an alias. A dialog manager within a spoken dialog service utilizes the summary, document description and alias for each website node to generate prompts to a user, wherein nodes in the website are matched with user requests. In this manner, a spoken dialog interface to the website content and navigation may be generated automatically.
    Type: Grant
    Filed: December 18, 2003
    Date of Patent: November 22, 2011
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Srinivas Bangalore, Junlan Feng, Mazin Rahim
  • Publication number: 20060190253
    Abstract: Utterance data that includes at least a small amount of manually transcribed data is provided. Automatic speech recognition is performed on ones of the utterance data not having a corresponding manual transcription to produce automatically transcribed utterances. A model is trained using all of the manually transcribed data and the automatically transcribed utterances. A predetermined number of utterances not having a corresponding manual transcription are intelligently selected and manually transcribed. Ones of the automatically transcribed data as well as ones having a corresponding manual transcription are labeled. In another aspect of the invention, audio data is mined from at least one source, and a language model is trained for call classification from the mined audio data to produce a language model.
    Type: Application
    Filed: February 23, 2005
    Publication date: August 24, 2006
    Applicant: AT&T Corp.
    Inventors: Dilek Hakkani-Tur, Mazin Rahim, Giuseppe Riccardi, Gokhan Tur
  • Publication number: 20060149555
    Abstract: The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.
    Type: Application
    Filed: January 5, 2005
    Publication date: July 6, 2006
    Applicant: AT&T Corp.
    Inventors: Giuseppe Fabbrizio, Dilek Hakkani-Tur, Mazin Rahim, Bernard Renger, Gokhan Tur
  • Publication number: 20060136375
    Abstract: A system and method for providing a natural language interface to a database or the Internet. The method provides a response from a database to a natural language query. The method comprises receiving a user query, extracting key data from the user query, submitting the extracted key data to a database search engine to retrieve a top n pages from the database, processing the top n pages through a natural language dialog engine and providing a response based on processing the top n pages.
    Type: Application
    Filed: December 16, 2004
    Publication date: June 22, 2006
    Applicant: AT&T Corp.
    Inventors: Richard Cox, Hossein Eslambolchi, Behzad Nadji, Mazin Rahim
  • Publication number: 20050256716
    Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.
    Type: Application
    Filed: May 13, 2004
    Publication date: November 17, 2005
    Applicant: AT&T Corp.
    Inventors: Srinivas Bangalore, Junlan Feng, Mazin Rahim, Juergen Schroeter, David Schulz, Ann Syrdal
  • Publication number: 20050135571
    Abstract: A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a telephone interface can provide speech information, which is converted to text and used to automatically fill in input fields on a webpage form. The form is then submitted to a database search and a response is generated. Information contained on the responsive webpage is extracted and converted to speech via a text-to-speech engine and communicated to the person.
    Type: Application
    Filed: December 19, 2003
    Publication date: June 23, 2005
    Applicant: AT&T Corp.
    Inventors: Srinivas Bangalore, Mazin Rahim, Junlan Feng
  • Patent number: 5737485
    Abstract: A neural network is trained to transform distant-talking cepstrum coefficients, derived from a microphone array receiving speech from a speaker distant therefrom, into a form substantially similar to close-talking cepstrum coefficients that would be derived from a microphone close to the speaker, for providing robust hands-free speech and speaker recognition in adverse practical environments with existing speech and speaker recognition systems which have been trained on close-talking speech.
    Type: Grant
    Filed: March 7, 1995
    Date of Patent: April 7, 1998
    Assignee: Rutgers, The State University of New Jersey
    Inventors: James L. Flanagan, Qiguang Lin, Mazin Rahim, Chiwei Che