Patents by Inventor Mazin Gilbert

Mazin Gilbert has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and Method for Combining Speech Recognition Outputs From a Plurality of Domain-Specific Speech Recognizers Via Machine Learning

Publication number: 20140358537

Abstract: Disclosed herein are systems, methods and non-transitory computer-readable media for performing speech recognition across different applications or environments without model customization or prior knowledge of the domain of the received speech. The disclosure includes recognizing received speech with a collection of domain-specific speech recognizers, determining a speech recognition confidence for each of the speech recognition outputs, selecting speech recognition candidates based on a respective speech recognition confidence for each speech recognition output, and combining selected speech recognition candidates to generate text based on the combination.

Type: Application

Filed: August 14, 2014

Publication date: December 4, 2014

Inventors: Mazin GILBERT, Srinivas BANGALORE, Patrick HAFFNER, Robert BELL
On-Demand Language Translation for Television Programs

Publication number: 20140350915

Abstract: In an embodiment, a method of providing an on demand translation service is provided. A subscriber may be charged a reduced fee or no fee for use of the on demand translation service in exchange for displaying commercial messages to the subscriber, the commercial messages being selected based on subscriber information. A multimedia signal including information in a source language may be received. The information may be obtained as text in the source language from the multimedia signal. The text may be translated from the source language to a target language. Translated information, based on the translated text, may be transmitted to a processing device for presentation to the subscriber. The received multimedia signal may be sent to a multimedia device for viewing.

Type: Application

Filed: August 12, 2014

Publication date: November 27, 2014

Inventors: Srinivas BANGALORE, David Crawford GIBBON, Mazin GILBERT, Patrick Guy HAFFNER, Zhu LIU, Behzad SHAHRARAY
System and method for increasing recognition rates of in-vocabulary words by improving pronunciation modeling

Patent number: 8892441

Abstract: The present disclosure relates to systems, methods, and computer-readable media for generating a lexicon for use with speech recognition. The method includes overgenerating potential pronunciations based on symbolic input, identifying potential pronunciations in a speech recognition context, and storing the identified potential pronunciations in a lexicon. Overgenerating potential pronunciations can include establishing a set of conversion rules for short sequences of letters, converting portions of the symbolic input into a number of possible lexical pronunciation variants based on the set of conversion rules, modeling the possible lexical pronunciation variants in one of a weighted network and a list of phoneme lists, and iteratively retraining the set of conversion rules based on improved pronunciations. Symbolic input can include multiple examples of a same spoken word. Speech data can be labeled explicitly or implicitly and can include words as text and recorded audio.

Type: Grant

Filed: December 5, 2011

Date of Patent: November 18, 2014

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Alistair D. Conkie, Mazin Gilbert, Andrej Ljolje
Methods and Systems for Natural Language Understanding Using Human Knowledge and Collected Data

Publication number: 20140330555

Abstract: Disclosed herein are systems and methods to incorporate human knowledge when developing and using statistical models for natural language understanding. The disclosed systems and methods embrace a data-driven approach to natural language understanding which progresses seamlessly along the continuum of availability of annotated collected data, from when there is no available annotated collected data to when there is any amount of annotated collected data.

Type: Application

Filed: July 23, 2014

Publication date: November 6, 2014

Inventors: Srinivas Bangalore, Mazin Gilbert, Narendra K. Gupta
METHOD AND SYSTEM FOR PROVIDING AN AUTOMATED WEB TRANSCRIPTION SERVICE

Publication number: 20140316780

Abstract: A system, method and computer readable medium that provides an automated web transcription service is disclosed. The method may include receiving input speech from a user using a communications network, recognizing the received input speech, understanding the recognized speech, transcribing the understood speech to text, storing the transcribed text in a database, receiving a request via a web page to display the transcribed text, retrieving transcribed text from the database, and displaying the transcribed text to the requester using the web page.

Type: Application

Filed: July 2, 2014

Publication date: October 23, 2014

Inventors: Mazin GILBERT, Stephan KANTHAK
Method and Apparatus for Identifying Acoustic Background Environments Based on Time and Speed to Enhance Automatic Speech Recognition

Publication number: 20140303972

Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition as the received audio signal using the selected acoustic model.

Type: Application

Filed: June 23, 2014

Publication date: October 9, 2014

Inventor: Mazin GILBERT
System and method for tracking fraudulent electronic transactions using voiceprints of uncommon words

Patent number: 8831941

Abstract: Disclosed are systems, methods, and computer readable media for comparing customer voice prints comprising of uncommonly spoken words with a database of known fraudulent voice signatures and continually updating the database to decrease the risk of identity theft. The method embodiment comprises comparing a received voice signal against a database of known fraudulent voice signatures, denying the caller's transaction if the voice signal substantially matches the database of known fraudulent voice signatures, adding the caller's voice signal to the database of known fraudulent voice signatures if the voice signal does not substantially match a separate speaker verification database and received additional information is not verified.

Type: Grant

Filed: May 29, 2007

Date of Patent: September 9, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mazin Gilbert, Jay Wilpon
System and method for combining speech recognition outputs from a plurality of domain-specific speech recognizers via machine learning

Patent number: 8812321

Abstract: Disclosed herein are systems, methods and non-transitory computer-readable media for performing speech recognition across different applications or environments without model customization or prior knowledge of the domain of the received speech. The disclosure includes recognizing received speech with a collection of domain-specific speech recognizers, determining a speech recognition confidence for each of the speech recognition outputs, selecting speech recognition candidates based on a respective speech recognition confidence for each speech recognition output, and combining selected speech recognition candidates to generate text based on the combination.

Type: Grant

Filed: September 30, 2010

Date of Patent: August 19, 2014

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Mazin Gilbert, Srinivas Bangalore, Patrick Haffner, Robert Bell
On-demand language translation for television programs

Patent number: 8805668

Abstract: In an embodiment, a method of providing an on demand translation service is provided. A subscriber may be charged a reduced fee or no fee for use of the on demand translation service in exchange for displaying commercial messages to the subscriber, the commercial messages being selected based on subscriber information. A multimedia signal including information in a source language may be received. The information may be obtained as text in the source language from the multimedia signal. The text may be translated from the source language to a target language. Translated information, based on the translated text, may be transmitted to a processing device for presentation to the subscriber. The received multimedia signal may be sent to a multimedia device for viewing.

Type: Grant

Filed: October 4, 2010

Date of Patent: August 12, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, David Crawford Gibbon, Mazin Gilbert, Patrick Guy Haffner, Zhu Liu, Behzad Shahraray
Methods and systems for natural language understanding using human knowledge and collected data

Patent number: 8798990

Abstract: Disclosed herein are systems and methods to incorporate human knowledge when developing and using statistical models for natural language understanding. The disclosed systems and methods embrace a data-driven approach to natural language understanding which progresses seamlessly along the continuum of availability of annotated collected data, from when there is no available annotated collected data to when there is any amount of annotated collected data.

Type: Grant

Filed: April 30, 2013

Date of Patent: August 5, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Mazin Gilbert, Narendra K. Gupta
Method and Apparatus for Responding to an Inquiry

Publication number: 20140205985

Abstract: Disclosed is a method and apparatus for responding to an inquiry from a client via a network. The method and apparatus receive the inquiry from a client via a network. Based on the inquiry, question-answer pairs retrieved from the network are analyzed to determine a response to the inquiry. The QA pairs are not predefined. As a result, the QA pairs have to be analyzed in order to determine whether they are responsive to a particular inquiry. Questions of the QA pairs may be repetitive and, without more, will not be useful in determining whether their corresponding answer responds to an inquiry.

Type: Application

Filed: March 19, 2014

Publication date: July 24, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Junlan Feng, JR., Mazin Gilbert, Dilek Hakkani-Tur, Gokhan Tur
Method and system for providing an automated web transcription service

Patent number: 8775176

Abstract: A system, method and computer readable medium that provides an automated web transcription service is disclosed. The method may include receiving input speech from a user using a communications network, recognizing the received input speech, understanding the recognized speech, transcribing the understood speech to text, storing the transcribed text in a database, receiving a request via a web page to display the transcribed text, retrieving transcribed text from the database, and displaying the transcribed text to the requester using the web page.

Type: Grant

Filed: August 26, 2013

Date of Patent: July 8, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mazin Gilbert, Stephan Kanthak
Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition

Patent number: 8762143

Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information based on a previously recorded time and speed of the caller, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition as the received audio signal using the selected acoustic model.

Type: Grant

Filed: May 29, 2007

Date of Patent: June 24, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventor: Mazin Gilbert
System and method for optimizing speech recognition and natural language parameters with user feedback

Patent number: 8738375

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.

Type: Grant

Filed: May 9, 2011

Date of Patent: May 27, 2014

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Mazin Gilbert, Vincent Goffin, Taniya Mishra
Method and apparatus for responding to an inquiry

Patent number: 8719010

Abstract: Disclosed is a method and apparatus for responding to an inquiry from a client via a network. The method and apparatus receive the inquiry from a client via a network. Based on the inquiry, question-answer pairs retrieved from the network are analyzed to determine a response to the inquiry. The QA pairs are not predefined. As a result, the QA pairs have to be analyzed in order to determine whether they are responsive to a particular inquiry. Questions of the QA pairs may be repetitive and, without more, will not be useful in determining whether their corresponding answer responds to an inquiry.

Type: Grant

Filed: March 1, 2013

Date of Patent: May 6, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Junlan Feng, Mazin Gilbert, Dilek Hakkani-Tur, Gokhan Tur
On-Demand Language Translation for Television Programs

Publication number: 20140053171

Abstract: A method, a system and a machine-readable medium are provided for an on demand translation service. A translation module including at least one language pair module for translating a source language to a target language may be made available for use by a subscriber. The subscriber may be charged a fee for use of the requested on demand translation service or may be provided use of the on demand translation service for free in exchange for displaying commercial messages to the subscriber. A video signal may be received including information in the source language, which may be obtained as text from the video signal and may be translated from the source language to the target language by use of the translation module. Translated information, based on the translated text, may be added into the received video signal.

Type: Application

Filed: October 24, 2013

Publication date: February 20, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, David Crawford Gibbon, Mazin Gilbert, Patrick Guy Haffner, Zhu Liu, Behzad Shahraray
Method and System for Providing an Automated Web Transcription Service

Publication number: 20130346086

Abstract: A system, method and computer readable medium that provides an automated web transcription service is disclosed. The method may include receiving input speech from a user using a communications network, recognizing the received input speech, understanding the recognized speech, transcribing the understood speech to text, storing the transcribed text in a database, receiving a request via a web page to display the transcribed text, retrieving transcribed text from the database, and displaying the transcribed text to the requester using the web page.

Type: Application

Filed: August 26, 2013

Publication date: December 26, 2013

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Mazin GILBERT, Stephan KANTHAK
System and method for optimizing response handling time and customer satisfaction scores

Patent number: 8612532

Abstract: A system and method disclosed for using and updating a database of template responses for a live agent in response to user communications. The method includes computing an average string distance between each response from a live agent and a template, use to generate the response, modifying the computed average string distance based on a customer satisfaction score associated with each response and selecting a response that minimizes the computed average string distance and maximizes customer satisfaction. Upon receiving a further communication on a certain issue, the system presents a prototype response that has been added to the template database to the live agent for use in generating a response to the further communication that reduces handling time and increases customer satisfaction.

Type: Grant

Filed: November 30, 2012

Date of Patent: December 17, 2013

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Srinivas Bangalore, Mazin Gilbert
System and method for improving robustness of speech recognition using vocal tract length normalization codebooks

Patent number: 8600744

Abstract: Disclosed are systems, methods, and computer readable media for performing speech recognition. The method embodiment comprises selecting a codebook from a plurality of codebooks with a minimal acoustic distance to a received speech sample, the plurality of codebooks generated by a process of (a) computing a vocal tract length for a each of a plurality of speakers, (b) for each of the plurality of speakers, clustering speech vectors, and (c) creating a codebook for each speaker, the codebook containing entries for the respective speaker's vocal tract length, speech vectors, and an optional vector weight for each speech vector, (2) applying the respective vocal tract length associated with the selected codebook to normalize the received speech sample for use in speech recognition, and (3) recognizing the received speech sample based on the respective vocal tract length associated with the selected codebook.

Type: Grant

Filed: April 13, 2012

Date of Patent: December 3, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventor: Mazin Gilbert
Methods and Systems for Natural Language Understanding Using Human Knowledge and Collected Data

Publication number: 20130311170

Abstract: Disclosed herein are systems and methods to incorporate human knowledge when developing and using statistical models for natural language understanding. The disclosed systems and methods embrace a data-driven approach to natural language understanding which progresses seamlessly along the continuum of availability of annotated collected data, from when there is no available annotated collected data to when there is any amount of annotated collected data.

Type: Application

Filed: April 30, 2013

Publication date: November 21, 2013

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Mazin Gilbert, Narendra K. Gupta

prev … 2 3 4 5 6 7 8 9 10 … next