Patents by Inventor Mazin Gilbert

Mazin Gilbert has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for responding to an inquiry

Patent number: 10115056

Abstract: Disclosed is a method and apparatus for responding to an inquiry from a client via a network. The method and apparatus receive the inquiry from a client via a network. Based on the inquiry, question-answer pairs retrieved from the network are analyzed to determine a response to the inquiry. The QA pairs are not predefined. As a result, the QA pairs have to be analyzed in order to determine whether they are responsive to a particular inquiry. Questions of the QA pairs may be repetitive and similar to one another even for very different subjects, and without additional contextual and meta-level information, are not useful in determining whether their corresponding answer responds to an inquiry.

Type: Grant

Filed: October 6, 2016

Date of Patent: October 30, 2018

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Junlan Feng, Mazin Gilbert, Dilek Hakkani-Tur, Gokhan Tur
System and Method for Optimizing Speech Recognition and Natural Language Parameters with User Feedback

Publication number: 20180277102

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.

Type: Application

Filed: May 25, 2018

Publication date: September 27, 2018

Inventors: Andrej LJOLJE, Diamantino Antonio CASEIRO, Mazin GILBERT, Vincent GOFFIN, Taniya MISHRA
Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition

Patent number: 10083687

Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information based on a previously recorded time and speed of the caller, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition as the received audio signal using the selected acoustic model.

Type: Grant

Filed: October 16, 2017

Date of Patent: September 25, 2018

Assignee: NUANCE COMMUNICATIONS, INC.

Inventor: Mazin Gilbert
SYSTEM AND METHOD FOR RAPID CUSTOMIZATION OF SPEECH RECOGNITION MODELS

Publication number: 20180268810

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating domain-specific speech recognition models for a domain of interest by combining and tuning existing speech recognition models when a speech recognizer does not have access to a speech recognition model for that domain of interest and when available domain-specific data is below a minimum desired threshold to create a new domain-specific speech recognition model. A system configured to practice the method identifies a speech recognition domain and combines a set of speech recognition models, each speech recognition model of the set of speech recognition models being from a respective speech recognition domain. The system receives an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model, and tunes the combined speech recognition model for the speech recognition domain based on the data.

Type: Application

Filed: May 21, 2018

Publication date: September 20, 2018

Inventors: Srinivas BANGALORE, Robert BELL, Diamantino Antonio CASEIRO, Mazin GILBERT, Patrick HAFFNER
SYSTEM AND METHOD FOR DESIGNING AND EXECUTING CONTROL LOOPS IN A CLOUD ENVIRONMENT

Publication number: 20180260745

Abstract: A system for designing and executing control loops in a cloud environment includes a control platform implemented in the cloud environment having a data collection, analytics and events module, a policy module and an application controller module. The system includes a business process management application coupled to the control platform having a control loop designer module for designing a control loop template and a workflow engine for distributing the control loop template. The business process management application is coupled to the data collection analytics and events module of the control platform as well as the policy module in the application controller module to control platform. The control loop is activated by the control platform.

Type: Application

Filed: March 9, 2017

Publication date: September 13, 2018

Inventors: Rittwik Jana, Mazin Gilbert, Eric Noel, Vijay Gopalakrishnan
System and Method for Creating a Presentation Using Natural Language

Publication number: 20180246864

Abstract: The invention provides for a system, method, and computer readable medium storing instructions related to controlling a presentation in a multimodal system. The method embodiment of the invention is a method for the retrieval of information on the basis of its content for incorporation into an electronic presentation. The method comprises receiving from a user a content-based request for at least one segment from a first plurality of segments within a media presentation preprocessed to enable natural language content searchability; in response to the request, presenting a subset of the first plurality of segments to the user; receiving a selection indication from the user associated with at least one segment of the subset of the first plurality of segments and adding the selected at least one segment to a deck for use in a presentation.

Type: Application

Filed: April 27, 2018

Publication date: August 30, 2018

Inventors: Patrick Ehlen, David Crawford GIBBON, Mazin Gilbert, Michael Johnston, Zhu Liu, Behzad Shahraray
System and method for optimizing speech recognition and natural language parameters with user feedback

Patent number: 9984679

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.

Type: Grant

Filed: July 18, 2016

Date of Patent: May 29, 2018

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Mazin Gilbert, Vincent Goffin, Taniya Mishra
System and method for rapid customization of speech recognition models

Patent number: 9978363

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating domain-specific speech recognition models for a domain of interest by combining and tuning existing speech recognition models when a speech recognizer does not have access to a speech recognition model for that domain of interest and when available domain-specific data is below a minimum desired threshold to create a new domain-specific speech recognition model. A system configured to practice the method identifies a speech recognition domain and combines a set of speech recognition models, each speech recognition model of the set of speech recognition models being from a respective speech recognition domain. The system receives an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model, and tunes the combined speech recognition model for the speech recognition domain based on the data.

Type: Grant

Filed: June 12, 2017

Date of Patent: May 22, 2018

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Srinivas Bangalore, Robert Bell, Diamantino Antonio Caseiro, Mazin Gilbert, Patrick Haffner
System and method for creating a presentation using natural language

Patent number: 9959260

Abstract: The invention provides for a system, method, and computer readable medium storing instructions related to controlling a presentation in a multimodal system. The method embodiment of the invention is a method for the retrieval of information on the basis of its content for incorporation into an electronic presentation. The method comprises receiving from a user a content-based request for at least one segment from a first plurality of segments within a media presentation preprocessed to enable natural language content searchability; in response to the request, presenting a subset of the first plurality of segments to the user; receiving a selection indication from the user associated with at least one segment of the subset of the first plurality of segments and adding the selected at least one segment to a deck for use in a presentation.

Type: Grant

Filed: May 4, 2015

Date of Patent: May 1, 2018

Assignee: Nuance Communications, Inc.

Inventors: Patrick Ehlen, David Crawford Gibbon, Mazin Gilbert, Michael Johnston, Zhu Liu, Behzad Shahraray
Multimodal portable communication interface for accessing video content

Patent number: 9942616

Abstract: A portable communication device has a touch screen display that receives tactile input and a microphone that receives audio input. The portable communication device initiates a query for media based at least in part on tactile input and audio input. The touch screen display is a multi-touch screen. The portable communication device sends an initiated query and receives a text response indicative of a speech to text conversion of the query. The portable communication device then displays video in response to tactile input and audio input.

Type: Grant

Filed: May 2, 2016

Date of Patent: April 10, 2018

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Behzad Shahraray, David Crawford Gibbon, Bernard S. Renger, Zhu Liu, Andrea Basso, Mazin Gilbert, Michael J. Johnston
On-Demand Language Translation for Television Programs

Publication number: 20180046617

Abstract: In an embodiment, a method of providing an on demand translation service is provided. A subscriber may be charged a reduced fee or no fee for use of the on demand translation service in exchange for displaying commercial messages to the subscriber, the commercial messages being selected based on subscriber information. A multimedia signal including information in a source language may be received. The information may be obtained as text in the source language from the multimedia signal. The text may be translated from the source language to a target language. Translated information, based on the translated text, may be transmitted to a processing device for presentation to the subscriber. The received multimedia signal may be sent to a multimedia device for viewing.

Type: Application

Filed: October 30, 2017

Publication date: February 15, 2018

Inventors: Srinivas BANGALORE, David Crawford GIBBON, Mazin GILBERT, Patrick Guy HAFFNER, Zhu LIU, Behzad SHAHRARAY
METHOD AND APPARATUS FOR IDENTIFYING ACOUSTIC BACKGROUND ENVIRONMENTS BASED ON TIME AND SPEED TO ENHANCE AUTOMATIC SPEECH RECOGNITION

Publication number: 20180040318

Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information based on a previously recorded time and speed of the caller, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition as the received audio signal using the selected acoustic model.

Type: Application

Filed: October 16, 2017

Publication date: February 8, 2018

Inventor: Mazin GILBERT
System and method for increasing recognition rates of in-vocabulary words by improving pronunciation modeling

Patent number: 9880996

Abstract: The present disclosure relates to systems, methods, and computer-readable media for generating a lexicon for use with speech recognition. The method includes overgenerating potential pronunciations by converting portions of symbolic input into a number of possible lexical pronunciation variants based on an established set of conversion rules, wherein the symbolic input comprises labeled speech data and selecting pronunciations in a speech recognition context from the potential pronunciations, to yield selected potential pronunciations. The method further includes retraining the established set of conversion rules based on the selected potential pronunciations.

Type: Grant

Filed: November 12, 2014

Date of Patent: January 30, 2018

Assignee: Nuance Communications, Inc.

Inventors: Alistair D. Conkie, Mazin Gilbert, Andrej Ljolje
SYSTEM AND METHOD FOR RAPID CUSTOMIZATION OF SPEECH RECOGNITION MODELS

Publication number: 20170345418

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating domain-specific speech recognition models for a domain of interest by combining and tuning existing speech recognition models when a speech recognizer does not have access to a speech recognition model for that domain of interest and when available domain-specific data is below a minimum desired threshold to create a new domain-specific speech recognition model. A system configured to practice the method identifies a speech recognition domain and combines a set of speech recognition models, each speech recognition model of the set of speech recognition models being from a respective speech recognition domain. The system receives an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model, and tunes the combined speech recognition model for the speech recognition domain based on the data.

Type: Application

Filed: June 12, 2017

Publication date: November 30, 2017

Inventors: Srinivas BANGALORE, Robert BELL, Diamantino Antonio CASEIRO, Mazin GILBERT, Patrick HAFFNER
SYSTEM AND METHOD FOR GENERATING CUSTOMIZED TEXT-TO-SPEECH VOICES

Publication number: 20170330554

Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.

Type: Application

Filed: July 31, 2017

Publication date: November 16, 2017

Inventors: Srinivas BANGALORE, Junlan FENG, Mazin GILBERT, Juergen SCHROETER, Ann K. SYRDAL, David SCHULZ
On-demand language translation for television programs

Patent number: 9805026

Abstract: In an embodiment, a method of providing an on demand translation service is provided. A subscriber may be charged a reduced fee or no fee for use of the on demand translation service in exchange for displaying commercial messages to the subscriber, the commercial messages being selected based on subscriber information. A multimedia signal including information in a source language may be received. The information may be obtained as text in the source language from the multimedia signal. The text may be translated from the source language to a target language. Translated information, based on the translated text, may be transmitted to a processing device for presentation to the subscriber. The received multimedia signal may be sent to a multimedia device for viewing.

Type: Grant

Filed: August 12, 2014

Date of Patent: October 31, 2017

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, David Crawford Gibbon, Mazin Gilbert, Patrick Guy Haffner, Zhu Liu, Behzad Shahraray
Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition

Patent number: 9792906

Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information based on a previously recorded time and speed of the caller, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition as the received audio signal using the selected acoustic model.

Type: Grant

Filed: June 2, 2016

Date of Patent: October 17, 2017

Assignee: Nuance Communications, Inc.

Inventor: Mazin Gilbert
Methods and systems for natural language understanding using human knowledge and collected data

Patent number: 9792904

Abstract: Disclosed herein are systems and methods to incorporate human knowledge when developing and using statistical models for natural language understanding. The disclosed systems and methods embrace a data-driven approach to natural language understanding which progresses seamlessly along the continuum of availability of annotated collected data, from when there is no available annotated collected data to when there is any amount of annotated collected data.

Type: Grant

Filed: July 23, 2014

Date of Patent: October 17, 2017

Assignee: Nuance Communications, Inc.

Inventors: Srinivas Bangalore, Mazin Gilbert, Narendra K. Gupta
System and method for generating customized text-to-speech voices

Patent number: 9721558

Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.

Type: Grant

Filed: December 10, 2015

Date of Patent: August 1, 2017

Assignee: Nuance Communications, Inc.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin Gilbert, Juergen Schroeter, Ann K. Syrdal, David Schulz
LEARNING FROM INTERACTIONS FOR A SPOKEN DIALOG SYSTEM

Publication number: 20170213546

Abstract: In one embodiment, a semantic classifier input and a corresponding label attributed to the semantic classifier input may be obtained. A determination may be made whether the corresponding label is correct based on logged interaction data. An entry of an adaptation corpus may be generated based on a result of the determination. Operation of the semantic classifier may be adapted based on the adaptation corpus.

Type: Application

Filed: April 10, 2017

Publication date: July 27, 2017

Inventors: Mazin GILBERT, Esther LEVIN, Michael Lederman LITTMAN, Robert E. Schapire

prev 1 2 3 4 5 6 7 … next