Patents by Inventor Andrej Ljolje
Andrej Ljolje has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11620988
Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a speaker interacting with a speech interface using a set of allocated resources, the set of allocated resources including bandwidth, processor time, memory, and storage. The method records metrics associated with the recognized speech, and after recording the metrics, modifies at least one of the allocated resources in the set of allocated resources commensurate with the recorded metrics. The method recognizes additional speech from the speaker using the modified set of allocated resources. Metrics can include a speech recognition confidence score, processing speed, dialog behavior, requests for repeats, negative responses to confirmations, and task completions.
Type: Grant
Filed: December 9, 2019
Date of Patent: April 4, 2023
Assignee: Nuance Communications, Inc.
Inventors: Andrej Ljolje, Alistair D. Conkie, Ann K. Syrdal
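The abstract above names the steps (recognize, record metrics, modify the allocation, recognize again) but not a concrete policy. The following is only a minimal sketch of the "modify resources commensurate with the recorded metrics" step. The class names, the thresholds, and the doubling/halving rule are assumptions made for illustration and do not come from the patent.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Allocation:
    """Resources named in the abstract; the units are illustrative assumptions."""
    bandwidth_kbps: int
    cpu_ms: int
    memory_mb: int
    storage_mb: int


@dataclass(frozen=True)
class Metrics:
    """A subset of the metrics listed in the abstract."""
    confidence: float      # speech recognition confidence score in [0, 1]
    repeat_requests: int   # number of requests for repeats
    task_completed: bool   # whether the dialog task completed


def adjust(alloc: Allocation, m: Metrics) -> Allocation:
    """Hypothetical policy: give a struggling speaker more processor time and
    memory, and reclaim processor time when recognition is already going well."""
    struggling = m.confidence < 0.5 or m.repeat_requests > 1 or not m.task_completed
    if struggling:
        return replace(alloc, cpu_ms=alloc.cpu_ms * 2, memory_mb=alloc.memory_mb * 2)
    if m.confidence > 0.9:
        return replace(alloc, cpu_ms=max(1, alloc.cpu_ms // 2))
    return alloc


base = Allocation(bandwidth_kbps=64, cpu_ms=200, memory_mb=256, storage_mb=100)
harder = adjust(base, Metrics(confidence=0.3, repeat_requests=2, task_completed=False))
easier = adjust(base, Metrics(confidence=0.95, repeat_requests=0, task_completed=True))
```

In this sketch the next utterance from the same speaker would be recognized with `harder` or `easier` in place of `base`.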
-
Patent number: 10699702
Abstract: Disclosed herein are methods, systems, and computer-readable storage media for automatic speech recognition. The method includes selecting a speaker independent model, and selecting a quantity of speaker dependent models, the quantity of speaker dependent models being based on available computing resources, the selected models including the speaker independent model and the quantity of speaker dependent models. The method also includes recognizing an utterance using each of the selected models in parallel, and selecting a dominant speech model from the selected models based on recognition accuracy using the group of selected models. The system includes a processor and modules configured to control the processor to perform the method. The computer-readable storage medium includes instructions for causing a computing device to perform the steps of the method.
Type: Grant
Filed: December 4, 2017
Date of Patent: June 30, 2020
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Alistair D. Conkie
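As a rough illustration of the selection scheme in the abstract above, the sketch below keeps one speaker independent model plus as many speaker dependent models as a budget allows, runs them in parallel on one utterance, and picks a dominant model. Everything concrete here is an assumption: models are toy callables, the budget is simply a count of models standing in for available computing resources, and a confidence value stands in for recognition accuracy.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Tuple

# A "model" is any callable returning (transcript, confidence). Hypothetical interface.
Recognizer = Callable[[str], Tuple[str, float]]


def select_models(independent: Recognizer,
                  dependent: Dict[str, Recognizer],
                  budget: int) -> Dict[str, Recognizer]:
    """Keep the speaker independent model plus as many speaker dependent
    models as the budget (a count of models) leaves room for."""
    n_dependent = max(0, budget - 1)
    chosen = {"independent": independent}
    for name in list(dependent)[:n_dependent]:
        chosen[name] = dependent[name]
    return chosen


def dominant_model(models: Dict[str, Recognizer], utterance: str) -> str:
    """Run every selected model on the utterance in parallel and return the
    name of the one with the highest confidence (a proxy for accuracy)."""
    with ThreadPoolExecutor() as pool:
        futures = {name: pool.submit(model, utterance) for name, model in models.items()}
        results = {name: future.result() for name, future in futures.items()}
    return max(results, key=lambda name: results[name][1])


def make_model(confidence: float) -> Recognizer:
    """Toy recognizer that lowercases its input and reports a fixed confidence."""
    return lambda utterance: (utterance.lower(), confidence)


si = make_model(0.6)
sd = {"alice": make_model(0.9), "bob": make_model(0.4), "carol": make_model(0.95)}
models = select_models(si, sd, budget=3)   # room for two speaker dependent models
best = dominant_model(models, "HELLO")
```

With a budget of three, `carol` is never evaluated, so `alice` becomes the dominant model even though a larger budget would have chosen differently.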
-
Publication number: 20200111479
Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a speaker interacting with a speech interface using a set of allocated resources, the set of allocated resources including bandwidth, processor time, memory, and storage. The method records metrics associated with the recognized speech, and after recording the metrics, modifies at least one of the allocated resources in the set of allocated resources commensurate with the recorded metrics. The method recognizes additional speech from the speaker using the modified set of allocated resources. Metrics can include a speech recognition confidence score, processing speed, dialog behavior, requests for repeats, negative responses to confirmations, and task completions.
Type: Application
Filed: December 9, 2019
Publication date: April 9, 2020
Inventors: Andrej LJOLJE, Alistair D. CONKIE, Ann K. SYRDAL
-
Patent number: 10504505
Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a speaker interacting with a speech interface using a set of allocated resources, the set of allocated resources including bandwidth, processor time, memory, and storage. The method records metrics associated with the recognized speech, and after recording the metrics, modifies at least one of the allocated resources in the set of allocated resources commensurate with the recorded metrics. The method recognizes additional speech from the speaker using the modified set of allocated resources. Metrics can include a speech recognition confidence score, processing speed, dialog behavior, requests for repeats, negative responses to confirmations, and task completions.
Type: Grant
Filed: December 4, 2017
Date of Patent: December 10, 2019
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Andrej Ljolje, Alistair D. Conkie, Ann K. Syrdal
-
Patent number: 10291966
Abstract: A method includes receiving, at a content server from a media device, a request for media content at a first playback rate. The media content is available to the content server at a second playback rate that is different from the first playback rate. The method includes generating modified media content by modifying a first portion of the media content to have a second format corresponding to a third media playback rate. The first portion having a first media characteristic. The third playback rate is different than the first playback rate and is different than the second playback rate. The third playback rate is selected such that the modified media content has a third format corresponding to the first playback rate. The method further includes sending the modified media content from the content server to a media device.
Type: Grant
Filed: March 23, 2017
Date of Patent: May 14, 2019
Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
Inventors: Andrej Ljolje, Ann Syrdal, Alistair Conkie
-
Patent number: 10199035
Abstract: Systems, methods, and computer-readable storage devices for performing per-channel automatic speech recognition. An example system configured to practice the method combines a first audio signal of a first speaker in a communication session and a second audio signal from a second speaker in the communication session as a first audio channel and a second audio channel. The system can recognize speech in the first audio channel of the recording using a first model specific to the first speaker, and recognize speech in the second audio channel of the recording using a second model specific to the second speaker, wherein the first model is different from the second model. The system can generate recognized speech as an output from the communication session. The system can identify the models based on identifiers of the speakers, such as a telephone number, an IP address, a customer number, or account number.
Type: Grant
Filed: November 22, 2013
Date of Patent: February 5, 2019
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Ilya Dan Melamed, Andrej Ljolje
-
Publication number: 20180277102
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.
Type: Application
Filed: May 25, 2018
Publication date: September 27, 2018
Inventors: Andrej LJOLJE, Diamantino Antonio CASEIRO, Mazin GILBERT, Vincent GOFFIN, Taniya MISHRA
-
Patent number: 9984679
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.
Type: Grant
Filed: July 18, 2016
Date of Patent: May 29, 2018
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Mazin Gilbert, Vincent Goffin, Taniya Mishra
-
Patent number: 9934792
Abstract: A method of detecting pre-determined phrases to determine compliance quality of an agent includes determining a presence of a predetermined input based on a comparison between stored pre-determined phrases and a received communication, and determining a compliance rating of the agent based on a presence of a pre-determined phrase associated with the predetermined input in the communication.
Type: Grant
Filed: January 31, 2017
Date of Patent: April 3, 2018
Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
Inventors: I. Dan Melamed, Andrej Ljolje, Bernard S. Renger, David J. Smith, Yeon-Jun Kim
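One way to read the abstract above is as a set of rules that pair a predetermined input (a trigger) with a phrase the agent is then expected to say. The sketch below follows that reading on a plain-text transcript. The rule table, the exact word-sequence matching, and the rating formula (the fraction of triggered rules whose phrase is present) are assumptions for illustration; the abstract does not specify any of them.

```python
import re
from typing import Dict, List


def normalize(text: str) -> List[str]:
    """Lowercase the text, replace punctuation with spaces, and split into words."""
    return re.sub(r"[^a-z0-9 ]+", " ", text.lower()).split()


def contains_phrase(transcript: str, phrase: str) -> bool:
    """True if the words of `phrase` appear consecutively in `transcript`."""
    words, target = normalize(transcript), normalize(phrase)
    n = len(target)
    return any(words[i:i + n] == target for i in range(len(words) - n + 1))


def compliance_rating(transcript: str, rules: Dict[str, str]) -> float:
    """`rules` maps a trigger phrase to the phrase the agent must then say.
    Returns the fraction of triggered rules that were satisfied, or 1.0 when
    no rule was triggered."""
    triggered = [required for trigger, required in rules.items()
                 if contains_phrase(transcript, trigger)]
    if not triggered:
        return 1.0
    satisfied = sum(contains_phrase(transcript, required) for required in triggered)
    return satisfied / len(triggered)


# Hypothetical rules and transcript, invented for this example.
rules = {
    "cancel my account": "i can offer you a discount",
    "credit card": "this call is recorded",
}
call = ("I want to cancel my account. I can offer you a discount. "
        "Please read your credit card number.")
rating = compliance_rating(call, rules)
```

Here both rules are triggered but only the first required phrase appears, so the rating is one half.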
-
Publication number: 20180090129
Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a speaker interacting with a speech interface using a set of allocated resources, the set of allocated resources including bandwidth, processor time, memory, and storage. The method records metrics associated with the recognized speech, and after recording the metrics, modifies at least one of the allocated resources in the set of allocated resources commensurate with the recorded metrics. The method recognizes additional speech from the speaker using the modified set of allocated resources. Metrics can include a speech recognition confidence score, processing speed, dialog behavior, requests for repeats, negative responses to confirmations, and task completions.
Type: Application
Filed: December 4, 2017
Publication date: March 29, 2018
Inventors: Andrej LJOLJE, Alistair D. CONKIE, Ann K. SYRDAL
-
Publication number: 20180090130
Abstract: Disclosed herein are methods, systems, and computer-readable storage media for automatic speech recognition. The method includes selecting a speaker independent model, and selecting a quantity of speaker dependent models, the quantity of speaker dependent models being based on available computing resources, the selected models including the speaker independent model and the quantity of speaker dependent models. The method also includes recognizing an utterance using each of the selected models in parallel, and selecting a dominant speech model from the selected models based on recognition accuracy using the group of selected models. The system includes a processor and modules configured to control the processor to perform the method. The computer-readable storage medium includes instructions for causing a computing device to perform the steps of the method.
Type: Application
Filed: December 4, 2017
Publication date: March 29, 2018
Inventors: Andrej LJOLJE, Diamantino Antonio CASEIRO, Alistair D. CONKIE
-
Patent number: 9880996
Abstract: The present disclosure relates to systems, methods, and computer-readable media for generating a lexicon for use with speech recognition. The method includes overgenerating potential pronunciations by converting portions of symbolic input into a number of possible lexical pronunciation variants based on an established set of conversion rules, wherein the symbolic input comprises labeled speech data and selecting pronunciations in a speech recognition context from the potential pronunciations, to yield selected potential pronunciations. The method further includes retraining the established set of conversion rules based on the selected potential pronunciations.
Type: Grant
Filed: November 12, 2014
Date of Patent: January 30, 2018
Assignee: Nuance Communications, Inc.
Inventors: Alistair D. Conkie, Mazin Gilbert, Andrej Ljolje
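The overgenerate-then-select idea in the abstract above can be illustrated with a toy example. In this sketch the conversion rules are a hypothetical letter-to-phoneme table, overgeneration is the Cartesian product of each letter's options, and selection keeps the variants that a stand-in scoring function ranks highest. The rule table and the scores are invented for illustration, and the retraining step is not shown.

```python
from itertools import product
from typing import Callable, Dict, List

# Hypothetical conversion rules: each symbol maps to its possible phonemes.
RULES: Dict[str, List[str]] = {
    "c": ["k", "s"],
    "a": ["ae", "ey"],
    "t": ["t"],
}


def overgenerate(word: str, rules: Dict[str, List[str]]) -> List[str]:
    """Expand every symbol into all of its variants and combine them, giving
    every pronunciation the rules allow. Unknown symbols pass through unchanged."""
    options = [rules.get(symbol, [symbol]) for symbol in word]
    return [" ".join(phones) for phones in product(*options)]


def select(candidates: List[str], score: Callable[[str], float], keep: int) -> List[str]:
    """Keep the `keep` highest-scoring candidates. In a real system the score
    would come from running the recognizer on labeled speech data."""
    return sorted(candidates, key=score, reverse=True)[:keep]


# Stand-in for recognition scores; purely invented numbers.
fake_scores = {"k ae t": 0.9, "k ey t": 0.3}

candidates = overgenerate("cat", RULES)
selected = select(candidates, lambda c: fake_scores.get(c, 0.0), keep=1)
```

The selected pronunciations would then be the evidence used to retrain the rule set, for example by dropping or down-weighting variants that are never selected.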
-
Publication number: 20180018962
Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for handling missing speech data. The computer-implemented method includes receiving speech with a missing segment, generating a plurality of hypotheses for the missing segment, identifying a best hypothesis for the missing segment, and recognizing the received speech by inserting the identified best hypothesis for the missing segment. In another method embodiment, the final step is replaced with synthesizing the received speech by inserting the identified best hypothesis for the missing segment. In one aspect, the method further includes identifying a duration for the missing segment and generating the plurality of hypotheses of the identified duration for the missing segment. The step of identifying the best hypothesis for the missing segment can be based on speech context, a pronouncing lexicon, and/or a language model. Each hypothesis can have an identical acoustic score.
Type: Application
Filed: September 25, 2017
Publication date: January 18, 2018
Inventors: Andrej LJOLJE, Alistair D. CONKIE
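The abstract above notes that each hypothesis can have an identical acoustic score, which leaves the speech context and language model to decide among them. The sketch below shows that effect on a single missing word. The bigram table, the floor probability for unseen pairs, and the one-word gap are assumptions for illustration only.

```python
import math
from typing import Dict, List, Tuple

# Hypothetical bigram log-probabilities; unseen pairs get a small floor value.
BIGRAM_LOGPROB: Dict[Tuple[str, str], float] = {
    ("the", "cat"): math.log(0.2),
    ("cat", "sat"): math.log(0.3),
    ("the", "dog"): math.log(0.1),
    ("dog", "sat"): math.log(0.05),
}
UNSEEN_LOGPROB = math.log(1e-4)


def lm_score(words: List[str]) -> float:
    """Sum of bigram log-probabilities over adjacent word pairs."""
    return sum(BIGRAM_LOGPROB.get(pair, UNSEEN_LOGPROB)
               for pair in zip(words, words[1:]))


def fill_missing(before: List[str], after: List[str], vocabulary: List[str]) -> List[str]:
    """Hypothesize every vocabulary word for the gap, give each the same
    acoustic score, and keep the one the language model prefers in context."""
    acoustic_score = 0.0  # identical for all hypotheses, so it cannot change the ranking
    scored = {word: acoustic_score + lm_score(before + [word] + after)
              for word in vocabulary}
    best = max(scored, key=scored.get)
    return before + [best] + after


recovered = fill_missing(["the"], ["sat"], ["cat", "dog", "mat"])
```

Because the acoustic term is constant, the ranking here is decided entirely by the surrounding words "the" and "sat".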
-
Patent number: 9837071
Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a speaker interacting with a speech interface using a set of allocated resources, the set of allocated resources including bandwidth, processor time, memory, and storage. The method records metrics associated with the recognized speech, and after recording the metrics, modifies at least one of the allocated resources in the set of allocated resources commensurate with the recorded metrics. The method recognizes additional speech from the speaker using the modified set of allocated resources. Metrics can include a speech recognition confidence score, processing speed, dialog behavior, requests for repeats, negative responses to confirmations, and task completions.
Type: Grant
Filed: April 6, 2015
Date of Patent: December 5, 2017
Assignee: Nuance Communications, Inc.
Inventors: Andrej Ljolje, Alistair D. Conkie, Ann K. Syrdal
-
Patent number: 9837072
Abstract: Disclosed herein are methods, systems, and computer-readable storage media for automatic speech recognition. The method includes selecting a speaker independent model, and selecting a quantity of speaker dependent models, the quantity of speaker dependent models being based on available computing resources, the selected models including the speaker independent model and the quantity of speaker dependent models. The method also includes recognizing an utterance using each of the selected models in parallel, and selecting a dominant speech model from the selected models based on recognition accuracy using the group of selected models. The system includes a processor and modules configured to control the processor to perform the method. The computer-readable storage medium includes instructions for causing a computing device to perform the steps of the method.
Type: Grant
Filed: May 15, 2017
Date of Patent: December 5, 2017
Assignee: Nuance Communications, Inc.
Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Alistair D. Conkie
-
Patent number: 9773497
Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for handling missing speech data. The computer-implemented method includes receiving speech with a missing segment, generating a plurality of hypotheses for the missing segment, identifying a best hypothesis for the missing segment, and recognizing the received speech by inserting the identified best hypothesis for the missing segment. In another method embodiment, the final step is replaced with synthesizing the received speech by inserting the identified best hypothesis for the missing segment. In one aspect, the method further includes identifying a duration for the missing segment and generating the plurality of hypotheses of the identified duration for the missing segment. The step of identifying the best hypothesis for the missing segment can be based on speech context, a pronouncing lexicon, and/or a language model. Each hypothesis can have an identical acoustic score.
Type: Grant
Filed: March 2, 2016
Date of Patent: September 26, 2017
Assignee: Nuance Communications, Inc.
Inventors: Andrej Ljolje, Alistair D. Conkie
-
Publication number: 20170249937
Abstract: Disclosed herein are methods, systems, and computer-readable storage media for automatic speech recognition. The method includes selecting a speaker independent model, and selecting a quantity of speaker dependent models, the quantity of speaker dependent models being based on available computing resources, the selected models including the speaker independent model and the quantity of speaker dependent models. The method also includes recognizing an utterance using each of the selected models in parallel, and selecting a dominant speech model from the selected models based on recognition accuracy using the group of selected models. The system includes a processor and modules configured to control the processor to perform the method. The computer-readable storage medium includes instructions for causing a computing device to perform the steps of the method.
Type: Application
Filed: May 15, 2017
Publication date: August 31, 2017
Inventors: Andrej LJOLJE, Diamantino Antonio CASEIRO, Alistair D. CONKIE
-
Publication number: 20170195741
Abstract: A method includes receiving, at a content server from a media device, a request for media content at a first playback rate. The media content is available to the content server at a second playback rate that is different from the first playback rate. The method includes generating modified media content by modifying a first portion of the media content to have a second format corresponding to a third media playback rate. The first portion having a first media characteristic. The third playback rate is different than the first playback rate and is different than the second playback rate. The third playback rate is selected such that the modified media content has a third format corresponding to the first playback rate. The method further includes sending the modified media content from the content server to a media device.
Type: Application
Filed: March 23, 2017
Publication date: July 6, 2017
Inventors: Andrej Ljolje, Ann Syrdal, Alistair Conkie
-
Publication number: 20170140775
Abstract: A method of detecting pre-determined phrases to determine compliance quality of an agent includes determining a presence of a predetermined input based on a comparison between stored pre-determined phrases and a received communication, and determining a compliance rating of the agent based on a presence of a pre-determined phrase associated with the predetermined input in the communication.
Type: Application
Filed: January 31, 2017
Publication date: May 18, 2017
Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
Inventors: I. Dan MELAMED, Andrej LJOLJE, Bernard S. Renger, David J. Smith, Yeon-Jun Kim
-
Patent number: 9653069
Abstract: Disclosed herein are methods, systems, and computer-readable storage media for automatic speech recognition. The method includes selecting a speaker independent model, and selecting a quantity of speaker dependent models, the quantity of speaker dependent models being based on available computing resources, the selected models including the speaker independent model and the quantity of speaker dependent models. The method also includes recognizing an utterance using each of the selected models in parallel, and selecting a dominant speech model from the selected models based on recognition accuracy using the group of selected models. The system includes a processor and modules configured to control the processor to perform the method. The computer-readable storage medium includes instructions for causing a computing device to perform the steps of the method.
Type: Grant
Filed: April 30, 2015
Date of Patent: May 16, 2017
Assignee: Nuance Communications, Inc.
Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Alistair D. Conkie