Patents Assigned to Nuance Communications
  • Patent number: 10049669
    Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.
    Type: Grant
    Filed: January 6, 2012
    Date of Patent: August 14, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
  • Publication number: 20180211668
    Abstract: Method and apparatus for providing visual feedback on an electronic device in a client/server speech recognition system comprising the electronic device and a network device remotely located from the electronic device. The method comprises processing, by an embedded speech recognizer of the electronic device, at least a portion of input audio comprising speech to produce local recognized speech, sending at least a portion of the input audio to the network device for remote speech recognition, and displaying, on a user interface of the electronic device, visual feedback based on at least a portion of the local recognized speech prior to receiving streaming recognition results from the network device.
    Type: Application
    Filed: July 17, 2015
    Publication date: July 26, 2018
    Applicant: Nuance Communications, Inc.
    Inventors: Daniel WILLETT, Christian GOLLAN, Carl Benjamin QUILLEN, Stefan HAHN, Fabian STEMMER
  • Publication number: 20180211234
    Abstract: According to at least one aspect, a system for remotely controlling an application installed on a device is provided. The system includes at least one processor and at least one computer-readable storage medium storing instructions which program the at least one processor to identify a task for the application installed on the device to perform, transmit a binary short message service (SMS) message to the device including a task code associated with the identified task, receive an information request from the device responsive to the binary SMS message, and transmit task information to the device responsive to receiving the information request.
    Type: Application
    Filed: January 26, 2017
    Publication date: July 26, 2018
    Applicant: Nuance Communications, Inc.
    Inventors: Abhishek Rohatgi, John Dolan Heater, Flaviu Negrean, Mark P. Hanson
  • Patent number: 10032455
    Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.
    Type: Grant
    Filed: January 6, 2012
    Date of Patent: July 24, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
  • Patent number: 10032127
    Abstract: Techniques for determining a clinician's intent to order an item may include processing a free-form narration, of an encounter with a patient, narrated by a clinician, using a natural language understanding engine implemented by one or more processors, to extract at least one clinical fact corresponding to a mention of an orderable item from the free-form narration. The processing may comprise distinguishing between whether the at least one clinical fact indicates an intent to order the orderable item or does not indicate an intent to order the orderable item. In response to determining that the at least one clinical fact indicates an intent to order the orderable item, an order may be generated for the orderable item.
    Type: Grant
    Filed: March 1, 2013
    Date of Patent: July 24, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Isam H. Habboush, Davide Zaccagnini
  • Patent number: 10032454
    Abstract: Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results.
    Type: Grant
    Filed: June 25, 2015
    Date of Patent: July 24, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Shilei Zhang, Shenghua Bao, Wen Liu, Yong Qin, Zhiwei Shuang, Jian Chen, Zhong Su, Qin Shi, William F. Ganong, III
  • Patent number: 10025848
    Abstract: A system and method for speech file processing which provides users with differentially selectable speech file transcripts which can be sent to one or more other users. The speech files may be voicemail messages from which respective voicemail transcripts are created. The voicemail transcripts are provided in a user selectable format from which users may select non-contiguous portions of the transcript.
    Type: Grant
    Filed: April 25, 2016
    Date of Patent: July 17, 2018
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Julia Hirschberg, Stephen Whittaker
  • Patent number: 10027799
    Abstract: Embodiments are provided for the automatic real-time recording and processing of media in a communications network based on the context of the media. In one embodiment, a media stream is received in an analysis module in a service platform in the communications network. The media stream may represent a communication session between a calling party and a call center in the network. The incoming media steam is analyzed to identify words comprising a context of the communication session. A determination is then made as to whether the context of the communication session is related to a set of business rules associated with the service platform which may automatically trigger the retention of a recording of the communication session. If the context of the communication session is related to the set of business rules, the retention of the communication session is automatically triggered in real-time at a recording module.
    Type: Grant
    Filed: February 2, 2017
    Date of Patent: July 17, 2018
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventor: David Anderson
  • Patent number: 10026092
    Abstract: An automated system processes customer requests and interactions with a customer service representative. During such interactions, the representative's actions at a computer interface are monitored, and the dialog between the customer and representative are recorded. Based on the dialog and the actions, a script is created or updated for a given request, where the script encompasses a dialog tree and actions relating to the customer's account. When a subsequent customer submits the same or a comparable request, an automated agent utilizes the script to handle the request. Using the script, the automated agent performs a dialog with the customer, accesses the customer account, and updates the account in accordance with the request.
    Type: Grant
    Filed: December 9, 2016
    Date of Patent: July 17, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: John Dolan Heater, Abhishek Rohatgi, Flaviu Gelu Negrean
  • Publication number: 20180197545
    Abstract: Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.
    Type: Application
    Filed: January 11, 2017
    Publication date: July 12, 2018
    Applicant: Nuance Communications, Inc.
    Inventors: Daniel Willett, Joel Pinto, William F. Ganong, III
  • Patent number: 10021169
    Abstract: A system logs application usage data on a mobile device, processes the data on an analysis system and outputs a current and predicted score to, e.g. third parties. The system logs application-related usage data, which is collected via, e.g. a keyboard application running in the background on the mobile device. The system then evaluates the logged usage data and the events corresponding to a particular application. The events can be analyzed to score the user engagement level with the application, e.g., more events recorded for a given application per day, the more engaged a user is with that application. The engagement level can further be predicated based on historical usage log data from which a score decay model can be generated.
    Type: Grant
    Filed: September 20, 2013
    Date of Patent: July 10, 2018
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Dan Hendrick, Eric Jun Fu
  • Publication number: 20180190272
    Abstract: According to some aspects, a method of processing user input received from a user is provided. The method comprises generating a plurality of segmentation hypotheses from content of the user input based, at least in part, on a set of parameters, querying a domain-specific database using each of the plurality of segmentation hypotheses to obtain at least one result, and modifying at least one of the set of parameters based, at least in part, on the at least one result.
    Type: Application
    Filed: June 30, 2015
    Publication date: July 5, 2018
    Applicant: Nuance Communications, Inc.
    Inventors: Munir Nikolai Alexander Georges, Eduardo Vellasques, Friederike Eva Anabel Niedtner, Diana Deokhwa Jung, Oliver Bender, Josef Damianus Anstasiadis
  • Patent number: 10013652
    Abstract: Deep Neural Networks (DNNs) with many hidden layers and many units per layer are very flexible models with a very large number of parameters. As such, DNNs are challenging to optimize. To achieve real-time computation, embodiments disclosed herein enable fast DNN feature transformation via optimized memory bandwidth utilization. To optimize memory bandwidth utilization, a rate of accessing memory may be reduced based on a batch setting. A memory, corresponding to a selected given output neuron of a current layer of the DNN, may be updated with an incremental output value computed for the selected given output neuron as a function of input values of a selected few non-zero input neurons of a previous layer of the DNN in combination with weights between the selected few non-zero input neurons and the selected given output neuron, wherein a number of the selected few corresponds to the batch setting.
    Type: Grant
    Filed: April 29, 2015
    Date of Patent: July 3, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Jan Vlietinck, Stephan Kanthak, Rudi Vuerinckx, Christophe Ris
  • Patent number: 10008208
    Abstract: Embodiments of the present invention perform speaker identification and verification by first prompting a user to speak a phrase that includes a common phrase component and a personal identifier. Then, the embodiments decompose the spoken phrase to locate the personal identifier. Finally, the embodiments identify and verify the user based on the results of the decomposing.
    Type: Grant
    Filed: September 18, 2014
    Date of Patent: June 26, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Almog Aley-Raz, Kevin R. Farrell, Oshrit Yaron, Luca Scarpato
  • Publication number: 20180174582
    Abstract: The method comprises receive first audio comprising speech from a user of a computing device, detecting an end of speech in the first audio, generating an ASR result based, at least in part, on a portion of the first audio prior to the detected end of speech, determining whether a valid action can be performed by a speech-enabled application installed on the computing device using the ASR result, and processing second audio when it is determined that a valid action cannot be performed by the speech-enabled application using the ASR result.
    Type: Application
    Filed: May 23, 2016
    Publication date: June 21, 2018
    Applicant: Nuance Communications, Inc.
    Inventor: Mark Fanty
  • Patent number: 10002608
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating relevant responses to a user query with voice-enabled search. A system practicing the method receives a word lattice generated by an automatic speech recognizer based on a user speech and a prosodic analysis of the user speech, generates a reweighted word lattice based on the word lattice and the prosodic analysis, approximates based on the reweighted word lattice one or more relevant responses to the query, and presents to a user the responses to the query. The prosodic analysis examines metalinguistic information of the user speech and can identify the most salient subject matter of the speech, assess how confident a speaker is in the content of his or her speech, and identify the attitude, mood, emotion, sentiment, etc. of the speaker. Other information not described in the content of the speech can also be used.
    Type: Grant
    Filed: September 17, 2010
    Date of Patent: June 19, 2018
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Srinivas Bangalore, Junlan Feng, Michael Johnston, Taniya Mishra
  • Patent number: 9998597
    Abstract: An interactive communication system configured to conduct a call with a caller. The interactive communication system comprises at least one computer hardware processor configured to perform: obtaining a plurality of dialog chunks comprising information provided by the caller to the interactive communication system and information provided by the interactive communication system to the caller; generating, based on the plurality of dialog chunks, a respective plurality of feature sets, each of the plurality of feature sets comprising at least one feature generated using a respective dialog chunk of the plurality of dialog chunks; determining, based on the plurality of feature sets, a respective plurality of dialog chunk scores; determining, based at least in part on the plurality of dialog chunk scores, a likelihood that the caller is dissatisfied with the interactive communication system; and when the likelihood exceeds a threshold, performing a remedial action that alters how the call is handled.
    Type: Grant
    Filed: July 6, 2015
    Date of Patent: June 12, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Jiri Havelka, Raimo Bakis
  • Patent number: 9997172
    Abstract: A system, method and computer program product are described for voice activity detection (VAD) within a digitally encoded bitstream. A parameter extraction module is configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech. A VAD classifier is configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames.
    Type: Grant
    Filed: December 2, 2013
    Date of Patent: June 12, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Daniel A. Barreda, Jose E. G. Lainez, Dushyant Sharma, Patrick Naylor
  • Patent number: 9996675
    Abstract: An assignment device (1) assigns word class information (WKI) to one or more words of text information (ETI). Based on word-class sequence information (WK-AI) formed from this assigned word class information (WKI), actions (A) are executed in order to notify the user of conflicts or to provide the user with background information (HI) relating to words in the text information (TT).
    Type: Grant
    Filed: January 14, 2015
    Date of Patent: June 12, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Matthias Helletzgruber, Kresimir Rajic
  • Patent number: RE46952
    Abstract: Voicemail systems can include a memory and a processor. The memory can store data relating to users. An incoming communication can be handled by the voicemail system, forwarded to another voicemail system, provided with functionality based upon a user's preferences, and the like. The voicemail systems can include functionality to allow a user to consolidate voicemail messages and/or calls at one or more designated destinations, for example, a voicemail system and/or a mobile device.
    Type: Grant
    Filed: November 6, 2014
    Date of Patent: July 10, 2018
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: William Joseph Sigmund, Michael Robert Zubas, Brian Keith Rainer