Patents Assigned to Nuance Communications, Inc.
  • Patent number: 10607600
    Abstract: A system and method of updating automatic speech recognition parameters on a mobile device are disclosed. The method comprises storing user account-specific adaptation data associated with ASR on a computing device associated with a wireless network, generating new ASR adaptation parameters based on transmitted information from the mobile device when a communication channel between the computing device and the mobile device becomes available and transmitting the new ASR adaptation data to the mobile device when a communication channel between the computing device and the mobile device becomes available. The new ASR adaptation data on the mobile device more accurately recognizes user utterances.
    Type: Grant
    Filed: February 12, 2018
    Date of Patent: March 31, 2020
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Sarangarajan Parthasarathy, Richard Cameron Rose
  • Patent number: 10609213
    Abstract: Embodiments are provided for the automatic real-time recording and processing of media in a communications network based on the context of the media. In one embodiment, a media stream is received in an analysis module in a service platform in the communications network. The media stream may represent a communication session between a calling party and a call center in the network. The incoming media steam is analyzed to identify words comprising a context of the communication session. A determination is then made as to whether the context of the communication session is related to a set of business rules associated with the service platform which may automatically trigger the retention of a recording of the communication session. If the context of the communication session is related to the set of business rules, the retention of the communication session is automatically triggered in real-time at a recording module.
    Type: Grant
    Filed: July 16, 2018
    Date of Patent: March 31, 2020
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventor: David Anderson
  • Patent number: 10586018
    Abstract: A method and a system for generating, with the assistance of a computer system (12), a medical report (18) suitable for automatic billing, where an electronic template (39) suited for a specific patient's condition is selected out of a plurality of given electronic templates stored in storage means (15); personal data of the specific patient's and previously stored in storage means (11) are automatically entered into the selected electronic template; and medical report text passages and instructions are entered into the selected template by dictating and using a speech recognition system (13); additionally, condition data are automatically entered on the basis of condition information as far as stored in storage means (7) into the selected template, and code data associated with these condition information are automatically embedded in the selected template; and when entering medical report text passages, at least one predetermined voice macro stored in the storage means (16) together with code data embedded
    Type: Grant
    Filed: July 10, 2014
    Date of Patent: March 10, 2020
    Assignee: Nuance Communications, Inc.
    Inventor: Mehmet M. Oez
  • Patent number: 10580429
    Abstract: A method, computer program product, and computing system for acoustic speech localization, comprising receiving, via a plurality of microphones, a plurality of audio signals. Modulation properties of the plurality of audio signals may be analyzed. Speech sounds may be localized from the plurality of audio signals based upon, at least in part, the modulation properties of the plurality of audio signals.
    Type: Grant
    Filed: August 22, 2018
    Date of Patent: March 3, 2020
    Assignee: Nuance Communications, Inc.
    Inventors: Sam Karimian-Azari, Dushyant Sharma, Amr Nour-Eldin, Patrick A. Naylor
  • Publication number: 20200058296
    Abstract: Techniques for learning front-end speech recognition parameters as part of training a neural network classifier include obtaining an input speech signal, and applying front-end speech recognition parameters to extract features from the input speech signal. The extracted features may be fed through a neural network to obtain an output classification for the input speech signal, and an error measure may be computed for the output classification through comparison of the output classification with a known target classification. Back propagation may be applied to adjust one or more of the front-end parameters as one or more layers of the neural network, based on the error measure.
    Type: Application
    Filed: July 23, 2019
    Publication date: February 20, 2020
    Applicant: Nuance Communications, Inc.
    Inventors: Tara N. Sainath, Brian E. D. Kingsbury, Abdel-rahman Mohamed, Bhuvana Ramabhadran
  • Patent number: 10559303
    Abstract: The method comprises receive first audio comprising speech from a user of a computing device, detecting an end of speech in the first audio, generating an ASR result based, at least in part, on a portion of the first audio prior to the detected end of speech, determining whether a valid action can be performed by a speech-enabled application installed on the computing device using the ASR result, and processing second audio when it is determined that a valid action cannot be performed by the speech-enabled application using the ASR result.
    Type: Grant
    Filed: May 23, 2016
    Date of Patent: February 11, 2020
    Assignee: Nuance Communications, Inc.
    Inventor: Mark Fanty
  • Patent number: 10546655
    Abstract: A method, computer program product, and computing system for tracking encounter participants is executed on a computing device and includes obtaining encounter information of a patient encounter, wherein the encounter information includes machine vision encounter information obtained via one or more machine vision systems. The machine vision encounter information is processed to identify one or more humanoid shapes.
    Type: Grant
    Filed: August 8, 2018
    Date of Patent: January 28, 2020
    Assignee: Nuance Communications, Inc.
    Inventors: Donald E. Owen, Daniel Paulino Almendro Barreda, Dushyant Sharma
  • Patent number: 10546595
    Abstract: Disclosed herein are systems, methods, and computer-readable storage media for improving speech recognition accuracy using textual context. The method includes retrieving a recorded utterance, capturing text from a device display associated with the spoken dialog and viewed by one party to the recorded utterance, and identifying words in the captured text that are relevant to the recorded utterance. The method further includes adding the identified words to a dynamic language model, and recognizing the recorded utterance using the dynamic language model. The recorded utterance can be a spoken dialog. A time stamp can be assigned to each identified word. The method can include adding identified words to and/or removing identified words from the dynamic language model based on their respective time stamps. A screen scraper can capture text from the device display associated with the recorded utterance. The device display can contain customer service data.
    Type: Grant
    Filed: March 5, 2018
    Date of Patent: January 28, 2020
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Dan Melamed, Srinivas Bangalore, Michael Johnston
  • Publication number: 20200025904
    Abstract: A system and method for detecting multi-tone sirens despite environmental noises that may be present obtains a microphone input signal, applies, in real time, a time-frequency analysis to the microphone input signal to determine a time-frequency representation, provides at least one multi-tone model that has a plurality of tone duration patterns, performs multi-tone siren detection on the time-frequency representation, the detection based on the at least one multi-tone model and factoring of doppler shifts, and generates a detection result that can be used in systems for automated vehicles.
    Type: Application
    Filed: July 19, 2019
    Publication date: January 23, 2020
    Applicant: Nuance Communications, Inc.
    Inventors: Markus BUCK, Julien PREMONT, Friedrich FAUBEL
  • Patent number: 10540347
    Abstract: Methods, systems, computer-readable media, and apparatuses for providing search disambiguation using contextual information and domain ontologies are presented. In some embodiments, a computing device may receive a natural language input from a user. The computing device may identify a plurality of hypotheses for the natural language input. The computing device may map the plurality of hypotheses to one or more concepts of a plurality of concepts of an ontology by annotating the one or more concepts. The ontology may include the plurality of concepts respectively connected by a plurality of relations. The computing device may determine that there is an imperfect match between the annotated one or more concepts and annotations of answers. In response, the computing device may disambiguate the annotated one or more concepts using the ontology. The computing device may present output to the user based on the disambiguation.
    Type: Grant
    Filed: October 27, 2014
    Date of Patent: January 21, 2020
    Assignee: Nuance Communications, Inc.
    Inventors: Ladislav Kunc, Martin Labský, Tomá{hacek over (s)} Macek, Jan Vystr{hacek over (c)}il, Jan Kleindienst
  • Patent number: 10540965
    Abstract: Multiple natural language understanding (NLU) interpretation selection models may be generated. The NLU interpretation selection models may include a generic NLU interpretation selection model that is not specialized for a specific set of NLU interpretations type and one or more specialized NLU interpretation selection models, each of which may be specific to a particular set of NLU interpretations type. The specialized NLU interpretation selection model(s) may be utilized to process natural language input data comprising data corresponding to their respective sets of NLU interpretations type(s). The generic NLU interpretation selection model may be utilized to process natural language input data comprising data that does not correspond to the sets of NLU interpretations type(s) associated with the specialized NLU interpretation selection model(s).
    Type: Grant
    Filed: September 11, 2017
    Date of Patent: January 21, 2020
    Assignee: Nuance Communications, Inc.
    Inventors: Simona Gandrabur, Jean-Francois Lavallee, Real Tremblay
  • Patent number: 10540140
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing multimodal input. A system configured to practice the method continuously monitors an audio stream associated with a gesture input stream, and detects a speech event in the audio stream. Then the system identifies a temporal window associated with a time of the speech event, and analyzes data from the gesture input stream within the temporal window to identify a gesture event. The system processes the speech event and the gesture event to produce a multimodal command. The gesture in the gesture input stream can be directed to a display, but is remote from the display. The system can analyze the data from the gesture input stream by calculating an average of gesture coordinates within the temporal window.
    Type: Grant
    Filed: July 17, 2017
    Date of Patent: January 21, 2020
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Michael Johnston, Derya Ozkan
  • Patent number: 10534623
    Abstract: A method, performed by at least one computer, the method comprising using the at least one computer to perform acts of accessing information specifying at least one user-specified condition specified by a user and at least one corresponding user-specified action, the user-specified action to be performed when the user-specified condition is met; determining whether the at least one user-specified condition is met; and when it is determined that the at least one user-specified condition is met, causing a virtual assistant executing on a mobile device different from the at least one computer to perform the at least one user-specified action.
    Type: Grant
    Filed: December 16, 2013
    Date of Patent: January 14, 2020
    Assignee: Nuance Communications, Inc.
    Inventors: Kenneth S. Harper, Fares Jaradeh, Holger Quast, Carey Radebaugh, Sean P. Brown
  • Patent number: 10529338
    Abstract: Embodiments of the present invention perform speaker identification and verification by first prompting a user to speak a phrase that includes a common phrase component and a personal identifier. Then, the embodiments decompose the spoken phrase to locate the personal identifier. Finally, the embodiments identify and verify the user based on the results of the decomposing.
    Type: Grant
    Filed: June 26, 2018
    Date of Patent: January 7, 2020
    Assignee: Nuance Communications, Inc.
    Inventors: Almog Aley-Raz, Kevin R. Farrell, Oshrit Yaron, Luca Scarpato
  • Patent number: 10530519
    Abstract: The present disclosure is directed towards a method for scheduling data packets in a multi-channel packet processing environment. The method may include receiving one or more data packets associated with an incoming signal and inserting the one or more data packets into a queue. The method may further include monitoring a time delay associated with each of the one or more data packets, wherein the time delay indicates a difference between packet arrival and packet departure times. The method may also include sorting the time delay results based upon an increasing order of time delay and determining a total number of data packets associated with each of a plurality of channels. The method may also include scheduling a data packet for processing based upon, at least in part, at least one of the sorted time delay results and the total number of data packets associated with each channel.
    Type: Grant
    Filed: July 10, 2017
    Date of Patent: January 7, 2020
    Assignee: Nuance Communications, Inc.
    Inventors: Mahesh Godavarti, Sridhar Pilli, Biswaranjan Panigrahi
  • Patent number: 10521186
    Abstract: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.
    Type: Grant
    Filed: March 20, 2013
    Date of Patent: December 31, 2019
    Assignee: Nuance Communications, Inc.
    Inventors: Ciprian Agapi, Soonthorn Ativanichayaphong, Leslie R. Wilson
  • Patent number: 10522133
    Abstract: Techniques for error correction using a history list comprising at least one misrecognition and correction information associated with each of the at least one misrecognitions indicating how a user corrected the associated misrecognition. The techniques include converting data input from a user to generate a text segment, determining whether at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, if the at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, obtaining the correction information associated with the at least one misrecognition, and correcting the at least a portion of the text segment based, at least in part, on the correction information.
    Type: Grant
    Filed: May 23, 2012
    Date of Patent: December 31, 2019
    Assignee: Nuance Communications, Inc.
    Inventors: Martin Labsky, Jan Kleindienst, Tomas Macek, David Nahamoo, Jan Curin, William F. Ganong, III
  • Patent number: 10515151
    Abstract: Disclosed methods and systems are directed to concept identification and capture. The methods and systems may include receiving, by a device, a first natural language input comprising one or more terms, and analyzing the first natural language input via a natural language processing engine to identify one or more named entities associated with the one or more terms, wherein each of the one or more named entities is associated with at least one category of a plurality of categories. The methods and systems may also include detecting a text field configured to receive text, the text field being associated with one of the plurality of categories, and inputting into the text field one of the one or more identified named entities based on the text field being associated with a same category as the one of the one or more named entities.
    Type: Grant
    Filed: August 18, 2014
    Date of Patent: December 24, 2019
    Assignee: Nuance Communications, Inc.
    Inventor: Matthieu Hebert
  • Patent number: 10515719
    Abstract: An assignment device (1) assigns word class information (WKI) to one or more words of text information (ETI). Based on word-class sequence information (WK-AI) formed from this assigned word class information (WKI), actions (A) are executed in order to notify the user of conflicts or to provide the user with background information (HI) relating to words in the text information (TT).
    Type: Grant
    Filed: May 29, 2018
    Date of Patent: December 24, 2019
    Assignee: Nuance Communications, Inc.
    Inventors: Matthias Helletzgruber, Kresimir Rajic
  • Publication number: 20190385202
    Abstract: Techniques for medical coding include applying a natural language understanding (NLU) engine to a free-form text documenting a clinical patient encounter, to derive a first set of one or more medical billing codes for the clinical patient encounter and a link between each code in the first set and a corresponding portion of the free-form text. The first set of codes may be compared to a second set of one or more medical billing codes approved by one or more human users for the patient encounter, to identify at least one code in the first set that overlaps with at least one code in the second set. The code in the second set approved by the one or more human users may be retained instead of the overlapping code in the first set derived by the NLU engine.
    Type: Application
    Filed: April 26, 2019
    Publication date: December 19, 2019
    Applicant: Nuance Communications, Inc.
    Inventors: Gregory Reiser, Howard Maurice D'Souza, Aparna Subramanian, Regina Marie Spitznagel