Patents Assigned to Nuance Communications, Inc.
  • Patent number: 11657828
    Abstract: Embodiments improve speech data quality through training a neural network for de-noising audio enhancement. One such embodiment creates simulated noisy speech data from high quality speech data. In turn, training, e.g., deep normalizing flow training, is performed on a neural network using the high quality speech data and the simulated noisy speech data to train the neural network to create de-noised speech data given noisy speech data. Performing the training includes minimizing errors in the neural network according to at least one of (i) a decoding error of an Automatic Speech Recognition (ASR) system processing current de-noised speech data results generated by the neural network during the training and (ii) spectral distance between the high quality speech data and the current de-noised speech data results generated by the neural network during the training.
    Type: Grant
    Filed: January 31, 2020
    Date of Patent: May 23, 2023
    Assignee: Nuance Communications, Inc.
    Inventor: Carl Benjamin Quillen
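    Illustration (not from the patent): a minimal sketch of two ingredients the abstract above names, creating simulated noisy speech from clean speech and measuring a spectral distance between two signals. The function names, the fixed signal-to-noise ratio, and the choice of log-spectral distance are assumptions made for this example; the neural network and its training are not shown.

```python
import numpy as np

def add_noise_at_snr(clean, noise, snr_db):
    """Mix noise into clean speech so the mixture has the requested SNR in dB."""
    noise = np.resize(noise, clean.shape)  # tile or trim the noise to the clean length
    clean_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    # Choose scale so that clean_power / (scale**2 * noise_power) == 10**(snr_db / 10).
    scale = np.sqrt(clean_power / (noise_power * 10 ** (snr_db / 10)))
    return clean + scale * noise

def log_spectral_distance(reference, estimate, frame_len=256, hop=128, eps=1e-10):
    """Mean log-spectral distance in dB between two equal-length signals."""
    window = np.hanning(frame_len)

    def log_power_spectrum(x):
        starts = range(0, len(x) - frame_len + 1, hop)
        frames = np.stack([x[s:s + frame_len] * window for s in starts])
        return 10 * np.log10(np.abs(np.fft.rfft(frames, axis=1)) ** 2 + eps)

    diff = log_power_spectrum(reference) - log_power_spectrum(estimate)
    return float(np.mean(np.sqrt(np.mean(diff ** 2, axis=1))))

rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
clean = np.sin(2 * np.pi * 220 * t)   # stand-in for high quality speech (assumption)
noise = rng.standard_normal(16000)    # stand-in for recorded noise (assumption)
noisy = add_noise_at_snr(clean, noise, snr_db=10.0)
print(log_spectral_distance(clean, noisy))
```

    In a training loop, a distance of this kind could serve as one error term between the clean reference and the de-noised output.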
  • Publication number: 20230142081
    Abstract: A method of Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA) includes: recording, by a voice CAPTCHA module, a speech spoken by a user; determining, by a voice biometric service (VBS), whether a voiceprint matching the user's speech exists; and if a voiceprint matching the user's speech exists, verifying the user as a human user by the VBS. If a voiceprint matching the user's speech does not exist, the VBS i) generates a unique voiceprint for the user based on the user's speech, and/or ii) determines whether the user's speech is at least one of a synthetically generated speech and a previously recorded audio being played back. The user can perform a guest checkout without logging into the voice CAPTCHA module, in which case the VBS compares previously used voiceprints to the user's speech.
    Type: Application
    Filed: November 10, 2021
    Publication date: May 11, 2023
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: John Benjamin FISLER, Nikos POLIS, Christopher JENNISON, Andrew MATKIN, David ARDMAN, Nirvana TIKKU
  • Publication number: 20230137737
    Abstract: A method of enabling a virtual assistant (VA) serving a user to dynamically acquire contextual information regarding a digital media environment accessed by the user includes: extracting, by an analysis engine, the contextual information dynamically from at least one of media content accessed by the user and webpage content accessed by the user; and injecting, by the analysis engine, the extracted contextual information into a VA memory to serve the user. The analysis engine is configured to analyze the extracted contextual information using at least one machine learning (ML) model. The extracted contextual information includes at least one of topics, intents, entities, sentiments, and products of interest. The at least one ML model includes at least one of Automatic Speech Recognition (ASR), Natural Language Understanding (NLU), speaker diarization, sentiment analysis on media streams, and web analytics for product focus.
    Type: Application
    Filed: November 4, 2021
    Publication date: May 4, 2023
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Eduardo OLVERA, Abhishek ROHATGI
  • Patent number: 11631410
    Abstract: A method, computer program product, and computing system for receiving a signal from each microphone of a plurality of microphones, thus defining a plurality of signals. One or more inter-microphone gain-based augmentations may be performed on the plurality of signals, thus defining one or more inter-microphone gain-augmented signals.
    Type: Grant
    Filed: May 7, 2021
    Date of Patent: April 18, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Dushyant Sharma, Patrick A. Naylor, Rong Gong, Stanislav Kruchinin, Ljubomir Milanovic
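    Illustration (not from the patent): one plausible reading of an inter-microphone gain-based augmentation is to scale each microphone channel by its own random gain. The function name, the uniform gain range in dB, and the array layout are assumptions made for this sketch.

```python
import numpy as np

def augment_inter_mic_gain(signals, max_gain_db=6.0, rng=None):
    """Apply an independent random gain to each microphone channel.

    signals: array of shape (num_mics, num_samples).
    Returns the augmented signals and the per-channel gains in dB.
    """
    rng = np.random.default_rng() if rng is None else rng
    gains_db = rng.uniform(-max_gain_db, max_gain_db, size=signals.shape[0])
    gains = 10 ** (gains_db / 20)  # convert dB to linear amplitude
    return signals * gains[:, None], gains_db

rng = np.random.default_rng(1)
signals = rng.standard_normal((4, 1000))  # 4 hypothetical microphones
augmented, gains_db = augment_inter_mic_gain(signals, max_gain_db=6.0, rng=rng)
print(gains_db)
```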
  • Patent number: 11631411
    Abstract: A method, computer program product, and computing system for receiving information associated with an acoustic environment. Acoustic metadata associated with audio encounter information received by a first microphone system may be received. One or more speaker representations may be defined based upon, at least in part, the acoustic metadata associated with the audio encounter information and the information associated with the acoustic environment. One or more portions of the audio encounter information may be labeled with the one or more speaker representations and a speaker location within the acoustic environment.
    Type: Grant
    Filed: May 10, 2021
    Date of Patent: April 18, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Dushyant Sharma, Patrick A. Naylor
  • Patent number: 11620988
    Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a speaker interacting with a speech interface using a set of allocated resources, the set of allocated resources including bandwidth, processor time, memory, and storage. The method records metrics associated with the recognized speech, and after recording the metrics, modifies at least one of the allocated resources in the set of allocated resources commensurate with the recorded metrics. The method recognizes additional speech from the speaker using the modified set of allocated resources. Metrics can include a speech recognition confidence score, processing speed, dialog behavior, requests for repeats, negative responses to confirmations, and task completions.
    Type: Grant
    Filed: December 9, 2019
    Date of Patent: April 4, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Andrej Ljolje, Alistair D. Conkie, Ann K. Syrdal
  • Patent number: 11605381
    Abstract: A method, computer program product, and computing system for receiving information associated with an acoustic environment. Acoustic metadata associated with audio encounter information received by a first microphone system may be received. One or more speaker representations may be defined based upon, at least in part, the acoustic metadata associated with the audio encounter information and the information associated with the acoustic environment. One or more portions of the audio encounter information may be labeled with the one or more speaker representations and a speaker location within the acoustic environment.
    Type: Grant
    Filed: May 10, 2021
    Date of Patent: March 14, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Dushyant Sharma, Patrick A. Naylor
  • Patent number: 11605448
    Abstract: A method, computer program product, and computing system for visual diarization of an encounter is executed on a computing device and includes obtaining encounter information of a patient encounter. The encounter information is processed to: associate a first portion of the encounter information with a first encounter participant, and associate at least a second portion of the encounter information with at least a second encounter participant. A visual representation of the encounter information is rendered. A first visual representation of the first portion of the encounter information is rendered that is temporally-aligned with the visual representation of the encounter information. At least a second visual representation of the at least a second portion of the encounter information is rendered that is temporally-aligned with the visual representation of the encounter information.
    Type: Grant
    Filed: August 8, 2018
    Date of Patent: March 14, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Donald E. Owen, Garret N. Erskine, Mehmet Mert Öz, Daniel Paulino Almendro Barreda
  • Patent number: 11593572
    Abstract: A system and method incorporate prior knowledge into the optimization and regularization of a classification and regression model. The optimization may be a regularization process and the prior knowledge may be incorporated through adjustment of a cost function. A method of at least one processor developing a classification and regression model may be provided. The method may be implemented by at least one processor that implements classification and regression model functionality, including receiving training data and adjusting the model according to the training data; testing the classification and regression model; and employing prior knowledge during an optimization of the classification and regression model. The regularizing can include adjusting feature weights according to prior knowledge. In various embodiments, such systems and methods can be used in the processing of language inputs, e.g., speech and/or text inputs, to achieve greater interpretation accuracy.
    Type: Grant
    Filed: August 26, 2020
    Date of Patent: February 28, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Jean-François Lavallée, Jean-Michel Attendu, Réal Tremblay
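    Illustration (not from the patent): one common way to fold prior knowledge into a cost function is a penalty that pulls feature weights toward prior values. The sketch below assumes a linear model with squared error and the cost mean((Xw - y)^2) + strength * ||w - prior||^2; the function name and all parameters are hypothetical.

```python
import numpy as np

def fit_with_prior(X, y, prior_weights, strength):
    """Minimize mean((X @ w - y)**2) + strength * ||w - prior_weights||**2.

    Setting the gradient to zero gives the linear system solved below.
    """
    n, d = X.shape
    A = X.T @ X / n + strength * np.eye(d)
    b = X.T @ y / n + strength * prior_weights
    return np.linalg.solve(A, b)

rng = np.random.default_rng(0)
X = rng.standard_normal((50, 3))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w + 0.1 * rng.standard_normal(50)
prior = np.array([1.0, -2.0, 0.0])  # hypothetical prior belief about the weights

w_weak = fit_with_prior(X, y, prior, strength=1e-6)   # data dominates
w_strong = fit_with_prior(X, y, prior, strength=1e6)  # prior dominates
print(w_weak, w_strong)
```

    A small strength leaves the fit close to ordinary least squares, while a large strength keeps the weights close to the prior.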
  • Patent number: 11581077
    Abstract: A method, computer program product, and computing system for proactive encounter scanning is executed on a computing device and includes obtaining encounter information of a patient encounter. The encounter information is proactively processed to determine if the encounter information is indicative of one or more medical conditions and to generate one or more result sets. The one or more result sets are provided to the user.
    Type: Grant
    Filed: September 7, 2021
    Date of Patent: February 14, 2023
    Assignee: Nuance Communications, Inc.
    Inventor: Donald E. Owen
  • Patent number: 11570587
    Abstract: According to at least one aspect, a system for remotely controlling an application installed on a device is provided. The system includes at least one processor and at least one computer-readable storage medium storing instructions which program the at least one processor to identify a task for the application installed on the device to perform, transmit a binary short message service (SMS) message to the device including a task code associated with the identified task, receive an information request from the device responsive to the binary SMS message, and transmit task information to the device responsive to receiving the information request.
    Type: Grant
    Filed: January 26, 2017
    Date of Patent: January 31, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Abhishek Rohatgi, John Dolan Heater, Flaviu Negrean, Mark P. Hanson
  • Patent number: 11568736
    Abstract: A remote control device includes a digital audio storage device, a talk button, and an optical distance measurer. The digital audio storage device is configured to continually record an audio input for a specific amount of time. The talk button is coupled to the digital audio storage device and is configured to initiate a transmission of the audio input to a set-top box device. The optical distance measurer is coupled to the talk button and is configured to automatically measure a distance to a user in response to the talk button being pressed.
    Type: Grant
    Filed: December 22, 2017
    Date of Patent: January 31, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Hisao Chang, Iker Arizmendi
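    Illustration (not from the patent): continually recording audio for a specific amount of time can be modeled with a fixed-size ring buffer that discards the oldest samples. The class name and the tiny sample rate are assumptions chosen to keep the example readable.

```python
from collections import deque

class RollingAudioBuffer:
    """Keep only the most recent `seconds` of audio samples."""

    def __init__(self, seconds, sample_rate):
        self._samples = deque(maxlen=int(seconds * sample_rate))

    def write(self, chunk):
        self._samples.extend(chunk)  # the oldest samples fall off automatically

    def snapshot(self):
        return list(self._samples)

buf = RollingAudioBuffer(seconds=2, sample_rate=4)  # holds 8 samples
for start in range(0, 20, 5):
    buf.write(range(start, start + 5))              # integers stand in for audio samples
recent = buf.snapshot()
print(recent)  # [12, 13, 14, 15, 16, 17, 18, 19]
```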
  • Patent number: 11561775
    Abstract: A method, computer program product, and computing system for defining a library of functional modules; enabling a user to select a plurality of functional modules from the library of functional modules; and enabling the user to visually arrange the plurality of functional modules to form a conversational application.
    Type: Grant
    Filed: October 14, 2020
    Date of Patent: January 24, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: David Ardman, Andrew Matkin, Nirvana Tikku, John B. Fisler, Matthias Haack, Christopher A. Starbird, Bryan A. Reif, Alfred Sterphone, III, Nikos Polis, Michael S. Gourlay, Robert A. Follett
  • Patent number: 11550552
    Abstract: A method, computer program product, and computing system for enabling usage of a conversational application by a plurality of users; gathering usage data concerning usage of the conversational application by the plurality of users; defining a visual representation of the conversational application; and overlaying the usage data onto the visual representation of the conversational application to generate visual traffic flow data.
    Type: Grant
    Filed: October 14, 2020
    Date of Patent: January 10, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: David Ardman, Andrew Matkin, Nirvana Tikku, Abhishek Rohatgi, Marco Antonio Padron Chavez, Flaviu Gelu Negrean, Gabrielle R. Martone
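    Illustration (not from the patent): overlaying usage data onto a representation of a conversational application can be sketched as counting how often users traverse each edge of a dialog graph. The graph, the node names, and the session logs below are hypothetical.

```python
from collections import Counter

# Hypothetical dialog graph: node -> list of possible next nodes.
APP_GRAPH = {
    "greeting": ["check_balance", "transfer"],
    "check_balance": ["goodbye"],
    "transfer": ["confirm", "goodbye"],
    "confirm": ["goodbye"],
    "goodbye": [],
}

def overlay_traffic(graph, sessions):
    """Count how often each edge of the graph was traversed across user sessions."""
    counts = Counter()
    for path in sessions:
        counts.update(zip(path, path[1:]))  # consecutive node pairs are edges
    # Keep every edge of the graph, including edges no user followed.
    return {(src, dst): counts[(src, dst)]
            for src, dsts in graph.items() for dst in dsts}

sessions = [
    ["greeting", "check_balance", "goodbye"],
    ["greeting", "transfer", "confirm", "goodbye"],
    ["greeting", "transfer", "goodbye"],
]
traffic = overlay_traffic(APP_GRAPH, sessions)
for (src, dst), n in traffic.items():
    print(f"{src} -> {dst}: {n}")
```

    The per-edge counts could then drive a visual cue such as line thickness when the graph is drawn.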
  • Patent number: 11545136
    Abstract: A method for removing private data from an acoustic model includes capturing speech from a large population of users, creating a text-to-speech voice from at least a portion of the large population of users, discarding speech data from a database of speech, creating text-to-speech waveforms from the text-to-speech voice and the new database of speech with the discarded speech data, and generating an automatic speech recognition model using the text-to-speech waveforms.
    Type: Grant
    Filed: October 21, 2019
    Date of Patent: January 3, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Vincent Laurent Pollet, Carl Benjamin Quillen, Philip Charles Woodland, William F. Ganong, III, Steven Hoskins
  • Patent number: 11537697
    Abstract: In accordance with aspects of the inventive concepts, a system and method provide ongoing authentication through processing of data that includes biometric data. Such systems and methods can use, as examples, face recognition and/or voice biometric data, or other biometric data, to identify the user in real-time and thereafter during an ongoing session. In various embodiments, the system can continuously or repeatedly authenticate one or more users using biometric data to control access to information and/or functions in real (or near real) time. The system can be configured to optimize and/or minimize resource consumption associated with the ongoing authentication process.
    Type: Grant
    Filed: August 12, 2019
    Date of Patent: December 27, 2022
    Assignee: Nuance Communications, Inc.
    Inventors: Simon Falardeau, Thomas Stanton
  • Publication number: 20220406295
    Abstract: An end-to-end automatic speech recognition (ASR) system includes: a first encoder configured for close-talk input captured by a close-talk input mechanism; a second encoder configured for far-talk input captured by a far-talk input mechanism; and an encoder selection layer configured to select at least one of the first and second encoders for use in producing ASR output. The selection is made based on at least one of short-time Fourier transform (STFT), Mel-frequency Cepstral Coefficient (MFCC) and filter bank derived from at least one of the close-talk input and the far-talk input. If signals from both the close-talk input mechanism and the far-talk input mechanism are present for a speech segment, the encoder selection layer dynamically selects between the close-talk encoder and the far-talk encoder to select the encoder that better recognizes the speech segment. An encoder-decoder model is used to produce the ASR output.
    Type: Application
    Filed: June 22, 2021
    Publication date: December 22, 2022
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Felix WENINGER, Marco GAUDESI, Ralf LEIBOLD, Puming ZHAN
  • Patent number: 11531807
    Abstract: A method, computer program product, and computer system for encoding, by a computing device, a transcript and text macros into vector representations. A word by word report may be predicted based upon, at least in part, the encoding. An attention mechanism may be queried based upon, at least in part, a decoder state. An attention distribution may be produced over an encoder output. An interpolation of the encoder output may be produced based upon, at least in part, the attention distribution. The interpolation of the encoder output may be input into a decoder for report modeling that includes text macro location and content.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: December 20, 2022
    Assignee: Nuance Communications, Inc.
    Inventors: Paul Joseph Vozila, Joel Praveen Pinto, Frank Diehl
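    Illustration (not from the patent): querying an attention mechanism with a decoder state, producing an attention distribution over the encoder output, and interpolating that output can be sketched with dot-product attention. The scoring function, shapes, and names are assumptions; the patent does not specify them here.

```python
import numpy as np

def attend(decoder_state, encoder_outputs):
    """Dot-product attention: returns (attention distribution, context vector)."""
    scores = encoder_outputs @ decoder_state         # one score per encoder position
    scores = scores - scores.max()                   # stabilize the softmax
    weights = np.exp(scores) / np.exp(scores).sum()  # attention distribution
    context = weights @ encoder_outputs              # weighted interpolation of encoder outputs
    return weights, context

rng = np.random.default_rng(0)
encoder_outputs = rng.standard_normal((6, 8))  # 6 encoded tokens, 8 dimensions each
decoder_state = rng.standard_normal(8)
weights, context = attend(decoder_state, encoder_outputs)
print(weights.round(3), context.shape)
```

    The context vector is what would be fed to the decoder at the current step.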
  • Patent number: 11515020
    Abstract: A method, computer program product, and computing system for: receiving an initial portion of an encounter record; processing the initial portion of the encounter record to generate initial content for a medical report; receiving one or more additional portions of the encounter record; and processing the one or more additional portions of the encounter record to modify the medical report.
    Type: Grant
    Filed: March 5, 2019
    Date of Patent: November 29, 2022
    Assignee: Nuance Communications, Inc.
    Inventors: Paul Joseph Vozila, Joel Praveen Pinto, Kumar Abhinav, Haibo Li, Marilisa Amoia, Frank Diehl
  • Patent number: 11494166
    Abstract: A method, computer program product, and computing system for enabling a user to select a plurality of functional modules from a library of functional modules; and enabling the user to arrange the plurality of functional modules to form an omnichannel conversational application that includes a first channel and at least a second channel.
    Type: Grant
    Filed: October 14, 2020
    Date of Patent: November 8, 2022
    Assignee: Nuance Communications, Inc.
    Inventors: David Ardman, Andrew Matkin, Nirvana Tikku, John B. Fisler, Matthias Haack, Christopher A. Starbird, Bryan A. Reif, Alfred Sterphone, III, Nikos Polis, Michael S. Gourlay, Robert A. Follett