Patents Assigned to Nuance Communications
-
Patent number: 10832682Abstract: The method comprises receive first audio comprising speech from a user of a computing device, detecting an end of speech in the first audio, generating an ASR result based, at least in part, on a portion of the first audio prior to the detected end of speech, determining whether a valid action can be performed by a speech-enabled application installed on the computing device using the ASR result, and processing second audio when it is determined that a valid action cannot be performed by the speech-enabled application using the ASR result.Type: GrantFiled: February 6, 2020Date of Patent: November 10, 2020Assignee: Nuance Communications, Inc.Inventor: Mark Fanty
-
Patent number: 10824662Abstract: According to some aspects, a method for aligning a first data source and a second data source during a plurality of iterations comprising a current iteration and a previous iteration is provided. The method comprises generating at least one property alignment hypothesis between at least one first property of the first data source and at least one second property of the second data source; generating a plurality of instance alignment hypotheses between a respective first plurality of instances of the first data source and a respective second plurality of instances of the second data source; and verifying at least one property alignment hypothesis and/or at least one of the plurality of instance alignment hypotheses. Generating the at least one property alignment hypothesis and/or generating the plurality of instance alignment hypotheses is based, at least in part, on at least one property alignment hypothesis and/or at least one instance alignment hypothesis verified during the previous iteration.Type: GrantFiled: October 13, 2015Date of Patent: November 3, 2020Assignee: Nuance Communications, Inc.Inventors: David L. Martin, Peter Zei-Chan Yeh, Peter Frederick Patel-Schneider, Jan Noessner
-
Patent number: 10817672Abstract: Methods and apparatus for natural language understanding (NLU) processing based on user-specified interests. Information specifying a weight for each of a plurality of domains is received via a user interface. The plurality of domains each relates to a potential area of interest for the user, and the weight for a domain from among the plurality of domains indicates a level of interest for the user in the domain. A ranking classifier used to rank NLU hypotheses generated by an NLU engine is trained using training data from which features are, at least in part, based on the information specifying a weight for each of the plurality of domains.Type: GrantFiled: October 1, 2014Date of Patent: October 27, 2020Assignee: Nuance Communications, Inc.Inventor: Matthieu Hebert
-
Patent number: 10818299Abstract: A method of verifying a user identity using a Web-based multimodal interface can include sending, to a remote computing device, a multimodal markup language document that, when rendered by the remote computing device, queries a user for a user identifier and causes audio of the user's voice to be sent to a multimodal, Web-based application. The user identifier and the audio can be received at about a same time from the client device. The audio can be compared with a voice print associated with the user identifier. The user at the remote computing device can be selectively granted access to the system according to a result obtained from the comparing step.Type: GrantFiled: May 12, 2014Date of Patent: October 27, 2020Assignee: Nuance Communications, Inc.Inventors: David Jaramillo, Gerald M. McCobb
-
Patent number: 10810996Abstract: A system, method and computer-readable storage device provides an improved speech processing approach in which hyper parameters used for speech recognition are modified dynamically or in batch mode rather than fixed statically. The method includes estimating, via a model trained on audio data and/or metadata, a set of parameters useful for performing automatic speech recognition, receiving speech at an automatic speech recognition system, applying, by the automatic speech recognition system, the set of parameters to processing the speech to yield text and outputting the text from the automatic speech recognition system.Type: GrantFiled: July 31, 2018Date of Patent: October 20, 2020Assignee: NUANCE COMMUNICATIONS, INC.Inventors: Daniel Willett, Yang Sun, Paul Joseph Vozila, Puming Zhan
-
Patent number: 10811004Abstract: An ontology stores information about a domain of an automatic speech recognition (ASR) application program. The ontology is augmented with information that enables subsequent automatic generation of a speech understanding grammar for use by the ASR application program. The information includes hints about how a human might talk about objects in the domain, such as preludes (phrases that introduce an identification of the object) and postludes (phrases that follow an identification of the object).Type: GrantFiled: March 28, 2013Date of Patent: October 20, 2020Assignee: Nuance Communications, Inc.Inventors: Stephen Douglas Peters, RĂ©al Tremblay
-
Patent number: 10809970Abstract: A method, computer program product, and computing system for obtaining encounter information of a patient encounter, wherein the encounter information includes machine vision encounter information; processing the machine vision encounter information to identify one or more humanoid shapes; and steering one or more audio recording beams toward the one or more humanoid shapes to capture audio encounter information.Type: GrantFiled: February 8, 2019Date of Patent: October 20, 2020Assignee: Nuance Communications, Inc.Inventors: Daniel Paulino Almendro Barreda, Dushyant Sharma, Joel Praveen Pinto, Uwe Helmut Jost, Patrick A. Naylor
-
Patent number: 10803871Abstract: Methods described herein provide functionality for automatic speech recognition (ASR). One such embodiment performs speech recognition using received speech recognition result candidates, where the received candidates were generated by performing Statistical Language Model (SLM) based speech recognition on one or more frames of audio data. In turn, such an embodiment transmits results of the speech recognition, performed using the received speech recognition result candidates, to a user device via a communications network. Results of the speech recognition are available with lower latency than pure cloud based ASR solution.Type: GrantFiled: November 26, 2018Date of Patent: October 13, 2020Assignee: Nuance Communications, Inc.Inventors: Carl Benjamin Quillen, Naveen Parihar
-
Patent number: 10795528Abstract: A method of providing a task assistant to provide an interface to an application, the method comprising activating the task assistant, the activation having an associated visual display. The method in one embodiment includes receiving input from a user through multimodal input including a plurality of speech input, typing input, and touch input, interpreting the input, and providing a formatted query to the application, receiving data from the application in response to the query, and providing a response to the user through multimodal output including a plurality of: speech output, text output, non-speech audio output, haptic output, and visual non-text output, wherein the task assistant has a plurality of active states, each of the active states having an associated visual display.Type: GrantFiled: March 6, 2013Date of Patent: October 6, 2020Assignee: Nuance Communications, Inc.Inventors: Elizabeth Ann Dykstra-Erickson, David Andrew Mauro, Paweena Attayadmawittaya, Aimee Piercy, Susan Dawnstarr Daniel
-
Publication number: 20200311343Abstract: Cascaded models may be applied to extract facts from a medical text. A first model may be applied to at least a portion of the medical text. The first model extracts at least one first medical fact. The at least one first medical fact is linked to at least first text in the at least a portion of the medical text. A second model may be applied to the first text. The second model extracts at least one second fact that is an attribute of the at least one first medical fact.Type: ApplicationFiled: November 1, 2019Publication date: October 1, 2020Applicant: Nuance Communications, Inc.Inventors: Neal E. Snider, Brian William Delaney, Girija Yegnanarayanan, Radu Florian, Martin Franz, Scott McCarley, John F. Pitrelli, Imed Zitouni, Salim E. Roukos
-
Patent number: 10789950Abstract: A multi-mode voice controlled user interface is described. The user interface is adapted to conduct a speech dialog with one or more possible speakers and includes a broad listening mode which accepts speech inputs from the possible speakers without spatial filtering, and a selective listening mode which limits speech inputs to a specific speaker using spatial filtering. The user interface switches listening modes in response to one or more switching cues.Type: GrantFiled: January 22, 2018Date of Patent: September 29, 2020Assignee: NUANCE COMMUNICATIONS, INC.Inventors: Tobias Wolff, Markus Buck, Tim Haulick, Suhadi
-
Patent number: 10789426Abstract: Aspects of the disclosure are directed to natural language processing. An input interface of a computing device receives input (e.g., speech input) and generates a digital signal corresponding to that input. Text corresponding to the digital signal is obtained, and the text is processed using each of a context-free and a context-specific linguistic model to generate linguistic processing results for that text. The text and linguistic processing results may be processed using a NLU model to generate an NLU recognition result corresponding to the input received at the input interface. The text and the linguistic processing results may also be annotated and used to train a NLU model. The linguistic processing results may relate to, e.g., the tokenization of portions of the text, the normalization of portions of the text, sequences of normalizations for portions of the text, and rankings and prioritization of the linguistic processing results.Type: GrantFiled: October 22, 2018Date of Patent: September 29, 2020Assignee: Nuance Communications, Inc.Inventors: Jean-Francois Lavallee, Kenneth W. D. Smith
-
Patent number: 10789539Abstract: Aspects of the disclosure are directed to natural language processing or natural language understanding and may include a determination of a probabilistic or probability-based ranking of potential results. For example, natural language input may be received such as speech or text. Natural language processing may be performed to determine one or more potential results for the input. A pairwise classifier may be used to determine a score for element pairs in the potential results. Based on the scores, probabilities for the element pairs may be determined. Based on the probabilities for the element pairs, further probabilities may be determined such as by estimating the probability that a current result is the top rank or best choice. Based on the estimated probabilities that the current result is the top rank or best choice, a ranking may be determined, which may form the basis for natural language understanding output.Type: GrantFiled: October 12, 2016Date of Patent: September 29, 2020Assignee: Nuance Communications, Inc.Inventor: Jean-Francois Lavallee
-
Patent number: 10785173Abstract: A method in accordance with the present disclosure may include receiving a message at a mobile computing device and performing natural language processing (NLP) based interpretation of the message. Embodiments may further include suggesting at least one of an action and an application configured to perform the action, the suggestion based upon, at least in part, the NLP-based interpretation of the message.Type: GrantFiled: July 3, 2014Date of Patent: September 22, 2020Assignee: Nuance Communications, Inc.Inventors: Daniel Willett, William F. Ganong, III
-
Patent number: 10783159Abstract: Techniques for question answering involve receiving, from a user, a text input expressing a question in natural language. In response to the question, a text output expressing an answer to the question may be generated. A plurality of documents comprising natural language text may be analyzed, involving mapping the question to one or more hypotheses, analyzing at least one passage of text in at least one of the documents to determine whether the passage entails at least one of the hypotheses, and in response to determining that the passage entails at least one of the hypotheses, identifying the passage as providing supporting evidence for the answer to the question. The answer and the at least one passage identified as providing supporting evidence for the answer may be presented to the user in response to the text input.Type: GrantFiled: December 18, 2014Date of Patent: September 22, 2020Assignee: Nuance Communications, Inc.Inventors: Marisa Ferrara Boston, Richard Stamford Crouch, Ali Erdem Ozcan, Peter Stubley
-
Patent number: 10783139Abstract: A method of providing a task assistant to provide an interface to an application is described. The method comprises receiving input from a user through multimodal input including a plurality of speech input, typing input, and touch input, interpreting the input, and providing a formatted query to the application, receiving data from the application in response to the query, and providing a response to the user through multimodal output including a plurality of: speech output, text output, non-speech audio output, haptic output, and visual non-text output.Type: GrantFiled: March 6, 2013Date of Patent: September 22, 2020Assignee: Nuance Communications, Inc.Inventors: David Andrew Mauro, Henri Bouvier, Stephen Douglas Peters, Elizabeth Ann Dykstra-Erickson, Susan Dawnstarr Daniel, Aimee Piercy, Paweena Attayadmawittaya, Andrew Jonathan Watson
-
Patent number: 10776073Abstract: A system, method and computer-readable storage device are disclosed for managing a mute and unmute feature on a device which is used to communicate data in a communication conference. The method includes detecting, when the device is set to mute, whether the user is speaking and whether the speech is meant for the conference. Background noises are distinguished from the speech of the user. If the user is speaking and the device is set to mute, the device will automatically switch to and unmute setting such that people in the indication conference can hear the user speak. Facial recognition, and gaze detection or other data can also be used to determine when to automatically mute or unmute the device and can aid in inferring an intent of the user to speak to the conference participants.Type: GrantFiled: October 8, 2018Date of Patent: September 15, 2020Assignee: NUANCE COMMUNICATIONS, INC.Inventors: Nils Lenke, Eric Montague, William F. Ganong, III
-
Patent number: 10769186Abstract: In an embodiment, a method includes determining, based on a received query and contextual information, candidate reasoners to respond to a received query to select a candidate reasoner. A reasoner or candidate reasoner is a module that translates information from a sensor, user settings, or other source, into additional or revised fields for a query. The method further includes generating, at each candidate reasoner determined, additional or revised query fields based on the contextual information and a rule of a rule database. The method further includes merging the additional query fields for each candidate reasoner based on a confidence score or other metric of each corresponding candidate reasoner. The confidence score can be based on applicability of the contextual information to the received query. The method further includes providing an enhanced query having the additional or revised query fields.Type: GrantFiled: October 16, 2017Date of Patent: September 8, 2020Assignee: Nuance Communications, Inc.Inventors: Peter Yeh, Ezra Story, Prateek Jain
-
Patent number: 10755702Abstract: An arrangement is described for conducting natural language dialogs with a user on a mobile device using automatic speech recognition (ASR) and multiple different dialog applications. A user interface provides for user interaction with the dialogue applications in natural language dialogs. An ASR engine processes unknown speech inputs from the user to produce corresponding speech recognition results. A dialog concept module develops dialog concept items from the speech recognition results and stores the dialog concept items and additional dialog information in a dialog concept database. A dialog processor accesses dialog concept database information and coordinates operation of the ASR engine and the dialog applications to conduct with the user a plurality of separate parallel natural language dialogs in the dialog applications.Type: GrantFiled: July 21, 2016Date of Patent: August 25, 2020Assignee: Nuance Communications, Inc.Inventors: Jean-Phillipe Robichaud, Matthieu Hebert
-
Patent number: 10754925Abstract: Techniques for training a natural language understanding (NLU) engine may include generating a first annotation of free-form text documenting a healthcare patient encounter and a link between the first annotation and a corresponding portion of the text, using the NLU engine. A second annotation of the text and a link between the second annotation and a corresponding portion of the text may be received from a human user. The first annotation and its corresponding link may be merged with the second annotation and its corresponding link. Training data may be provided to the engine in the form of the text and the merged annotations and links.Type: GrantFiled: June 4, 2014Date of Patent: August 25, 2020Assignee: Nuance Communications, Inc.Inventors: Howard D'Souza, Regina Spitznagel, Debjani Sarkar