Patents Examined by Michael Colucci

Method for voice controlling, terminal device, cloud server and system

Patent number: 11127398

Abstract: The embodiment of the disclosure provides a method for voice controlling, a terminal device, a cloud server and a system. The method includes: receiving voice information that the user performs voice controlling on a terminal device; transmitting voice information to the cloud server, so that the cloud server determines, according to the voice information, a voice control and a control instruction that match the voice information in the current interface, and generates a corresponding voice control instruction; receiving the voice control instruction transmitted by the cloud server; and controlling, according to the voice control instruction, a corresponding voice control of the terminal device to perform an operation. The method of the embodiments of the present disclosure achieves controlling over the controls in the interface through the voice, which deepens the controlling degree of the voice over the terminal device, and improves the user experience.

Type: Grant

Filed: December 28, 2018

Date of Patent: September 21, 2021

Assignees: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., SHANGHAI XIAODU TECHNOLOGY CO. LTD.

Inventors: Lichao Xu, Yushu Cao, Lishang Xiao, Lifeng Zhao, Xiangdong Xue, Ji Zhou
Probabilistic word embeddings for text classification

Patent number: 11120223

Abstract: Disclosed are systems, methods, and non-transitory computer-readable media for probabilistic word embeddings for text classification. A text classification system receives a message including a keyword and determines an embedding probability distribution representing the keyword. The text classification system then determines an embedding value for the keyword based on the embedding probability distribution. The text classification system uses the embedding value as input into a set of mathematical functions, yielding a first set of coefficient values for the keyword. Each respective mathematical function from the set corresponds to a respective classification label from a set of classification labels and defines a continuous surface.

Type: Grant

Filed: June 18, 2019

Date of Patent: September 14, 2021

Assignee: SAP SE

Inventors: Gil Katz, Mathis Lamarre
Customizable embedded vocal command sets for a lighting and/or other environmental controller

Patent number: 11119725

Abstract: A control module, for lighting or other environmental control or the like has embedded voice responsive control capability. The recognizable command set supported by the control module is customizable. Software for a data processor configures the control module to identify a user in the vicinity of the control module and change voice response software based at least in part on identity of the user. The changed voice response software configures the control module, to recognize a customized set of vocal commands, e.g. associated with the identified user, in response to speech detected via a microphone and an audio processor of the control module, without requiring network communication. The control module issues control signals to equipment controlled by the module that correspond to the recognized commands of the customized vocal command set.

Type: Grant

Filed: September 27, 2018

Date of Patent: September 14, 2021

Assignee: ABL IP HOLDING LLC

Inventors: Sean P. White, Daniel M. Megginson, Januk Aggarwal, David P. Ramer
Call control service

Patent number: 11115528

Abstract: A technology is described for providing virtual call assistants and call options to identified callers. An example method may include identifying a caller and a call recipient using addressing information included in a call received at a call control service. After identifying the caller, a priority designation assigned to the caller may be obtained from an agent linking profile. An interactive agent linked to the priority designation assigned to the caller may be identified, wherein the interactive agent may be configured to provide prioritized call services. Thereafter, the interactive agent linked to the priority designation may be invoked.

Type: Grant

Filed: January 25, 2018

Date of Patent: September 7, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Siddhartha Shankara Rao, Samuel Rislove Etler, Carlin Wiegner
Word correction using automatic speech recognition (ASR) incremental response

Patent number: 11107475

Abstract: An exemplary automatic speech recognition (ASR) system may receive an audio input including a segment of speech. The segment of speech may be independently processed by general ASR and domain-specific ASR to generate multiple ASR results. A selection between the multiple ASR results may be performed based on respective confidence levels for the general ASR and domain-specific ASR. As incremental ASR is performed, a composite result may be generated based on general ASR and domain-specific ASR.

Type: Grant

Filed: May 9, 2019

Date of Patent: August 31, 2021

Assignee: Rovi Guides, Inc.

Inventor: Jeffry Copps Robert Jose
Adversarial speaker adaptation

Patent number: 11107460

Abstract: Embodiments are associated with a speaker-independent acoustic model capable of classifying senones based on input speech frames and on first parameters of the speaker-independent acoustic model, a speaker-dependent acoustic model capable of classifying senones based on input speech frames and on second parameters of the speaker-dependent acoustic model, and a discriminator capable of receiving data from the speaker-dependent acoustic model and data from the speaker-independent acoustic model and outputting a prediction of whether received data was generated by the speaker-dependent acoustic model based on third parameters.

Type: Grant

Filed: July 2, 2019

Date of Patent: August 31, 2021

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Zhong Meng, Jinyu Li, Yifan Gong
Robust start-end point detection algorithm using neural network

Patent number: 11100932

Abstract: An end detector configured to receive the feature data and detect an end point of a keyword, and a start detector configured to receive an indication of the detected end point and process the feature data associated with corresponding input frames to detect a start point of the keyword. The start detector and end detector comprise neural networks trained through a process using a cross-entropy cost function for non-Region of Target (ROT) frames and a One-Spike Connectionist Temporal Classification cost function for ROT frames.

Type: Grant

Filed: December 20, 2019

Date of Patent: August 24, 2021

Assignee: SYNAPTICS INCORPORATED

Inventors: Saeed Mosayyebpour, Francesco Nesta, Trausti Thormundsson
Binary and multi-class classification systems and methods using one spike connectionist temporal classification

Patent number: 11087213

Abstract: A classification training system for binary and multi-class classification comprises a neural network operable to perform classification of input data, a training dataset including pre-segmented, labeled training samples, and a classification training module operable to train the neural network using the training dataset. The classification training module includes a forward pass processing module, and a backward pass processing module. The backward pass processing module is operable to determine whether a current frame is in a region of target (ROT), determine ROT information such as beginning and length of the ROT and update weights and biases using a cross-entropy cost function and One Spike Connectionist Temporal Classification (OSCTC) cost function. The backward pass module further computes a soft target value using ROT information and computes a signal output error using the soft target value and network output value.

Type: Grant

Filed: December 20, 2019

Date of Patent: August 10, 2021

Assignee: SYNAPTICS INCORPORATED

Inventors: Saeed Mosayyebpour, Trausti Thormundsson, Francesco Nesta
Adaptive interface in a voice-activated network

Patent number: 11087748

Abstract: The systems and methods of the present disclosure generally relate to a data processing system that can identify and surface alternative requests when presented with ambiguous, unclear, or other requests to which a data processing system may not be able to respond. The data processing system can improve the efficiency of network transmissions to reduce network bandwidth usage and processor utilization by selecting alternative requests that are responsive to the intent of the original request.

Type: Grant

Filed: May 11, 2018

Date of Patent: August 10, 2021

Assignee: GOOGLE LLC

Inventors: Gleb Skobeltsyn, Mihaly Kozsevnyikov, Vladimir Vuskovic
Recurrent neural network based acoustic event classification using complement rule

Patent number: 11080600

Abstract: An acoustic event detection and classification system includes a start-end point detector and multi-class acoustic event classification. A classification training system comprises a neural network configured to perform classification of input data, a training dataset including pre-segmented, labeled training samples, and a classification training module configured to train the neural network using the training dataset. The classification training module includes a forward pass processing module, and a backward pass processing module. The backward pass processing module is configured to determine whether a current frame is in a region of target (ROT), determine ROT information such as beginning and length of the ROT and update weights and biases using a cross-entropy cost function and a many-or-one detection (MOOD) cost function. The backward pass module further computes a soft target value using ROT information and computes a signal output error using the soft target value and network output value.

Type: Grant

Filed: December 20, 2019

Date of Patent: August 3, 2021

Assignee: SYNAPTICS INCORPORATED

Inventors: Saeed Mosayyebpour, Trausti Thormundsson
Media presence detection

Patent number: 11069352

Abstract: Described herein is a system for media presence detection in audio. The system analyzes audio data to recognize whether a given audio segment contains sounds from a media source as a way of differentiating recorded media source sounds from other live sounds. In exemplary embodiments, the system includes a hierarchical model architecture for processing audio data segments, where individual audio data segments are processed by a trained machine learning model operating locally, and another trained machine learning model provides historical and contextual information to determine a score indicating the likelihood that the audio data segment contains sounds from a media source.

Type: Grant

Filed: February 18, 2019

Date of Patent: July 20, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Qingming Tang, Ming Sun, Chieh-Chi Kao, Chao Wang, Viktor Rozgic
System and method for generating subjective wellbeing analytics score

Patent number: 11062093

Abstract: A system includes at least one processor to perform natural language processing on text from at least one document and assign the at least one document to at least one subjective wellbeing dimension by comparing the text from the at least one document with a subjective wellbeing dimension filter for each subjective wellbeing dimension, insert the at least one document into at least one bin, each bin associated with a particular subjective wellbeing dimension, and analyze each document in each bin associated with the particular subjective wellbeing dimension to determine a score for each subjective wellbeing dimension and an overall score that is based on each score for each subjective wellbeing dimension.

Type: Grant

Filed: February 18, 2019

Date of Patent: July 13, 2021

Assignee: TSG Technologies, LLC

Inventors: Anthony L Hinrichs, Andrea E DiGiovanni, Willem S Maritz, Anthony M Sardella
Speech-to-text training data based on interactive response data

Patent number: 11062697

Abstract: A device includes a processor configured to, in response to determining that an input phrase includes a first term that is included in a term hierarchy, generate a second phrase by replacing the first term in the input phrase with a second term included in the term hierarchy. The processor is configured to determine that interactive response (IR) training data indicates that the input phrase is associated with a user intent indicator. The processor is configured to determine that user interaction data indicates that a first proportion of user phrases received by an IR system correspond to the user intent indicator. The processor is configured to update speech-to-text training data based on the input phrase and the second phrase so that a second proportion of training phrases of the speech-to-text training data correspond to the user intent indicator. The second proportion is based on the first proportion. A speech-to-text model is trained based on the speech-to-text training data.

Type: Grant

Filed: October 29, 2018

Date of Patent: July 13, 2021

Assignee: International Business Machines Corporation

Inventors: Edward G. Katz, Alexander C. Tonetti, John A. Riendeau, Sean T. Thatcher
Talk back from actions in applications

Patent number: 11056105

Abstract: Embodiments of the present invention provide systems, methods, and computer storage media directed to providing talk back automation for applications installed on a mobile device. To do so actions (e.g., talk back features) can be created, via the digital assistant, by recording a series of events that are typically provided by a user of the mobile device when manually invoking the desired action. At a desired state, the user may select an object that represents the output of the application. The recording embodies the action and can be associated with a series of verbal commands that the user would typically announce to the digital assistant when an invocation of the action is desired. In response, the object is verbally communicated to the user via the digital assistant, a different digital assistant, or even another device. Alternatively, the object may be communicated to the same application or another application as input.

Type: Grant

Filed: March 26, 2018

Date of Patent: July 6, 2021

Assignee: AIQUDO, INC

Inventors: Mark Robinson, Matan Levi, Kiran Bindhu Hemaraj, Rajat Mukherjee
End-to-end streaming keyword spotting

Patent number: 11056101

Abstract: A method for training hotword detection includes receiving a training input audio sequence including a sequence of input frames that define a hotword that initiates a wake-up process on a device. The method also includes feeding the training input audio sequence into an encoder and a decoder of a memorized neural network. Each of the encoder and the decoder of the memorized neural network include sequentially-stacked single value decomposition filter (SVDF) layers. The method further includes generating a logit at each of the encoder and the decoder based on the training input audio sequence. For each of the encoder and the decoder, the method includes smoothing each respective logit generated from the training input audio sequence, determining a max pooling loss from a probability distribution based on each respective logit, and optimizing the encoder and the decoder based on all max pooling losses associated with the training input audio sequence.

Type: Grant

Filed: December 10, 2019

Date of Patent: July 6, 2021

Assignee: Google LLC

Inventors: Raziel Alvarez Guevara, Hyun Jin Park, Patrick Violette
Understanding natural language using split-phrase tumbling-frequency phrase-chain parsing

Patent number: 11055487

Abstract: Of the four primary approaches to processing language by computer, only the parsing approach considers the semantic and syntactic components from the start. In doing so, however, the required resources expand rapidly as the scope of the language processed increases. And as that scope increases, the performance of parsing systems decreases. A natural language processor uses a tumbling-frequency phrase chain parser as described herein which circumvents this resource-intensive step in parsing, while quickly and almost effortlessly arriving at the next step in natural-language processing with far more accurate results involving a partitioning dictionary and phrase chains.

Type: Grant

Filed: April 22, 2020

Date of Patent: July 6, 2021

Assignee: QwikIntelligence, Inc.

Inventors: William Randolph Ford, Alfred Rives Berkeley, III, Mark Alexander Newman
Communication method, server, and device

Patent number: 11049499

Abstract: A method includes receiving information from each device in multiple devices of one user, where the information includes a device type, determining, according to at least the received information, a device responding to a voice instruction in the multiple devices, and sending a message to a device different from the device responding to a voice instruction in the multiple devices, where the message is used to instruct a voice assistant of the different device not to respond to the voice instruction.

Type: Grant

Filed: August 1, 2016

Date of Patent: June 29, 2021

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Xiaojuan Li, Yahui Wang, Wenmei Gao
Contemporaneous machine-learning analysis of audio streams

Patent number: 11049497

Abstract: Described techniques select portions of an audio stream for transmission to a trained machine learning application, which generates response recommendations in real-time. This real-time response is facilitated by the system identifying, selecting and transmitting those portions of the audio stream likely to be most relevant to the conversation. Portions of an audio stream less likely to be relevant to the conversation are identified accordingly and not transmitted. The system may identify the relevant portions of an audio stream by detecting events in a contemporaneous event stream, use a trained machine learning model to identify events in an audio stream, or both.

Type: Grant

Filed: October 29, 2020

Date of Patent: June 29, 2021

Assignee: CRESTA INTELLIGENCE INC.

Inventors: Tianlin Shi, Kenneth George Oetzel
Systems and methods for mixed setting training for slot filling machine learning tasks in a machine learning task-oriented dialogue system

Patent number: 11043208

Abstract: Systems and methods for intelligently training a subject machine learning model includes identifying new observations comprising a plurality of distinct samples unseen by a target model during a prior training; creating an incremental training corpus based on randomly sampling a collection of training data samples that includes a plurality of new observations and a plurality of historical training data samples used in the prior training of the target model; implementing a first training mode that includes an incremental training of the target model using samples from the incremental training corpus as model training input; computing performance metrics of the target model based on the incremental training; evaluating the performance metrics of the target model against training mode thresholds; and selectively choosing based on the evaluation one of maintaining the first training mode and automatically switching to a second training mode that includes a full retraining of the target model.

Type: Grant

Filed: February 19, 2021

Date of Patent: June 22, 2021

Assignee: Clinc, Inc.

Inventors: Daniel C. Michelin, Jonathan K. Kummerfeld, Kevin Leach, Stefan Larson, Joseph J. Peper, Yunqi Zhang
User-friendly explanation production using generative adversarial networks

Patent number: 11042710

Abstract: A method of generating text using an adversarial network includes receiving a limited dataset. The limited dataset includes real data having actual parameters and actual sentences. The method includes receiving content data that includes a concept related to a portion of the real data or that causes an issue of the real data. The method includes generating relationships between the real data and the content data. The method includes embedding the content data with the real data in an encoder output that includes content vector embedding. The method includes generating an additional parameter set that includes additional parameters and one or more additional statements. The additional parameter set may be supplemental to the real data and configured to enhance an expressiveness of a model. The method includes generating explanatory statement based on the additional parameter set and the relationships.

Type: Grant

Filed: February 18, 2019

Date of Patent: June 22, 2021

Assignee: FUJITSU LIMITED

Inventors: Pouya Pezeshkpour, Ramya Malur Srinivasan, Ajay Chander

prev … 5 6 7 8 9 10 11 12 13 … next