Patents Examined by Michael Colucci
-
Patent number: 11127398Abstract: The embodiment of the disclosure provides a method for voice controlling, a terminal device, a cloud server and a system. The method includes: receiving voice information that the user performs voice controlling on a terminal device; transmitting voice information to the cloud server, so that the cloud server determines, according to the voice information, a voice control and a control instruction that match the voice information in the current interface, and generates a corresponding voice control instruction; receiving the voice control instruction transmitted by the cloud server; and controlling, according to the voice control instruction, a corresponding voice control of the terminal device to perform an operation. The method of the embodiments of the present disclosure achieves controlling over the controls in the interface through the voice, which deepens the controlling degree of the voice over the terminal device, and improves the user experience.Type: GrantFiled: December 28, 2018Date of Patent: September 21, 2021Assignees: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., SHANGHAI XIAODU TECHNOLOGY CO. LTD.Inventors: Lichao Xu, Yushu Cao, Lishang Xiao, Lifeng Zhao, Xiangdong Xue, Ji Zhou
-
Patent number: 11120223Abstract: Disclosed are systems, methods, and non-transitory computer-readable media for probabilistic word embeddings for text classification. A text classification system receives a message including a keyword and determines an embedding probability distribution representing the keyword. The text classification system then determines an embedding value for the keyword based on the embedding probability distribution. The text classification system uses the embedding value as input into a set of mathematical functions, yielding a first set of coefficient values for the keyword. Each respective mathematical function from the set corresponds to a respective classification label from a set of classification labels and defines a continuous surface.Type: GrantFiled: June 18, 2019Date of Patent: September 14, 2021Assignee: SAP SEInventors: Gil Katz, Mathis Lamarre
-
Patent number: 11119725Abstract: A control module, for lighting or other environmental control or the like has embedded voice responsive control capability. The recognizable command set supported by the control module is customizable. Software for a data processor configures the control module to identify a user in the vicinity of the control module and change voice response software based at least in part on identity of the user. The changed voice response software configures the control module, to recognize a customized set of vocal commands, e.g. associated with the identified user, in response to speech detected via a microphone and an audio processor of the control module, without requiring network communication. The control module issues control signals to equipment controlled by the module that correspond to the recognized commands of the customized vocal command set.Type: GrantFiled: September 27, 2018Date of Patent: September 14, 2021Assignee: ABL IP HOLDING LLCInventors: Sean P. White, Daniel M. Megginson, Januk Aggarwal, David P. Ramer
-
Patent number: 11115528Abstract: A technology is described for providing virtual call assistants and call options to identified callers. An example method may include identifying a caller and a call recipient using addressing information included in a call received at a call control service. After identifying the caller, a priority designation assigned to the caller may be obtained from an agent linking profile. An interactive agent linked to the priority designation assigned to the caller may be identified, wherein the interactive agent may be configured to provide prioritized call services. Thereafter, the interactive agent linked to the priority designation may be invoked.Type: GrantFiled: January 25, 2018Date of Patent: September 7, 2021Assignee: Amazon Technologies, Inc.Inventors: Siddhartha Shankara Rao, Samuel Rislove Etler, Carlin Wiegner
-
Patent number: 11107475Abstract: An exemplary automatic speech recognition (ASR) system may receive an audio input including a segment of speech. The segment of speech may be independently processed by general ASR and domain-specific ASR to generate multiple ASR results. A selection between the multiple ASR results may be performed based on respective confidence levels for the general ASR and domain-specific ASR. As incremental ASR is performed, a composite result may be generated based on general ASR and domain-specific ASR.Type: GrantFiled: May 9, 2019Date of Patent: August 31, 2021Assignee: Rovi Guides, Inc.Inventor: Jeffry Copps Robert Jose
-
Patent number: 11107460Abstract: Embodiments are associated with a speaker-independent acoustic model capable of classifying senones based on input speech frames and on first parameters of the speaker-independent acoustic model, a speaker-dependent acoustic model capable of classifying senones based on input speech frames and on second parameters of the speaker-dependent acoustic model, and a discriminator capable of receiving data from the speaker-dependent acoustic model and data from the speaker-independent acoustic model and outputting a prediction of whether received data was generated by the speaker-dependent acoustic model based on third parameters.Type: GrantFiled: July 2, 2019Date of Patent: August 31, 2021Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Zhong Meng, Jinyu Li, Yifan Gong
-
Patent number: 11100932Abstract: An end detector configured to receive the feature data and detect an end point of a keyword, and a start detector configured to receive an indication of the detected end point and process the feature data associated with corresponding input frames to detect a start point of the keyword. The start detector and end detector comprise neural networks trained through a process using a cross-entropy cost function for non-Region of Target (ROT) frames and a One-Spike Connectionist Temporal Classification cost function for ROT frames.Type: GrantFiled: December 20, 2019Date of Patent: August 24, 2021Assignee: SYNAPTICS INCORPORATEDInventors: Saeed Mosayyebpour, Francesco Nesta, Trausti Thormundsson
-
Patent number: 11087213Abstract: A classification training system for binary and multi-class classification comprises a neural network operable to perform classification of input data, a training dataset including pre-segmented, labeled training samples, and a classification training module operable to train the neural network using the training dataset. The classification training module includes a forward pass processing module, and a backward pass processing module. The backward pass processing module is operable to determine whether a current frame is in a region of target (ROT), determine ROT information such as beginning and length of the ROT and update weights and biases using a cross-entropy cost function and One Spike Connectionist Temporal Classification (OSCTC) cost function. The backward pass module further computes a soft target value using ROT information and computes a signal output error using the soft target value and network output value.Type: GrantFiled: December 20, 2019Date of Patent: August 10, 2021Assignee: SYNAPTICS INCORPORATEDInventors: Saeed Mosayyebpour, Trausti Thormundsson, Francesco Nesta
-
Patent number: 11087748Abstract: The systems and methods of the present disclosure generally relate to a data processing system that can identify and surface alternative requests when presented with ambiguous, unclear, or other requests to which a data processing system may not be able to respond. The data processing system can improve the efficiency of network transmissions to reduce network bandwidth usage and processor utilization by selecting alternative requests that are responsive to the intent of the original request.Type: GrantFiled: May 11, 2018Date of Patent: August 10, 2021Assignee: GOOGLE LLCInventors: Gleb Skobeltsyn, Mihaly Kozsevnyikov, Vladimir Vuskovic
-
Patent number: 11080600Abstract: An acoustic event detection and classification system includes a start-end point detector and multi-class acoustic event classification. A classification training system comprises a neural network configured to perform classification of input data, a training dataset including pre-segmented, labeled training samples, and a classification training module configured to train the neural network using the training dataset. The classification training module includes a forward pass processing module, and a backward pass processing module. The backward pass processing module is configured to determine whether a current frame is in a region of target (ROT), determine ROT information such as beginning and length of the ROT and update weights and biases using a cross-entropy cost function and a many-or-one detection (MOOD) cost function. The backward pass module further computes a soft target value using ROT information and computes a signal output error using the soft target value and network output value.Type: GrantFiled: December 20, 2019Date of Patent: August 3, 2021Assignee: SYNAPTICS INCORPORATEDInventors: Saeed Mosayyebpour, Trausti Thormundsson
-
Patent number: 11069352Abstract: Described herein is a system for media presence detection in audio. The system analyzes audio data to recognize whether a given audio segment contains sounds from a media source as a way of differentiating recorded media source sounds from other live sounds. In exemplary embodiments, the system includes a hierarchical model architecture for processing audio data segments, where individual audio data segments are processed by a trained machine learning model operating locally, and another trained machine learning model provides historical and contextual information to determine a score indicating the likelihood that the audio data segment contains sounds from a media source.Type: GrantFiled: February 18, 2019Date of Patent: July 20, 2021Assignee: Amazon Technologies, Inc.Inventors: Qingming Tang, Ming Sun, Chieh-Chi Kao, Chao Wang, Viktor Rozgic
-
Patent number: 11062093Abstract: A system includes at least one processor to perform natural language processing on text from at least one document and assign the at least one document to at least one subjective wellbeing dimension by comparing the text from the at least one document with a subjective wellbeing dimension filter for each subjective wellbeing dimension, insert the at least one document into at least one bin, each bin associated with a particular subjective wellbeing dimension, and analyze each document in each bin associated with the particular subjective wellbeing dimension to determine a score for each subjective wellbeing dimension and an overall score that is based on each score for each subjective wellbeing dimension.Type: GrantFiled: February 18, 2019Date of Patent: July 13, 2021Assignee: TSG Technologies, LLCInventors: Anthony L Hinrichs, Andrea E DiGiovanni, Willem S Maritz, Anthony M Sardella
-
Patent number: 11062697Abstract: A device includes a processor configured to, in response to determining that an input phrase includes a first term that is included in a term hierarchy, generate a second phrase by replacing the first term in the input phrase with a second term included in the term hierarchy. The processor is configured to determine that interactive response (IR) training data indicates that the input phrase is associated with a user intent indicator. The processor is configured to determine that user interaction data indicates that a first proportion of user phrases received by an IR system correspond to the user intent indicator. The processor is configured to update speech-to-text training data based on the input phrase and the second phrase so that a second proportion of training phrases of the speech-to-text training data correspond to the user intent indicator. The second proportion is based on the first proportion. A speech-to-text model is trained based on the speech-to-text training data.Type: GrantFiled: October 29, 2018Date of Patent: July 13, 2021Assignee: International Business Machines CorporationInventors: Edward G. Katz, Alexander C. Tonetti, John A. Riendeau, Sean T. Thatcher
-
Patent number: 11056105Abstract: Embodiments of the present invention provide systems, methods, and computer storage media directed to providing talk back automation for applications installed on a mobile device. To do so actions (e.g., talk back features) can be created, via the digital assistant, by recording a series of events that are typically provided by a user of the mobile device when manually invoking the desired action. At a desired state, the user may select an object that represents the output of the application. The recording embodies the action and can be associated with a series of verbal commands that the user would typically announce to the digital assistant when an invocation of the action is desired. In response, the object is verbally communicated to the user via the digital assistant, a different digital assistant, or even another device. Alternatively, the object may be communicated to the same application or another application as input.Type: GrantFiled: March 26, 2018Date of Patent: July 6, 2021Assignee: AIQUDO, INCInventors: Mark Robinson, Matan Levi, Kiran Bindhu Hemaraj, Rajat Mukherjee
-
Patent number: 11056101Abstract: A method for training hotword detection includes receiving a training input audio sequence including a sequence of input frames that define a hotword that initiates a wake-up process on a device. The method also includes feeding the training input audio sequence into an encoder and a decoder of a memorized neural network. Each of the encoder and the decoder of the memorized neural network include sequentially-stacked single value decomposition filter (SVDF) layers. The method further includes generating a logit at each of the encoder and the decoder based on the training input audio sequence. For each of the encoder and the decoder, the method includes smoothing each respective logit generated from the training input audio sequence, determining a max pooling loss from a probability distribution based on each respective logit, and optimizing the encoder and the decoder based on all max pooling losses associated with the training input audio sequence.Type: GrantFiled: December 10, 2019Date of Patent: July 6, 2021Assignee: Google LLCInventors: Raziel Alvarez Guevara, Hyun Jin Park, Patrick Violette
-
Patent number: 11055487Abstract: Of the four primary approaches to processing language by computer, only the parsing approach considers the semantic and syntactic components from the start. In doing so, however, the required resources expand rapidly as the scope of the language processed increases. And as that scope increases, the performance of parsing systems decreases. A natural language processor uses a tumbling-frequency phrase chain parser as described herein which circumvents this resource-intensive step in parsing, while quickly and almost effortlessly arriving at the next step in natural-language processing with far more accurate results involving a partitioning dictionary and phrase chains.Type: GrantFiled: April 22, 2020Date of Patent: July 6, 2021Assignee: QwikIntelligence, Inc.Inventors: William Randolph Ford, Alfred Rives Berkeley, III, Mark Alexander Newman
-
Patent number: 11049499Abstract: A method includes receiving information from each device in multiple devices of one user, where the information includes a device type, determining, according to at least the received information, a device responding to a voice instruction in the multiple devices, and sending a message to a device different from the device responding to a voice instruction in the multiple devices, where the message is used to instruct a voice assistant of the different device not to respond to the voice instruction.Type: GrantFiled: August 1, 2016Date of Patent: June 29, 2021Assignee: HUAWEI TECHNOLOGIES CO., LTD.Inventors: Xiaojuan Li, Yahui Wang, Wenmei Gao
-
Patent number: 11049497Abstract: Described techniques select portions of an audio stream for transmission to a trained machine learning application, which generates response recommendations in real-time. This real-time response is facilitated by the system identifying, selecting and transmitting those portions of the audio stream likely to be most relevant to the conversation. Portions of an audio stream less likely to be relevant to the conversation are identified accordingly and not transmitted. The system may identify the relevant portions of an audio stream by detecting events in a contemporaneous event stream, use a trained machine learning model to identify events in an audio stream, or both.Type: GrantFiled: October 29, 2020Date of Patent: June 29, 2021Assignee: CRESTA INTELLIGENCE INC.Inventors: Tianlin Shi, Kenneth George Oetzel
-
Patent number: 11043208Abstract: Systems and methods for intelligently training a subject machine learning model includes identifying new observations comprising a plurality of distinct samples unseen by a target model during a prior training; creating an incremental training corpus based on randomly sampling a collection of training data samples that includes a plurality of new observations and a plurality of historical training data samples used in the prior training of the target model; implementing a first training mode that includes an incremental training of the target model using samples from the incremental training corpus as model training input; computing performance metrics of the target model based on the incremental training; evaluating the performance metrics of the target model against training mode thresholds; and selectively choosing based on the evaluation one of maintaining the first training mode and automatically switching to a second training mode that includes a full retraining of the target model.Type: GrantFiled: February 19, 2021Date of Patent: June 22, 2021Assignee: Clinc, Inc.Inventors: Daniel C. Michelin, Jonathan K. Kummerfeld, Kevin Leach, Stefan Larson, Joseph J. Peper, Yunqi Zhang
-
Patent number: 11042710Abstract: A method of generating text using an adversarial network includes receiving a limited dataset. The limited dataset includes real data having actual parameters and actual sentences. The method includes receiving content data that includes a concept related to a portion of the real data or that causes an issue of the real data. The method includes generating relationships between the real data and the content data. The method includes embedding the content data with the real data in an encoder output that includes content vector embedding. The method includes generating an additional parameter set that includes additional parameters and one or more additional statements. The additional parameter set may be supplemental to the real data and configured to enhance an expressiveness of a model. The method includes generating explanatory statement based on the additional parameter set and the relationships.Type: GrantFiled: February 18, 2019Date of Patent: June 22, 2021Assignee: FUJITSU LIMITEDInventors: Pouya Pezeshkpour, Ramya Malur Srinivasan, Ajay Chander