Patents Examined by Jesse S Pullias

Adaptive semi-supervised learning for cross-domain sentiment classification

Patent number: 10817668

Abstract: Methods, systems, and computer-readable storage media for receiving a source domain data set including a set of source document and source label pairs, each source label corresponding to a source domain and indicating a sentiment attributed to a respective source document, receiving a target domain data set including a set of target documents absent target labels, processing documents of the source and target domains using a feature encoder of a DAS platform, to map the documents of the source and target domains to a shared feature space through feature representations, the processing including minimizing a distance between the feature representations of the source domain, and feature representations of the target domain based on a set of loss functions, providing an ensemble prediction from the processing, and providing predicted labels based on the ensemble prediction, the predicted labels being used by the sentiment classifier to classify documents from the target domain.

Type: Grant

Filed: November 26, 2018

Date of Patent: October 27, 2020

Assignee: SAP SE

Inventor: Ruidan He
Methods and systems for recognizing simultaneous speech by multiple speakers

Patent number: 10811000

Abstract: Systems and methods for a speech recognition system for recognizing speech including overlapping speech by multiple speakers. The system including a hardware processor. A computer storage memory to store data along with having computer-executable instructions stored thereon that, when executed by the processor is to implement a stored speech recognition network. An input interface to receive an acoustic signal, the received acoustic signal including a mixture of speech signals by multiple speakers, wherein the multiple speakers include target speakers. An encoder network and a decoder network of the stored speech recognition network are trained to transform the received acoustic signal into a text for each target speaker. Such that the encoder network outputs a set of recognition encodings, and the decoder network uses the set of recognition encodings to output the text for each target speaker. An output interface to transmit the text for each target speaker.

Type: Grant

Filed: April 13, 2018

Date of Patent: October 20, 2020

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: Jonathan Le Roux, Takaaki Hori, Shane Settle, Hiroshi Seki, Shinji Watanabe, John Hershey
Generating audio using neural networks

Patent number: 10803884

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output sequence of audio data that comprises a respective audio sample at each of a plurality of time steps. One of the methods includes, for each of the time steps: providing a current sequence of audio data as input to a convolutional subnetwork, wherein the current sequence comprises the respective audio sample at each time step that precedes the time step in the output sequence, and wherein the convolutional subnetwork is configured to process the current sequence of audio data to generate an alternative representation for the time step; and providing the alternative representation for the time step as input to an output layer, wherein the output layer is configured to: process the alternative representation to generate an output that defines a score distribution over a plurality of possible audio samples for the time step.

Type: Grant

Filed: April 22, 2019

Date of Patent: October 13, 2020

Assignee: DeepMind Technologies Limited

Inventors: Aaron Gerard Antonius van den Oord, Sander Etienne Lea Dieleman, Nal Emmerich Kalchbrenner, Karen Simonyan, Oriol Vinyals
Digital media environment for conversational image editing and enhancement

Patent number: 10796690

Abstract: Conversational image editing and enhancement techniques are described. For example, an indication of a digital image is received from a user. Aesthetic attribute scores for multiple aesthetic attributes of the image are generated. A computing device then conducts a natural language conversation with the user to edit the digital image. The computing device receives inputs from the user to refine the digital image as the natural language conversation progresses. The computing device generates natural language suggestions to edit the digital image based on the aesthetic attribute scores as part of the natural language conversation. The computing device provides feedback to the user that includes edits to the digital image based on the series of inputs. The computing device also includes as feedback natural language outputs indicating options for additional edits to the digital image based on the series of inputs and the previous edits to the digital image.

Type: Grant

Filed: August 22, 2018

Date of Patent: October 6, 2020

Assignee: Adobe Inc.

Inventors: Frieder Ludwig Anton Ganz, Walter Wei-Tuh Chang
Proxy for selective use of human and artificial intelligence in a natural language understanding system

Patent number: 10789943

Abstract: An interactive response system combines human intelligence (HI) subsystems with artificial intelligence (AI) subsystems to facilitate overall capability of multi-channel user interfaces. The system permits imperfect AI subsystems to nonetheless lessen the burden on HI subsystems. A combined AI and HI proxy is used to implement an interactive omnichannel system, and the proxy dynamically determines how many AI and HI subsystems are to perform recognition for any particular utterance, based on factors such as confidence thresholds of the AI recognition and availability of HI resources. Furthermore the system uses information from prior recognitions to automatically build, test, predict confidence, and maintain AI models and HI models for system recognition improvements.

Type: Grant

Filed: August 31, 2018

Date of Patent: September 29, 2020

Assignee: Interactions LLC

Inventors: Larissa Lapshina, Mahnoosh Mehrabani Sharifbad, David Thomson, Yoryos Yeracaris
Pictorial symbol prediction

Patent number: 10788900

Abstract: Symbol prediction can be implemented using a multi-task system trained for different tasks. The tasks may include a single symbol prediction, symbol category prediction, and symbol subcategory prediction. Categories of symbols can be generated by clustering sets of training data using a clustering scheme.

Type: Grant

Filed: June 29, 2018

Date of Patent: September 29, 2020

Assignee: Snap Inc.

Inventors: William Brendel, Francesco Barbieri, Xin Chen, Wei Chu, Venkata Satya Pradeep Karuturi, Luis Carlos Dos Santos Marujo, Leonardo Ribas Machado das Neves
Time-frequency convolutional neural network with bottleneck architecture for query-by-example processing

Patent number: 10777188

Abstract: A computing system determines whether a reference audio signal contains a query. A time-frequency convolutional neural network (TFCNN) comprises a time and frequency convolutional layers and a series of additional layers, which include a bottleneck layer. The computation engine applies the TFCNN to samples of a query utterance at least through the bottleneck layer. A query feature vector comprises output values of the bottleneck layer generated when the computation engine applies the TFCNN to the samples of the query utterance. The computation engine also applies the TFCNN to samples of the reference audio signal at least through the bottleneck layer. A reference feature vector comprises output values of the bottleneck layer generated when the computation engine applies the TFCNN to the samples of the reference audio signal. The computation engine determines at least one detection score based on the query feature vector and the reference feature vector.

Type: Grant

Filed: November 14, 2018

Date of Patent: September 15, 2020

Assignee: SRI International

Inventors: Julien van Hout, Vikramjit Mitra, Horacio Franco, Emre Yilmaz
Dynamic wakeword detection

Patent number: 10777189

Abstract: Techniques for using a dynamic wakeword detection threshold are described. A device detects a wakeword in audio data using a first wakeword detection threshold value. Thereafter, the device receives audio including speech. If the device receives the audio within a predetermined duration of time after detecting the previous wakeword, the device attempts to detect a wakeword in second audio data, corresponding to the audio including the speech, using a second, lower wakeword detection threshold value.

Type: Grant

Filed: December 5, 2017

Date of Patent: September 15, 2020

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Gengshen Fu, Shiv Naga Prasad Vitaladevuni, Paul McIntyre, Shuang Wu
Voiceprint update method, client, and electronic device

Patent number: 10777206

Abstract: Data update methods, systems, and devices are disclosed. The method includes: acquiring at least a first piece of audio data of a user in a first conversation scenario and at least a second piece of audio data of the user in a second conversation scenario, performing voiceprint recognition on the first and second pieces of audio data based on voiceprint information of the user, acquiring audio feature information of the first and second pieces of audio data, and updating voiceprint information of the user according to the audio feature information.

Type: Grant

Filed: June 15, 2018

Date of Patent: September 15, 2020

Assignee: Alibaba Group Holding Limited

Inventors: Gang Liu, Qingen Zhao, Guangxing Liu
Systems and methods for diagnosing problems from error logs using natural language processing

Patent number: 10776577

Abstract: Disclosed is a solution for diagnosing problems from logs used in an application development environment. A random sample of log statements is collected. The log statements can be completely unstructured and/or do not conform to any natural language. The log statements are tagged with predefined classifications. A natural language processing (NLP) classifier model is trained utilizing the log statements tagged with the predefined classification. New log statements can be classified into the plurality of predefined classifications utilizing the trained NLP classifier model. From the log statements thus classified, statements having a problem classification can be identified and presented through a dashboard running in a browser. Outputs from the trained NLP classifier model can be provided as input to another trained model for automatically and quickly identifying a type of problem associated with the statements, eliminating a need to manually sift through tens or hundreds of thousands of lines of logs.

Type: Grant

Filed: June 20, 2018

Date of Patent: September 15, 2020

Assignee: Open Text Corporation

Inventors: Ankur Sharma, Ravikanth Somayaji
Routing audio streams based on semantically generated result sets

Patent number: 10770094

Abstract: An example apparatus for routing audio streams includes an audio receiver to receive audio from a microphone. The apparatus also includes a classifier to semantically generate a result set based on the audio. The apparatus further includes a scheduler to select a spoken language understanding (SLU) engine based on the result set. The apparatus includes a router to route the audio to the selected SLU engine.

Type: Grant

Filed: January 9, 2018

Date of Patent: September 8, 2020

Assignee: Intel IP Corporation

Inventors: Munir Nikolai Alexander Georges, Jakub Nowicki
System and method for assessing security threats and criminal proclivities

Patent number: 10764427

Abstract: A centralized and robust threat assessment tool is disclosed to perform comprehensive analysis of previously-stored and subsequent communication data, activity data, and other relevant information relating to inmates within a controlled environment facility. As part of the analysis, the system detects certain keywords and key interactions with the dataset in order to identify particular criminal proclivities of the inmate. Based on the identified proclivities, the system assigns threat scores to inmate that represents a relative likelihood that the inmate will carry out or be drawn to certain threats and/or criminal activities. This analysis provides a predictive tool for assessing an inmate's ability to rehabilitate. Based on the analysis, remedial measures can be taken in order to correct an inmate's trajectory within the controlled environment and increase the likelihood of successful rehabilitation, as well as to prevent potential criminal acts.

Type: Grant

Filed: September 11, 2018

Date of Patent: September 1, 2020

Assignee: Global Tel*Link Corporation

Inventor: Mitch Volkart
Voice command triggered speech enhancement

Patent number: 10755697

Abstract: Received data representing speech is stored, and a trigger detection block detects a presence of data representing a trigger phrase in the received data. In response, a first part of the stored data representing at least a part of the trigger phrase is supplied to an adaptive speech enhancement block, which is trained on the first part of the stored data to derive adapted parameters for the speech enhancement block. A second part of the stored data, overlapping with the first part of the stored data, is supplied to the adaptive speech enhancement block operating with said adapted parameters, to form enhanced stored data. A second trigger phrase detection block detects the presence of data representing the trigger phrase in the enhanced stored data. In response, enhanced speech data are output from the speech enhancement block for further processing, such as speech recognition.

Type: Grant

Filed: April 24, 2019

Date of Patent: August 25, 2020

Assignee: Cirrus Logic, Inc.

Inventors: Robert James Hatfield, Michael Page
System and method for neural network based speaker classification

Patent number: 10755718

Abstract: A method for classifying speakers includes: receiving, by a speaker recognition system including a processor and memory, input audio including speech from a speaker; extracting, by the speaker recognition system, a plurality of speech frames containing voiced speech from the input audio; computing, by the speaker recognition system, a plurality of features for each of the speech frames of the input audio; computing, by the speaker recognition system, a plurality of recognition scores for the plurality of features; computing, by the speaker recognition system, a speaker classification result in accordance with the recognition scores; and outputting, by the speaker recognition system, the speaker classification result.

Type: Grant

Filed: December 7, 2017

Date of Patent: August 25, 2020

Inventors: Zhenhao Ge, Ananth N. Iyer, Srinath Cheluvaraja, Ram Sundaram, Aravind Ganapathiraju
Noise mitigation for a voice interface device

Patent number: 10748552

Abstract: A method at an electronic device with one or more microphones and a speaker, the electronic device configured to be responsive to any of a plurality of affordances including a voice-based affordance, includes determining background noise of an environment associated with the electronic device, and before detecting the voice-based affordance: determining whether the background noise would interfere with recognition of the hotword in voice inputs detected by the electronic device, and if so, indicating to a user to use an affordance other than the voice-based affordance.

Type: Grant

Filed: March 27, 2019

Date of Patent: August 18, 2020

Assignee: GOOGLE LLC

Inventor: Kenneth Mixter
Multi-user personalization at a voice interface device

Patent number: 10748543

Abstract: A method at an electronic device with one or more microphones and a speaker includes receiving a first voice input; comparing the first voice input to one or more voice models; based on the comparing, determining whether the first voice input corresponds to any of a plurality of occupants, and according to the determination, authenticating an occupant and presenting a response, or restricting functionality of the electronic device.

Type: Grant

Filed: March 27, 2019

Date of Patent: August 18, 2020

Assignee: GOOGLE LLC

Inventors: Kenneth Mixter, Diego Melendo Casado, Bibo Xu
User interface for correcting recognition errors

Patent number: 10741181

Abstract: Speech recognition is performed on a received utterance to determine a plurality of candidate text representations of the utterance, including a primary text representation and one or more alternative text representations. Natural language processing is performed on the primary text representation to determine a plurality of candidate actionable intents, including a primary actionable intent and one or more alternative actionable intents. A result is determined based on the primary actionable intent. The result is provided to the user. A recognition correction trigger is detected. In response to detecting the recognition correction trigger, a set of alternative intent affordances and a set of alternative text affordances are concurrently displayed.

Type: Grant

Filed: May 14, 2019

Date of Patent: August 11, 2020

Assignee: Apple Inc.

Inventors: Ashish Garg, Harry J. Saddler, Shweta Grampurohit, Robert A. Walker, Rushin N. Shah, Matthew S. Seigel, Matthias Paulik
Identifying entities in electronic medical records

Patent number: 10740561

Abstract: Disclosed herein are methods, systems, and apparatus, including computer programs encoded on computer storage media, for entity prediction. One of the methods includes performing word segmentation on text to be predicted to obtain a plurality of words. For each word of the plurality of words, a determination is made whether the word has a pre-trained word vector. In response to determining that the word has a pre-trained word vector, the pre-trained word vector for the word is obtained. In response to determining that the word does not have a pre-trained word vector, a word vector for the word is determined based on a pre-trained stroke vector. The word vector and the pre-trained stroke vector are trained based on a text sample and a word vector model. An entity associated with the text is predicted by inputting word vectors of the plurality of words into an entity prediction model.

Type: Grant

Filed: October 31, 2019

Date of Patent: August 11, 2020

Assignee: Alibaba Group Holding Limited

Inventors: Shaosheng Cao, Jun Zhou
Preventing initiation of a voice recognition session

Patent number: 10733990

Abstract: A method, a system, and a computer program product for preventing initiation of a voice recognition session. The method includes monitoring at least one audio output channel for at least one audio trigger phrase that initiates a voice recognition session. The method further includes in response to detecting the at least one audio trigger phrase on the at least one audio output channel, setting a logic state of at least one output trigger detector of the at least one audio output channel to a first state. The method further includes gating a logic state of at least one input trigger detector of at least one audio input channel to the first state for a time period and preventing initiation of a voice recognition session by the at least one audio trigger phrase on the at least one audio input channel while the logic state is the first state.

Type: Grant

Filed: January 30, 2018

Date of Patent: August 4, 2020

Assignee: Motorola Mobility LLC

Inventors: Robert A. Zurek, Pratik M. Kamdar, Jincheng Wu, Joel Clark
Multi-directional dialog

Patent number: 10733982

Abstract: Systems and processes for providing multi-directional dialog are provided. An example method includes, at an electronic device with one or more processors, receiving a first natural-language input; determining a first intent based on the first natural-language input, identifying a first dialog flow based on the first intent, outputting a natural-language output associated with the first dialog flow, receiving a second natural-language input; determining whether the second natural-language input satisfies dialog criteria associated with the first dialog flow, and in accordance with a determination that the second natural-language input satisfies the dialog criteria, outputting a second natural-language output associated with the first dialog flow.

Type: Grant

Filed: April 5, 2018

Date of Patent: August 4, 2020

Assignee: Apple Inc.

Inventors: Nicholas A. Grupen, Matthew E. Austin, Monica S. Ephrati, Kenneth H. Leung, Sebrand F. Warren, Philip T. Williams, Matthew Henderson

prev … 8 9 10 11 12 13 14 15 16 … next