Patents Examined by Matthew H Baker
-
Patent number: 10853579
Abstract: In one aspect, a method for goal-oriented dialog automation includes the step of receiving an input message. The method includes the step of implementing an entity tagging operation on the input message. The method includes the step of tagging the message context of the input message to generate a tagged message context. The method includes the step of implementing semantic frame extraction from the tagged message context. The method includes the step of implementing an entity interpretation on the extracted frame. The method includes the step of accessing a database to determine a business schedule and a client profile. The business schedule and the client profile are related to the input message. The method includes the step of implementing a retrieval engine. The retrieval engine obtains one or more response templates. The method includes the step of generating a ranked list of candidate templates from the output of the retrieval engine.
Type: Grant
Filed: October 25, 2018
Date of Patent: December 1, 2020
Inventors: Srivatsan Laxman, Devang Savita Ram Mohan, Supriya Rao
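The retrieval-and-ranking step can be sketched as follows. The patent does not publish code, so the tokenization, the entity lexicon, and the overlap-based ranking score are all illustrative assumptions, not the claimed implementation:

```python
def _tokens(text):
    """Lowercase word set with trailing punctuation stripped."""
    return {t.strip(".,!?") for t in text.lower().split()}

def tag_entities(message, entity_lexicon):
    """Return the lexicon entities that appear in the message."""
    return sorted(_tokens(message) & entity_lexicon)

def retrieve_and_rank(message, templates, entity_lexicon):
    """Score each response template by entity overlap; best first."""
    entities = set(tag_entities(message, entity_lexicon))
    scored = [(len(entities & _tokens(t)), t) for t in templates]
    # Highest-overlap candidates first, ties broken alphabetically.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [t for _, t in scored]
```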
-
Patent number: 10847141
Abstract: A dialogue system comprising: an input for receiving input data relating to a speech or text signal originating from a user; an output for outputting speech or text information specified by a dialogue act; and a processor configured to: update a belief state, the belief state comprising information corresponding to one or more dialogue options, each dialogue option comprising a slot and a corresponding slot value, based on the input signal; determine a dialogue act, wherein a dialogue act is determined by applying one or more rules to world state information, the world state comprising information relating to the dialogue, wherein rules are applied in two or more ordered stages for each dialogue turn, wherein one of the stages is a first update stage, comprising applying one or more further rules controlling updating of the world state information based on the belief state information, and another of the stages is an act selection stage, comprising determining the dialogue act by applying the one or more rule
Type: Grant
Filed: November 8, 2019
Date of Patent: November 24, 2020
Assignee: PolyAI Limited
Inventors: Matthew Steedman Henderson, Tsung-Hsien Wen, Pei-Hao Su, Nikola Mrksic, Ivan Vulic, Inigo Casanueva-Perez
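A minimal sketch of the two ordered stages per dialogue turn (a world-state update stage followed by an act-selection stage). The rule format, the dictionary-based states, and the first-rule-wins act selection are assumptions, not the patented rule engine:

```python
def run_turn(belief_state, world_state, update_rules, selection_rules):
    """One dialogue turn: apply rules in two ordered stages.

    Stage 1 updates the world state from the belief state; stage 2 picks
    the dialogue act from the updated world state.
    """
    # Stage 1: update rules fold belief-state information into the world state.
    for rule in update_rules:
        world_state = rule(belief_state, world_state)
    # Stage 2: the first selection rule that fires determines the dialogue act.
    for rule in selection_rules:
        act = rule(world_state)
        if act is not None:
            return world_state, act
    return world_state, ("none",)
```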
-
Patent number: 10839618
Abstract: The present disclosure provides a computer system, method, and computer-readable medium for a computer processor to determine whether to report a consumer message regarding a vehicle to a regulatory agency. The computer system receives a message from a consumer. The computer system applies the message to a machine learning classifier trained to determine whether a category of vehicle system is described in the message, the classifier having been trained using category definitions and corresponding parts and symptoms labeled with those definitions. The computer system determines whether the message includes a complaint based on an ontology defining a vehicle problem lexicon, a car part lexicon, and lexical patterns. The computer system extracts MVS terms from the message. The computer system determines to report the message if the message includes a complaint related to at least one category of vehicle system and includes a set of the MVS terms.
Type: Grant
Filed: July 12, 2018
Date of Patent: November 17, 2020
Assignee: HONDA MOTOR CO., LTD.
Inventors: Ravi Advani, Mithun Balakrishna, Tatiana Erekhinskaya
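The reporting decision can be illustrated with a toy lexicon check. The real system uses a trained classifier and an ontology, so the simple set intersections below are stand-ins, and the lexicons are hypothetical:

```python
def should_report(message, problem_lexicon, part_lexicon, mvs_terms, min_terms=1):
    """Report a consumer message when it pairs a problem word with a car
    part (a complaint) and mentions at least `min_terms` motor-vehicle-safety
    (MVS) terms. Returns (decision, matched MVS terms)."""
    tokens = set(message.lower().split())
    has_complaint = bool(tokens & problem_lexicon) and bool(tokens & part_lexicon)
    found = tokens & mvs_terms
    return has_complaint and len(found) >= min_terms, sorted(found)
```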
-
Patent number: 10839826
Abstract: A system, method, and computer product for extracting an activity from recordings. The method comprises searching for signals representing plural versions of a track, determining feature representations of the plural versions of the track identified in the searching, aligning the determined feature representations, and extracting a time-varying activity signal from the aligned feature representations.
Type: Grant
Filed: May 9, 2018
Date of Patent: November 17, 2020
Assignee: SPOTIFY AB
Inventors: Eric J. Humphrey, Andreas Jansson, Nicola Montecchio
-
Patent number: 10832049
Abstract: A method, system and computer-usable medium for classifying a source document using sub-documents identified in the source document. The method, system, and computer-usable medium are used to access the source document from electronic memory. The source document is electronically searched to detect markers indicative of whether the source document includes one or more sub-documents. Incongruities in the source document are located using the detected markers and the source document is split into sub-documents at the located incongruities. Each of the sub-documents is classified. The sub-documents are joined as a re-assembled source document with classifications including classifications for one or more of the sub-documents.
Type: Grant
Filed: May 31, 2018
Date of Patent: November 10, 2020
Assignee: International Business Machines Corporation
Inventors: Andrew R. Freed, Corville O. Allen
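A minimal sketch of the split/classify/rejoin flow, assuming a hypothetical marker string and a caller-supplied classifier; the patent's incongruity detection is more involved than a literal string split:

```python
def classify_by_parts(source, marker, classify):
    """Split a source document at marker lines, classify each sub-document,
    and return the re-assembled document plus (part, label) pairs."""
    parts = [p.strip() for p in source.split(marker) if p.strip()]
    labels = [classify(p) for p in parts]
    # Rejoin the sub-documents around the original marker.
    reassembled = ("\n" + marker + "\n").join(parts)
    return reassembled, list(zip(parts, labels))
```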
-
Patent number: 10777195
Abstract: A computing device includes a communication interface configured to interface and communicate with a communication system, an audio interface configured to interface and communicate with a user, a memory that stores operational instructions, and processing circuitry operably coupled to the communication interface, the audio interface, and the memory, configured to execute the operational instructions to perform various operations. The computing device monitors audio content, maintains a running buffer of the most recent audio content, and detects a wake word command of the user. When a wake word command is detected, the computing device processes the most recent audio content, including the wake word command, to determine whether the wake word command is valid based on that surrounding audio content. When invalid, the computing device rejects the wake word command and continues to monitor the audio content and maintain the running buffer.
Type: Grant
Filed: May 31, 2018
Date of Patent: September 15, 2020
Assignee: International Business Machines Corporation
Inventors: Jeremy R. Fox, Andrew R. Jones, Gregory J. Boss, John E. Moore, Jr.
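The running buffer and the validity check can be sketched with a fixed-size deque. The rejection heuristic here (a wake word immediately preceded by other speech is treated as mid-sentence and rejected) is an assumption, not the patented test:

```python
from collections import deque

class WakeWordMonitor:
    """Keep a running buffer of the most recent audio frames and validate a
    detected wake word against that buffered context."""

    def __init__(self, buffer_frames=16):
        # Old frames fall off automatically once the buffer is full.
        self.buffer = deque(maxlen=buffer_frames)

    def add_frame(self, frame):
        self.buffer.append(frame)

    def wake_word_valid(self, wake_word):
        frames = list(self.buffer)
        if wake_word not in frames:
            return False
        # Reject when speech immediately precedes the wake word
        # (assumed heuristic for mid-sentence use).
        i = frames.index(wake_word)
        return i == 0 or frames[i - 1] == "silence"
```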
-
Patent number: 10770063
Abstract: Techniques for a recursive deep-learning approach for performing speech synthesis using a repeatable structure that splits an input tensor into a left half and right half similar to the operation of the Fast Fourier Transform, performs a 1-D convolution on each respective half, performs a summation and then applies a post-processing function. The repeatable structure may be utilized in a series configuration to operate as a vocoder or perform other speech processing functions.
Type: Grant
Filed: August 22, 2018
Date of Patent: September 8, 2020
Assignees: Adobe Inc., The Trustees of Princeton University
Inventors: Zeyu Jin, Gautham J. Mysore, Jingwan Lu, Adam Finkelstein
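One repeatable block of the described structure might look like the following, assuming an even-length input and illustrative fixed kernels; in the real model the convolution weights and post-processing are learned:

```python
import numpy as np

def split_conv_sum(x, kernel_left, kernel_right, post=np.tanh):
    """One repeatable block: split the input in half (as the FFT does),
    run a 1-D convolution on each half, sum the halves, and apply a
    post-processing nonlinearity. Assumes len(x) is even so the halves
    line up for the summation."""
    half = len(x) // 2
    left = np.convolve(x[:half], kernel_left, mode="same")
    right = np.convolve(x[half:], kernel_right, mode="same")
    return post(left + right)
```

Stacking several such blocks in series gives the vocoder-style pipeline the abstract describes.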
-
Patent number: 10741192
Abstract: A method and an apparatus for estimating a speech signal in the split domain are disclosed. The method includes performing LP analysis on a noisy speech signal to generate a first plurality of LPCs and a first residual signal. The method also includes estimating the speech LPC spectrum to generate cleaned LPCs. The method further includes estimating the speech residual spectrum to generate a cleaned residual signal. The method also includes synthesizing output signals based on the cleaned LPCs and the cleaned residual signal.
Type: Grant
Filed: May 7, 2018
Date of Patent: August 11, 2020
Assignee: Qualcomm Incorporated
Inventors: Vivek Rajendran, Duminda Dewasurendra, Daniel Jared Sinder
-
Patent number: 10741171
Abstract: A device capable of splitting user input into phrases is presented. The disclosed device leverages multiple phrase splitting models to generate a set of possible split locations. Each model contributes its suggested split locations to the set of possible split locations according to an implementation of a phrase splitting kernel algorithm that weights each model's suggestions.
Type: Grant
Filed: July 8, 2019
Date of Patent: August 11, 2020
Assignee: NANTMOBILE, LLC
Inventors: Demitrios L. Master, Farzad Ehsani
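The weighted voting over model-suggested split locations can be sketched as below; the per-model weights and the acceptance threshold are assumptions standing in for the patented phrase splitting kernel:

```python
def propose_splits(text, models, threshold=1.0):
    """Combine split locations suggested by several phrase-splitting models.

    `models` is a list of (weight, model) pairs, where each model maps the
    text to a list of candidate split positions. Locations whose total
    weighted vote reaches the threshold are kept, sorted ascending.
    """
    votes = {}
    for weight, model in models:
        for loc in model(text):
            votes[loc] = votes.get(loc, 0.0) + weight
    return sorted(loc for loc, w in votes.items() if w >= threshold)
```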
-
Patent number: 10740381
Abstract: Embodiments are directed to a system, computer program product, and method for dynamic facet dictionary management. As one or more annotations are applied to a document collection, electronic text and associated facets are identified. Additional facets and facet values are identified and selectively applied to a knowledge base. A dictionary comprised of facets and associated facet values is constructed from the selective application. Application of the dictionary to the knowledge base identifies and returns a targeted document collection. Accordingly, facet mining and dictionary construction are dynamically applied to the knowledge base.
Type: Grant
Filed: July 18, 2018
Date of Patent: August 11, 2020
Assignee: International Business Machines Corporation
Inventors: Susumu Fukuda, Kenta Watanabe, Shunsuke Ishikawa, Takashi Fukuda
-
Patent number: 10726833
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating domain-specific speech recognition models for a domain of interest by combining and tuning existing speech recognition models when a speech recognizer does not have access to a speech recognition model for that domain of interest and when available domain-specific data is below a minimum desired threshold to create a new domain-specific speech recognition model. A system configured to practice the method identifies a speech recognition domain and combines a set of speech recognition models, each speech recognition model of the set being from a respective speech recognition domain. The system receives an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model, and tunes the combined speech recognition model for the speech recognition domain based on the data.
Type: Grant
Filed: May 21, 2018
Date of Patent: July 28, 2020
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Srinivas Bangalore, Robert Bell, Diamantino Antonio Caseiro, Mazin Gilbert, Patrick Haffner
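Combining and tuning can be illustrated with linearly interpolated unigram word models and a grid search over interpolation weights, scored by in-domain log-likelihood. The patented tuning procedure is not published, so this is only an assumed stand-in:

```python
import math

def interpolate_models(models, weights):
    """Linearly combine word-probability models (dicts) with the given weights."""
    combined = {}
    for model, w in zip(models, weights):
        for word, p in model.items():
            combined[word] = combined.get(word, 0.0) + w * p
    return combined

def tune_weights(models, domain_counts, candidate_weights):
    """Pick the weight vector whose interpolated model gives the small
    in-domain sample (word -> count) the highest log-likelihood."""
    def log_likelihood(model):
        return sum(c * math.log(model.get(w, 1e-12))
                   for w, c in domain_counts.items())
    return max(candidate_weights,
               key=lambda ws: log_likelihood(interpolate_models(models, ws)))
```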
-
Patent number: 10714083
Abstract: Aspects of the subject technology relate to a method for using a voice command for multiple computing devices. First voice input data is received from a first computing device associated with a user account, where the first voice input data comprises a first voice command captured at the first computing device. Second voice input data is received from a second computing device associated with the user account, where the second voice input data comprises a second voice command captured at the second computing device. An intended voice command is determined based on the obtained first and second voice input data. Based on the intended voice command, a first target computing device is determined. First instructions associated with the intended voice command are provided to the first target computing device for execution.
Type: Grant
Filed: May 15, 2017
Date of Patent: July 14, 2020
Assignee: Google LLC
Inventors: Jennifer Shien-Ming Chen, Alexander Friedrich Kuscher, Mitsuru Oshima
-
Patent number: 10713439
Abstract: A sentence generating apparatus includes an encoder configured to generate a first sentence embedding vector by applying trained result data to a first paraphrased sentence of an input sentence, an extractor configured to extract verification sentences in a preset range from the generated first sentence embedding vector, and a determiner configured to determine a similarity of the first paraphrased sentence to the input sentence based on comparing the verification sentences to the input sentence.
Type: Grant
Filed: March 28, 2017
Date of Patent: July 14, 2020
Assignee: Samsung Electronics Co., Ltd.
Inventors: Hoshik Lee, Hwidong Na
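The similarity determination can be sketched as a cosine check between embedding vectors. The cosine metric and the 0.8 threshold are assumptions; the patent compares against verification sentences extracted in a preset range rather than applying a single threshold:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def is_valid_paraphrase(input_vec, paraphrase_vec, threshold=0.8):
    """Accept the paraphrase when its embedding stays close enough to the
    input sentence's embedding (illustrative threshold)."""
    return cosine(input_vec, paraphrase_vec) >= threshold
```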
-
Patent number: 10678830
Abstract: Methods and apparatuses are described for automated computer text classification and routing using artificial intelligence transfer learning. A server trains a word embedding model using one-hot vectors of word pairs from a filtered first corpus of unstructured computer text and a filtered second corpus of unstructured computer text, using an artificial intelligence neural network. The server trains a long short-term memory model using vector matrices that correspond to sentences in the filtered second corpus of unstructured computer text, and labels. The server receives a message, generates a matrix for each sentence in the message by applying the trained word embedding model, generates one or more labels, and a probability for each label, for each sentence in the message by applying the trained long short-term memory model, and routes the message to a second client computing device based upon an assigned label.
Type: Grant
Filed: May 31, 2018
Date of Patent: June 9, 2020
Assignee: FMR LLC
Inventors: Pu Li, Chuanlu Yu, Hua Hao, Yu Zhang, Dong Han
-
Patent number: 10679628
Abstract: Provided is an electronic device that includes a first processor for receiving an audio signal, performing first voice recognition on the audio signal, and transferring a driving signal to a second processor based on a result of the first voice recognition. In response to the driving signal, the second processor performs second voice recognition based on either a voice signal produced by the first voice recognition or the audio signal.
Type: Grant
Filed: February 16, 2016
Date of Patent: June 9, 2020
Assignee: Samsung Electronics Co., Ltd
Inventors: Taejin Lee, Subhojit Chakladar, Sanghoon Lee, Kyungtae Kim, Yuna Kim, Junhui Kim, Eunhye Shin, Jaegeun Lee, Hyunwoong Lim
-
Patent number: 10657985
Abstract: Systems and methods are disclosed for displaying electronic multimedia content to a user. One computer-implemented method for manipulating electronic multimedia content includes generating, using a processor, a speech model and at least one speaker model of an individual speaker. The method further includes receiving electronic media content over a network; extracting an audio track from the electronic media content; and detecting speech segments within the electronic media content based on the speech model. The method further includes detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model.
Type: Grant
Filed: June 21, 2018
Date of Patent: May 19, 2020
Assignee: Oath Inc.
Inventors: Peter F. Kocks, Guoning Hu, Ping-Hao Wu
-
Patent number: 10650801
Abstract: Embodiments of the present disclosure provide a language recognition method and apparatus, a device, and a computer storage medium. In an aspect, after the Nth speech segment of the speech signal is received (N being 2, 3, 4, ...), language recognition is performed on the already-received N speech segments to obtain a score for each candidate language; if there exists a language whose score reaches the designated threshold, that language is considered the language matched with the speech signal. The technical solutions according to embodiments of the present disclosure therefore address the problem in the prior art that language recognition is too slow to be applied in scenarios where the recognition result is needed quickly.
Type: Grant
Filed: June 20, 2016
Date of Patent: May 12, 2020
Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
Inventors: Xiao Li, Chao Li, Yong Guan
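The early-exit scoring loop can be sketched as follows, with the per-segment scoring function left as an input; the segment granularity, the score accumulation, and the threshold value are assumptions:

```python
def recognize_language(segments, score_segment, threshold):
    """Accumulate per-language scores segment by segment and return the
    first language whose running score reaches the threshold, so a result
    can be emitted before the whole utterance has arrived.

    Returns (language, segments_consumed); (None, total) if no language
    ever reaches the threshold.
    """
    totals = {}
    for n, segment in enumerate(segments, start=1):
        for lang, s in score_segment(segment).items():
            totals[lang] = totals.get(lang, 0.0) + s
        if n >= 2:  # the method checks after the 2nd, 3rd, ... segment
            for lang, total in totals.items():
                if total >= threshold:
                    return lang, n
    return None, len(segments)
```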
-
Patent number: 10643602
Abstract: Methods, systems, and computer programs are presented for training, with adversarial constraints, a student model for speech recognition based on a teacher model. One method includes operations for training a teacher model based on teacher speech data, initializing a student model with parameters obtained from the teacher model, and training the student model with adversarial teacher-student learning based on the teacher speech data and student speech data. Training the student model with adversarial teacher-student learning further includes minimizing a teacher-student loss that measures a divergence of outputs between the teacher model and the student model; minimizing a classifier condition loss with respect to parameters of a condition classifier; and maximizing the classifier condition loss with respect to parameters of a feature extractor. The classifier condition loss measures errors caused by acoustic condition classification. Further, speech is recognized with the trained student model.
Type: Grant
Filed: March 16, 2018
Date of Patent: May 5, 2020
Assignee: Microsoft Technology Licensing, LLC
Inventors: Jinyu Li, Zhong Meng, Yifan Gong
-
Patent number: 10621679
Abstract: A method and system are disclosed for analyzing text affinity among a plurality of social media communications, comprising dividing a first social media communication into a first plurality of social media communication threads; dividing a second social media communication into a second plurality of social media communication threads; performing a text affinity analysis operation between respective threads of the first plurality and the second plurality of social media communication threads; and determining a level of intervention to perform based upon the text affinity analysis operation.
Type: Grant
Filed: July 18, 2016
Date of Patent: April 14, 2020
Assignee: Dell Products L.P.
Inventor: Prabir Majumder
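Thread-pair affinity and the intervention decision can be illustrated with Jaccard word overlap and two hypothetical thresholds; the patented affinity analysis is not specified at this level of detail:

```python
def thread_affinity(thread_a, thread_b):
    """Jaccard word overlap between two communication threads
    (an assumed stand-in for the patented text-affinity operation)."""
    a = set(thread_a.lower().split())
    b = set(thread_b.lower().split())
    return len(a & b) / len(a | b) if a | b else 0.0

def intervention_level(threads_a, threads_b, high=0.5, low=0.2):
    """Map the best cross-thread affinity to an intervention level."""
    best = max(thread_affinity(a, b) for a in threads_a for b in threads_b)
    if best >= high:
        return "merge"
    return "link" if best >= low else "none"
```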
-
Patent number: 10600432
Abstract: A system configured to perform power normalization for voice enhancement. The system may identify active intervals corresponding to voice activity and may selectively amplify the active intervals in order to generate output audio data at a near uniform loudness. The system may determine a variable gain for each of the active intervals based on a desired output loudness and a flatness value, which indicates how much a signal envelope is to be modified. For example, a low flatness value corresponds to no modification, with peak active interval values corresponding to the desired output loudness and lower active intervals being lower than the desired output loudness. In contrast, a high flatness value corresponds to extensive modification, with peak active interval values and lower active interval values both corresponding to the desired output loudness. Thus, individual words may share the same peak power level.
Type: Grant
Filed: March 28, 2017
Date of Patent: March 24, 2020
Assignee: Amazon Technologies, Inc.
Inventors: Wai Chung Chu, Carlo Murgia, Hyeong Cheol Kim
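The flatness-controlled gain can be sketched as a blend between a single uniform gain (the loudest interval hits the target and the envelope is preserved) and a per-interval gain (every interval is driven to the target and the envelope is flattened). The linear blend between the two extremes is an assumption:

```python
def interval_gains(peaks, target, flatness):
    """Per-interval gain for active intervals with given peak levels.

    flatness = 0: every interval gets the same gain, chosen so the loudest
    interval reaches the target (envelope unchanged).
    flatness = 1: each interval gets its own gain so it reaches the target
    itself (envelope flattened; all words share the same peak level).
    """
    loudest = max(peaks)
    uniform_gain = target / loudest
    return [(1 - flatness) * uniform_gain + flatness * (target / p)
            for p in peaks]
```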