Patents Examined by Douglas Godbold

Dynamically constructing and configuring a conversational agent learning model

Patent number: 11886823

Abstract: An approach is described with respect to dynamically constructing and configuring a conversational agent learning model. Various aspects of the conversational agent learning model may be constructed and updated without continuous intervention of a domain administrator. A method pertaining to such approach may include retrieving a corpus of information. The corpus of information may include records from a set of repositories and external data, including data from social networks or applications. The method further may include configuring the conversational agent learning model based upon the retrieved corpus of information. The method further may include deploying the conversational agent learning model by facilitating interaction between the conversational agent learning model and a plurality of clients. The method further may include updating the conversational agent learning model to address any modification to the corpus of information.

Type: Grant

Filed: February 1, 2018

Date of Patent: January 30, 2024

Assignee: International Business Machines Corporation

Inventors: Giuseppe Ciano, Pietro Marella, Leonardo Modeo, Luigi Pichetti
Vocal command recognition

Patent number: 11887584

Abstract: A method to detect a vocal command, the method including: analyzing audio data received from a transducer configured to convert audio into an electric signal and analyzing the data using a first neural network. The method also includes detecting a keyword from the audio data using the first neural network on the edge device, the first neural network being trained to recognize the keyword. The method further includes activating a second neural network after the keyword is identified by the first neural network and analyzing the audio data using the second neural network, the second neural network being trained to recognize a set of vocal commands. The method to detect a vocal command may also include detecting the vocal command word using the second neural network.

Type: Grant

Filed: June 18, 2021

Date of Patent: January 30, 2024

Assignee: STMicroelectronics S.r.l.

Inventors: Nunziata Ivana Guarneri, Viviana D'Alto
Method and apparatus for detecting similarity between multimedia information, electronic device, and storage medium

Patent number: 11887619

Abstract: A multimedia information processing method includes: parsing multimedia information to separate an audio from the multimedia information; converting the audio to obtain a mel spectrogram corresponding to the audio; determining, according to the mel spectrogram corresponding to the audio, an audio feature vector corresponding to the audio; and determining, based on an audio feature vector corresponding to a source audio in source multimedia information and an audio feature vector corresponding to a target audio in target multimedia information, a similarity between the target multimedia information and the source multimedia information.

Type: Grant

Filed: October 10, 2022

Date of Patent: January 30, 2024

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Yurong Yang, Xuyuan Xu, Guoping Gong, Yang Fang
Method and device for fusing voiceprint features, voice recognition method and system, and storage medium

Patent number: 11875799

Abstract: A method and device for fusing voiceprint features. The method includes: obtaining at least two voiceprint features of a voice sample of a target speaker (S3; S4); fusing the at least two voiceprint features on the basis of linear discriminant analysis (S5). The present method introduces a technique employing linear discriminant analysis to fuse various voiceprint features, so as to improve complementarities between the various voiceprint features and distinctions between the fused features, thereby increasing the recognition rate for target speakers and reducing the misrecognition rate for non-target speakers in voiceprint authentication scenarios, and providing personalized and improved user experience.

Type: Grant

Filed: July 23, 2019

Date of Patent: January 16, 2024

Assignee: SOUNDAI TECHNOLOGY CO., LTD.

Inventors: Xiaoliang Chen, Dahang Feng, Shaowei Su, Le Chang
Zero-shot cross-lingual transfer learning

Patent number: 11875131

Abstract: Providing a predictive model for a target language by determining an instance weight for a labeled source language textual unit according to a set of unlabeled target language textual units, scaling, by the one or more computer processors, an error between a predicted label for the source language textual unit and a ground-truth label for the source language textual unit according to the instance weight, updating, by the one or more computer processors, network parameters of a predictive neural network model for the target language according to the error, and providing, by the one or more computer processors, the predictive neural network model for the target language to a user.

Type: Grant

Filed: September 16, 2020

Date of Patent: January 16, 2024

Assignee: International Business Machines Corporation

Inventors: Zihui Li, Yunyao Li, Prithviraj Sen, Huaiyu Zhu
Low-resource entity resolution with transfer learning

Patent number: 11875253

Abstract: Methods, systems, and computer program products for low-resource entity resolution with transfer learning are provided herein. A computer-implemented method includes processing input data via a first entity resolution model, wherein the input data comprise labeled input data and unlabeled input data; identifying one or more portions of the unlabeled input data to be used in training a neural network entity resolution model, wherein said identifying comprises applying one or more active learning algorithms to the first entity resolution model; training, using (i) the one or more portions of the unlabeled input data and (ii) one or more deep learning techniques, the neural network entity resolution model; and performing one or more entity resolution tasks by applying the trained neural network entity resolution model to one or more datasets.

Type: Grant

Filed: June 17, 2019

Date of Patent: January 16, 2024

Assignee: International Business Machines Corporation

Inventors: Jungo Kasai, Kun Qian, Sairam Gurajada, Yunyao Li, Lucian Popa
Audio encoder, audio decoder, methods for encoding and decoding an audio signal, audio stream and a computer program

Patent number: 11869521

Abstract: An encoder for providing an audio stream on the basis of a transform-domain representation of an input audio signal includes a quantization error calculator configured to determine a multi-band quantization error over a plurality of frequency bands of the input audio signal for which separate band gain information is available. The encoder also includes an audio stream provider for providing the audio stream such that the audio stream includes information describing an audio content of the frequency bands and information describing the multi-band quantization error. A decoder for providing a decoded representation of an audio signal on the basis of an encoded audio stream representing spectral components of frequency bands of the audio signal includes a noise filler for introducing noise into spectral components of a plurality of frequency bands to which separate frequency band gain information is associated on the basis of a common multi-band noise intensity value.

Type: Grant

Filed: May 17, 2021

Date of Patent: January 9, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Nikolaus Rettelbach, Bernhard Grill, Guillaume Fuchs, Stefan Geyersberger, Markus Multrus, Harald Popp, Juergen Herre, Stefan Wabnik, Gerald Schuller, Jens Hirschfeld
Apparatus and method for automatic generation and update of knowledge graph from multi-modal sources

Patent number: 11869484

Abstract: The present invention provides an apparatus and method for automatic generation and update of a knowledge graph from multi-modal sources. The apparatus comprises a conversation parsing module configured for updating a dynamic information word set VD with labelled words generated from extracted from the multi-modal sources; updating a static information word set VS based on extracted schema of relations extracted from the multi-modal sources; and generating pairs of question and answer based on the dynamic information word set VD, the static information word set VS and the one or more sentence patterns; and a knowledge graph container configured for updating a knowledge graph based on the extracted entities of interest and schema of relations. Therefore, an efficient and cost-effective way for question decomposition, query chain construction and entity association from unstructured data is achieved.

Type: Grant

Filed: August 27, 2021

Date of Patent: January 9, 2024

Assignee: Hong Kong Applied Science and Technology Research Institute Company Limited

Inventors: Yunzhao Lu, Wai On Sam Lam, Man Choi Asa Chan
Adapting an utterance cut-off period based on parse prefix detection

Patent number: 11862162

Abstract: A processing system detects a period of non-voice activity and compares its duration to a cutoff period. The system adapts the cutoff period based on parsing previously-recognized speech to determine, according to a model, such as a machine-learned model, the probability that the speech recognized so far is a prefix to a longer complete utterance. The cutoff period is longer when a parse of previously recognized speech has a high probability of being a prefix of a longer utterance.

Type: Grant

Filed: March 18, 2022

Date of Patent: January 2, 2024

Assignee: SoundHound, Inc.

Inventors: Patricia Pozon Aguayo, Jennifer Hee Young Zhang, Jonah Probell
Systems and methods for creating dynamic features for correlation engines

Patent number: 11848010

Abstract: Systems for analyzing and categorizing audio content that has been transcribed into text are provided. The systems include at least one machine that has a central processing unit, random access memory, a correlation module, a feature abstraction module, and at least one database. The correlation module is configured to receive written transcripts (each of which has been generated from audio content) and derive a correlation between each written transcript and one or more attributes. The feature abstraction module is configured to receive instructions that identify specific words within the written transcripts; replace the specific words with surrogate words; and associate correlative meanings with each of the surrogate words. The database is configured to receive, record, and make accessible to the feature abstraction module a table of specific words, each of which is associated with corresponding surrogate words and correlative meanings associated with each of the surrogate words.

Type: Grant

Filed: November 4, 2021

Date of Patent: December 19, 2023

Inventors: Walter Bachtiger, Bruce Ramsay
Systems and methods for converting an input content item based on contexts

Patent number: 11847416

Abstract: Methods and systems for converting an input content item into an output content item to enhance comprehension of the message by an interlocutor, based on contexts. For example, the conversion may occur in any message service: when an interlocutor writes a message in English (or any other language), he or she might include a regional dialect (purposively or not), such as a piece of slang, that the other interlocutors may not understand, although they all generally write and understand English. In such circumstances, the regional dialect is identified and replaced with either a more globalized word or with another linguistic regionalism that is understandable for the intended interlocutor.

Type: Grant

Filed: December 1, 2020

Date of Patent: December 19, 2023

Assignee: Rovi Guides, Inc.

Inventors: Lakhan Tanaji Kadam, Srishti Sharma
Virtual assistant activation

Patent number: 11842734

Abstract: At an electronic device with a display, a microphone, and an input device: while the display is on, receiving user input via the input device, the user input meeting a predetermined condition; in accordance with receiving the user input meeting the predetermined condition, sampling audio input received via the microphone; determining whether the audio input comprises a spoken trigger; and in accordance with a determination that audio input comprises the spoken trigger, triggering a virtual assistant session.

Type: Grant

Filed: July 23, 2021

Date of Patent: December 12, 2023

Assignee: Apple Inc.

Inventors: Stephen O. Lemay, Brandon J. Newendorp, Jonathan R. Dascola
Computing services using embeddings of a transformer-based encoder

Patent number: 11842738

Abstract: Techniques are described and relate to providing computing services using embeddings of a transformer-based encoder. In an example, a computer system generates, by using a machine learning (ML) transformer, an embedding vector based at least in part on text. The computer system stores the embedding vector and an association between the embedding vector and the text in a data store. Further, the computer system determines that a task is to be performed based at least in part on natural language understanding (NLU) of the text. The computer system receives the embedding vector from the data store based at least in part on the association between the embedding vector and the text. The task is performed based at least in part on the embedding vector after being received from the data store.

Type: Grant

Filed: March 22, 2021

Date of Patent: December 12, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Wenbo Yan, Ruiqi Luo, Prathap Ramachandra, Jingqian Zhao, Kyung Jae Lee, Liu Yang
Systems and methods for secure authentication based on machine learning techniques

Patent number: 11837239

Abstract: A system described herein may provide a technique for the use of machine learning techniques to perform authentication, such as biometrics-based user authentication. For example, user biometric information (e.g., facial features, fingerprints, voice, etc.) of a user may be used to train a machine learning model, in addition to a noise vector. A representation of the biometric information (e.g., an image file including a picture of the user's face, an encoded file with vectors or other representation of the user's fingerprint, a sound file including the user's voice, etc.) may be iteratively transformed until the transformed biometric information matches the noise vector, and the machine learning model may be trained based on the set of transformations that ultimately yield the noise vector, when given the biometric information.

Type: Grant

Filed: September 22, 2022

Date of Patent: December 5, 2023

Assignee: Verizon Patent and Licensing Inc.

Inventors: Sumanth S. Mallya, Corbin Pierce Moline, Saravanan Mallesan
Frame error concealment

Patent number: 11837240

Abstract: A frame error concealment method based on frames including transform coefficient vectors including the following steps: It tracks sign changes between corresponding transform coefficients of predetermined sub-vectors of consecutive good stationary frames. It accumulates the number of sign changes in corresponding sub-vectors of a predetermined number of consecutive good stationary frames. It reconstructs an erroneous frame with the latest good stationary frame, but with reversed signs of transform coefficients in sub-vectors having an accumulated number of sign changes that exceeds a predetermined threshold.

Type: Grant

Filed: January 7, 2022

Date of Patent: December 5, 2023

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Sebastian Näslund, Volodya Grancharov, Jonas Svedberg
Method, system, and computer program for artificial intelligence answer

Patent number: 11822890

Abstract: Provided is an artificial intelligence (AI) answering system including a user question receiver configured to receive a user question from a user terminal; a first question extender configured to generate a question template by analyzing the user question and determine whether the user question and the generated question template match; a second question extender configured to generate a similar question template by using a natural language processing and a deep learning model when the user question and the generated question template do not match; a training data builder configured to generate training data for training the second question extender by using an neural machine translation (NMT) engine; and a question answering unit configured to transmit a user question result derived through the first question extender or the second question extender to the user terminal.

Type: Grant

Filed: June 6, 2022

Date of Patent: November 21, 2023

Assignee: 42 Maru Inc.

Inventor: Dong Hwan Kim
Correcting speech misrecognition of spoken utterances

Patent number: 11823664

Abstract: Implementations can receive audio data corresponding to a spoken utterance of a user, process the audio data to generate a plurality of speech hypotheses, determine an action to be performed by an automated assistant based on the speech hypotheses, and cause the computing device to render an indication of the action. In response to the computing device rendering the indication, implementations can receive additional audio data corresponding to an additional spoken utterance of the user, process the additional audio data to determine that a portion of the spoken utterance is similar to an additional portion of the additional spoken utterance, supplant the action with an alternate action, and cause the automated assistant to initiate performance of the alternate action. Some implementations can determine whether to render the indication of the action based on a confidence level associated with the action.

Type: Grant

Filed: November 8, 2022

Date of Patent: November 21, 2023

Assignee: GOOGLE LLC

Inventors: Matthew Sharifi, Victor Carbune
Management of a focused information sharing dialogue based on discourse trees

Patent number: 11809825

Abstract: Disclosed techniques relate to managing a dialogue between a user device and an autonomous agent. For instance, a computing device creates a discourse tree from a body of text that includes fragments. The discourse tree includes a root node, nonterminal nodes, and terminal nodes. Each nonterminal node represents a rhetorical relationship between two of the fragments and each terminal node is associated with one of the fragments. The computing device determines a main topic of the body of text from the discourse tree. The computing device provides the main topic to the user device, and in response, receives a user utterance. The computing device determines an intent from the user utterance and navigates the discourse tree to locate relevant information consistent with the intent.

Type: Grant

Filed: April 20, 2021

Date of Patent: November 7, 2023

Assignee: Oracle International Corporation

Inventor: Boris Galitsky
Natural language text generation using semantic objects

Patent number: 11809832

Abstract: Techniques for generating natural language text with a natural language generation (NLG) system using a plurality of semantic objects including a first semantic object. The techniques include: obtaining a first specification of the first semantic object, the first specification specifying a first set of one or more data variables, first attributes, a first vocabulary, and a first document structure configuration; obtaining, from at least one data store, first data related to the first set of data variables; determining values of at least some of the first set of data variables using the first data; generating the natural language text including first natural language text, using the first specification, the values of at least some of the first set of data variables; and outputting the generated natural language text.

Type: Grant

Filed: August 18, 2022

Date of Patent: November 7, 2023

Assignee: YSEOP SA

Inventors: Raphaël François André Salmon, Alain Kaeser, Bernard Paul Rémy Légaut
Method, apparatus and device for voiceprint recognition of original speech, and storage medium

Patent number: 11798563

Abstract: A method for voiceprint recognition of an original speech is used to reduce information losses and system complexity of a model for data recognition of a speaker's original speech. The method includes: obtaining original speech data, and segmenting the original speech data based on a preset time length to obtain segmented speech data; performing tail-biting convolution processing and discrete Fourier transform on the segmented speech data through a preset convolution filter bank to obtain voiceprint feature data; pooling the voiceprint feature data through a preset deep neural network to obtain a target voiceprint feature; performing embedded vector transformation on the target voiceprint feature to obtain corresponding voiceprint feature vectors; and performing calculation on the voiceprint feature vectors through a preset loss function to obtain target voiceprint data, where the loss function includes a cosine similarity matrix loss function and a minimum mean square error matrix loss function.

Type: Grant

Filed: August 26, 2020

Date of Patent: October 24, 2023

Assignee: PING AN TECHNOLOGY (SHENZHEN) CO., LTD.

Inventors: Yuechao Guo, Yixuan Qiao, Yijun Tang, Jun Wang, Peng Gao, Guotong Xie

prev 1 2 3 4 5 6 7 … next