Patents Examined by Bhavesh M. Mehta

Audio processor and method for generating a frequency enhanced audio signal using pulse processing

Patent number: 11776554

Abstract: An audio processor for generating a frequency enhanced audio signal from a source audio signal has: an envelope determiner for determining a temporal envelope of at least a portion of the source audio signal; an analyzer for analyzing the temporal envelope to determine temporal values of certain features of the temporal envelope; a signal synthesizer for generating a synthesis signal, the generating having placing pulses in relation to the determined temporal values, wherein the pulses are weighted using weights derived from amplitudes of the temporal envelope related to the temporal values, where the pulses are placed; and a combiner for combining at least a band of the synthesis signal that is not included in the source audio signal and the source audio signal to obtain the frequency enhanced audio signal.

Type: Grant

Filed: May 27, 2021

Date of Patent: October 3, 2023

Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.

Inventors: Sascha Disch, Michael Sturm
Signal processing

Patent number: 11776538

Abstract: A key word detection apparatus and a method for low-power voice-activated devices are presented. A first signal processing module operates with a first transducer to receive an incoming signal and generates a first sample. A second signal processing module operates with a second transducer which receives an incoming signal and generates a second sample. In summary, a signal processing system, in particular a key word detection system, has a first low power module that wakes up a second higher power module. The second module uses signals from the first module in order to improve accuracy of key word detection or other signal processing tasks.

Type: Grant

Filed: April 1, 2019

Date of Patent: October 3, 2023

Assignee: Dialog Semiconductor B.V.

Inventors: Niels Schutten, Wessel Harm Lubberhuizen
Adversarial bootstrapping for multi-turn dialogue model training

Patent number: 11775770

Abstract: Systems described herein may use machine classifiers to perform a variety of natural language understanding tasks including, but not limited to multi-turn dialogue generation. Machine classifiers in accordance with aspects of the disclosure may model multi-turn dialogue as a one-to-many prediction task. The machine classifier may be trained using adversarial bootstrapping between a generator and a discriminator with multi-turn capabilities. The machine classifiers may be trained in both auto-regressive and traditional teacher-forcing modes, with the maximum likelihood loss of the auto-regressive outputs being weighted by the score from a metric-based discriminator model. The discriminators input may include a mixture of ground truth labels, the teacher-forcing outputs of the generator, and/or negative examples from the dataset. This mixture of input may allow for richer feedback on the autoregressive outputs of the generator.

Type: Grant

Filed: May 21, 2020

Date of Patent: October 3, 2023

Assignee: Capital One Services, LLC

Inventors: Oluwatobi Olabiyi, Erik T. Mueller
Context-aware hardware-based voice activity detection

Patent number: 11776562

Abstract: Certain aspects of the present disclosure provide a method for performing voice activity detection, including: receiving audio data from an audio source of an electronic device; generating a plurality of model input features using a hardware-based feature generator based on the received audio data; providing the plurality of model input features to a hardware-based voice activity detection model; receiving an output value from the hardware-based voice activity detection model; and determining a presence of voice activity in the audio data based on the output value.

Type: Grant

Filed: May 29, 2020

Date of Patent: October 3, 2023

Assignee: QUALCOMM Incorporated

Inventors: Ren Li, Xiaofei Chen, Murray Jarvis
Communication issue detection using evaluation of multiple machine learning models

Patent number: 11769520

Abstract: Techniques are provided for evaluating multiple machine learning models to identify issues with a communication. One method comprises applying an audio signal associated with a communication to at least two of: (i) a trigger word analysis module that evaluates contextual information to determine if a trigger word is detected in the audio signal; (ii) an audio activity pattern analysis module that determines if a silence pattern anomaly is detected; and (iii) a communication application analysis module that evaluates features provided by a communication application relative to applicable thresholds; and combining results of the at least two of the trigger word analysis module, the audio activity pattern analysis module and the communication application analysis module to identify a communication issue. The combining may evaluate an accuracy of the trigger word analysis module, the audio activity pattern analysis module and/or the communication application analysis module to combine the results.

Type: Grant

Filed: August 17, 2020

Date of Patent: September 26, 2023

Assignee: EMC IP Holding Company LLC

Inventors: Idan Richman Goshen, Shiri Gaber
Performing utterance detection using convolution

Patent number: 11769491

Abstract: A system configured to perform utterance detection using data processing techniques that are similar to those used for object detection is provided. For example, the system may treat utterances within audio data as analogous to an object represented within an image and employ techniques to separate and identify individual utterances. The system may include one or more trained models that are trained to perform utterance detection. For example, the system may include a first module to process input audio data and identify whether speech is represented in the input audio data, a second module to apply convolution filters, and a third module configured to determine a boundary identifying a beginning and ending of a portion of the input audio data along with an utterance score indicating how closely the portion of the input audio data represents an utterance.

Type: Grant

Filed: September 29, 2020

Date of Patent: September 26, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Abhishek Bafna, Haithem Albadawi
Semantic consistency of explanations in explainable artificial intelligence applications

Patent number: 11769080

Abstract: A computer-implemented method in accordance with one embodiment includes, in response to a submission of an input dataset to an artificially intelligent application, receiving an explanation from each module of the application. The modules are configured within the application in a serial sequence in which each module, upon receiving the input dataset and any input generated by an immediately preceding module of the serial sequence, generates output that is forwarded as input to a next module, if any, in the sequence. A determination is made that at least two of the received explanations are semantically inconsistent.

Type: Grant

Filed: July 14, 2022

Date of Patent: September 26, 2023

Assignee: Kyndryl, Inc.

Inventors: Sreekrishnan Venkateswaran, Debasisha Padhi, Shubhi Asthana, Anuradha Bhamidipaty, Ashish Kundu
User interface disambiguation

Patent number: 11769015

Abstract: Systems, methods, and computer programming products for alleviating ambiguity amongst the terms and language displayed by the user interface of software products and services. The disclosed solutions catalog terms displayed by the UI of software and services and identify where overlapping terms with the same or substantially similar term names are presented by the UI but have different meanings than the software most familiar to the user. Natural language processing is leveraged to derive meanings of software terms using the context of the surrounding words and text elements within the UI, as well as product documentation, error messages, sentiment and other textual clues. Ambiguity among overlapping terms is alleviated by modifying the UI, highlighting differences in term definitions from the software or services a user is most familiar with using, and updating the UI in a manner that differentiates the overlapping terms displayed by accessed products or services.

Type: Grant

Filed: April 1, 2021

Date of Patent: September 26, 2023

Assignee: International Business Machines Corporation

Inventors: Amy Travis, Laura Janet Rodriguez, Sara Beth Weber, Brittany Bogle, Smriti Talwar, Brent Alan Miller
Voice conversation analysis method and apparatus using artificial intelligence

Patent number: 11769492

Abstract: The present invention relates to a voice conversation analysis apparatus and a method therefor and, more specifically, to: a voice conversation analysis apparatus categorizing voices generated during a voice conversation so as to predict required functions and further analyzing the voices so as to provide proper functions; and a method therefor. In addition, disclosed are: an artificial intelligence (AI) system for simulating the functions of recognition, decision-making, and the like of the human brain by using a machine learning algorithm; and an application thereof.

Type: Grant

Filed: March 26, 2019

Date of Patent: September 26, 2023

Assignee: Samsung Electronics Co., Ltd.

Inventors: Changhan Kim, Bowon Kim, Jinsuk Lee, Hyeontaek Lim, Yangwook Kim, Guiwon Seo, Jonghwa Lee
Cascade pooling for natural language processing

Patent number: 11763094

Abstract: Natural language processing systems and methods are disclosed herein. In some embodiments, digital document information comprising text is received. The digital document information may be processed through word and character encoding operations to generate word and character vectors while retaining document location information for the words and characters. The data may be then be processed by a series of convolution and maximum pooling operations to obtain maximum valued elements from the data. The document location information as well as the maximum values element data may be further processed for semantic classification of the data using a semantic classifier and bounding box regression.

Type: Grant

Filed: May 13, 2021

Date of Patent: September 19, 2023

Assignee: SAP SE

Inventor: Christian Reisswig
Hot-word free pre-emption of automated assistant response presentation

Patent number: 11756533

Abstract: The presentation of an automated assistant response may be selectively pre-empted in response to a hot-word free utterance that is received during the presentation and that is determined to be likely directed to the automated assistant. The determination that the utterance is likely directed to the automated assistant may be performed, for example, using an utterance classification operation that is performed on audio data received during presentation of the response, and based upon such a determination, the response may be pre-empted with another response associated with the later-received utterance. In addition, the duration that is used to determine when a session should be terminated at the conclusion of a conversation between a user and an automated assistant may be dynamically controlled based upon when the presentation of a response has completed.

Type: Grant

Filed: May 15, 2020

Date of Patent: September 12, 2023

Assignee: GOOGLE LLC

Inventors: Pu-sen Chao, Alex Fandrianto
Sound signal generation method, generative model training method, sound signal generation system, and recording medium

Patent number: 11756558

Abstract: A computer-implemented sound signal generation method includes: obtaining a first sound source spectrum of a sound signal to be generated; obtaining a first spectral envelope of the sound signal; and estimating fragment data representative of samples of the sound signal based on the obtained first sound source spectrum and the obtained first spectral envelope.

Type: Grant

Filed: August 18, 2021

Date of Patent: September 12, 2023

Assignee: YAMAHA CORPORATION

Inventors: Jordi Bonada, Merlijn Blaauw, Ryunosuke Daido
Auto generation of conversational artifacts from specifications

Patent number: 11748559

Abstract: A conversational interface generation method, system, and computer program product that includes determining a conversational artifact for a computer program from a specification of the computer program and generating a conversational interface for the computer program based on the conversational artifact for the computer program included in the specification.

Type: Grant

Filed: March 24, 2021

Date of Patent: September 5, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Yara Rizk, Vatche Isahagian, Yasaman Khazaeni, Scott Boag, Falk Pollok
System and methods for training task-oriented dialogue (TOD) language models

Patent number: 11749264

Abstract: Embodiments described herein provide methods and systems for training task-oriented dialogue (TOD) language models. In some embodiments, a TOD language model may receive a TOD dataset including a plurality of dialogues and a model input sequence may be generated from the dialogues using a first token prefixed to each user utterance and a second token prefixed to each system response of the dialogues. In some embodiments, the first token or the second token may be randomly replaced with a mask token to generate a masked training sequence and a masked language modeling (MLM) loss may be computed using the masked training sequence. In some embodiments, the TOD language model may be updated based on the MLM loss.

Type: Grant

Filed: November 3, 2020

Date of Patent: September 5, 2023

Assignee: Salesforce, Inc.

Inventors: Chien-Sheng Wu, Chu Hong Hoi, Richard Socher, Caiming Xiong
Electronic apparatus and control method thereof

Patent number: 11748594

Abstract: An electronic apparatus, including a memory configured to store a first artificial intelligence model; and a processor connected to the memory and configured to: based on receiving an input audio signal, obtain an input frequency spectrum image representing a frequency spectrum of the input audio signal, input the input frequency spectrum image to the first artificial intelligence model, obtain an output frequency spectrum image from the first artificial intelligence model, obtain an output audio signal based on the output frequency spectrum image, wherein the first artificial intelligence model is trained based on a target learning image, and wherein the target learning image represents a target frequency spectrum of a specific style, and is obtained from a second artificial intelligence model based on a random value.

Type: Grant

Filed: September 29, 2020

Date of Patent: September 5, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Anant Baijal, Jeongrok Jang
System and method for query authorization and response generation using machine learning

Patent number: 11748569

Abstract: Systems, methods, and computer-readable storage media for responding to a query using a neural network and natural language processing. If necessary, the system can request disambiguation, then parse the query using a trained machine-learning classifier, resulting in at least one of an identified subject or an identified domain of the text query. The system can determine if the user is authorized to retrieve answers to the query and, if so, retrieve factual data associated with the query. The system can then retrieve a response template, and fill in the template with the retrieved facts. The system can then determine, by executing a machine comprehension model on the filled response template, a probable readability token, portion of text, of at least a portion of the filled response template and, upon identifying that the probable readability is above a threshold, reply to the text query with the at least a portion of the filled response template.

Type: Grant

Filed: December 20, 2022

Date of Patent: September 5, 2023

Assignee: ADP, INC.

Inventors: Guilherme Gomes, Bruno Apel, Jarismar Silva, Vincent Kellers, Roberto Dias, Roberto Masiero, Roberto Silveira
Machine learning-based real-time guest rider identification

Patent number: 11741400

Abstract: Techniques for automatically detecting when a ride requester has requested a ride-share ride on behalf of a guest rider using some or all of the communications between the driver and ride requester are described herein. For example, a server can obtain chat logs between a ride requester and a driver and process the chat logs to identify whether the ride requester has requested a ride on behalf of a guest rider. In particular, the server can train an artificial intelligence model (e.g., a machine learning model) to predict potential guest rider behavior. Once trained, the server can obtain chat logs comprising chat messages sent between a driver and a ride requester, and apply a representation of the chat logs as an input to the trained artificial intelligence model to determine whether guest rider behavior is detected.

Type: Grant

Filed: December 18, 2020

Date of Patent: August 29, 2023

Assignee: Beijing DiDi Infinity Technology and Development Co., Ltd.

Inventors: Conghui Fu, Zihan Yi, Zetian Ni, Xin Chen
Systems and methods for automatic speech recognition based on graphics processing units

Patent number: 11741967

Abstract: An automatic speech recognition system and a method thereof are provided. The system includes an encoder and a decoder. The encoder comprises a plurality of encoder layers. At least one encoder layer includes a plurality of encoder sublayers fused into one or more encoder kernels. The system further comprises a first pair of ping-pong buffers communicating with the one or more encoder kernels. The decoder comprises a plurality of decoder layers. At least one decoder layer includes a plurality of decoder sublayers fused into one or more decoder kernels. The decoder receives a decoder output related to the encoder output and generates a decoder output. The encoder sends the decoder output to a beam search kernel.

Type: Grant

Filed: January 4, 2021

Date of Patent: August 29, 2023

Assignee: KWAI INC.

Inventors: Yongxiong Ren, Heng Liu, Yang Liu, Lingzhi Liu, Jie Li, Yuanyuan Zhao, Xiaorui Wang
Automatically modifying responses from generative models using artificial intelligence techniques

Patent number: 11741296

Abstract: Methods, systems, and computer program products for automatically modifying responses from generative models using artificial intelligence techniques are provided herein. A computer-implemented method includes obtaining data pertaining to at least one conversation involving at least one automated conversation exchange software program and at least one user; identifying, among words proposed by the at least one automated conversation exchange software program in connection with the at least one conversation, words qualifying as belonging to one or more predetermined categories by processing the obtained data using artificial intelligence techniques; determining, by processing the identified words and at least one word-based data source, one or more alternate words; modifying at least a portion of the proposed words by replacing at least a portion of the identified words with at least a portion of the one or more alternate words; and performing at least one automated action based on the modifying.

Type: Grant

Filed: February 18, 2021

Date of Patent: August 29, 2023

Assignee: International Business Machines Corporation

Inventors: Nishtha Madaan, Naveen Panwar, Deepak Vijaykeerthy, Pranay Kumar Lohia, Diptikalyan Saha
System and method for passive subject specific monitoring

Patent number: 11741986

Abstract: A method includes obtaining, by an electronic device, an audio segment comprising one or more audio events of a target subject. The method also includes extracting, by the electronic device, audio embeddings from the one or more audio events using an embedding model, the embedding model comprising a trained machine learning model. The method further includes comparing, by the electronic device, the extracted audio embeddings with a match profile of the target subject, the match profile generated during an enrollment stage. The method also includes generating, by the electronic device, a label for the audio segment based on whether or not the extracted audio embeddings match the match profile, wherein the label enables correlation of the audio segment with the target subject for monitoring a health condition of the target subject.

Type: Grant

Filed: August 20, 2020

Date of Patent: August 29, 2023

Assignee: Samsung Electronics Co., Ltd.

Inventors: Korosh Vatanparvar, Tousif Ahmed, Viswam Nathan, Ebrahim Nematihosseinabadi, Md Mahbubur Rahman, Jilong Kuang, Jun Gao

prev 1 2 3 4 5 6 7 8 … next