Patents Examined by Paras D Shah
  • Patent number: 12217750
    Abstract: Input context for a statistical dialog manager may be provided. Upon receiving a spoken query from a user, the query may be categorized according to at least one context clue. The spoken query may then be converted to text according to a statistical dialog manager associated with the category of the query and a response to the spoken query may be provided to the user.
    Type: Grant
    Filed: January 21, 2022
    Date of Patent: February 4, 2025
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Michael Bodell, John Bain, Robert Chambers, Karen M. Cross, Michael Kim, Nick Gedge, Daniel Frederick Penn, Kunal Patel, Edward Mark Tecot, Jeremy C. Waltmunson
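The categorize-then-transcribe flow in the abstract can be sketched as a simple router. This is a hypothetical illustration, not the patented implementation; the category names and model identifiers are invented for the example.

```python
# Hypothetical sketch: pick a category-specific dialog model from
# context clues, then route the spoken query to that model.

CONTEXT_MODELS = {
    "navigation": "nav_dialog_model",
    "media": "media_dialog_model",
    "general": "general_dialog_model",
}

def categorize(context_clues):
    """Pick a category from context clues (e.g. active app, location)."""
    for clue in context_clues:
        if clue in CONTEXT_MODELS:
            return clue
    return "general"

def route_query(context_clues):
    """Return the dialog model name that should transcribe the query."""
    return CONTEXT_MODELS[categorize(context_clues)]

print(route_query(["media", "driving"]))   # media_dialog_model
print(route_query(["unknown_clue"]))       # general_dialog_model
```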
  • Patent number: 12217016
    Abstract: An electronic apparatus, including a microphone; a memory configured to store at least one instruction; and a processor configured to: acquire a first token corresponding to a first user voice input in a first language acquired through the microphone, acquire a first text in a second language by inputting the first token into a first neural network model, acquire a feature value corresponding to a predicted subsequent token, which is predicted to be uttered after the first token, by inputting the first text into a second neural network model, and based on a second token being acquired subsequent to the first token, acquire a second text in the second language by inputting the first token, the second token, the first text, and the feature value into the first neural network model.
    Type: Grant
    Filed: May 17, 2022
    Date of Patent: February 4, 2025
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Beomseok Lee, Sathish Indurthi, Mohd Abbas Zaidi, Nikhil Kumar
  • Patent number: 12217756
    Abstract: This disclosure relates generally to systems, methods, and computer readable media for providing improved insights and annotations to enhance recorded audio, video, and/or written transcriptions of testimony. For example, in some embodiments, a method is disclosed for correlating non-verbal cues recognized from an audio and/or video recording of testimony to the corresponding testimony transcript locations. In other embodiments, a method is disclosed for providing testimony-specific artificial intelligence-based insights and annotations to a testimony transcript, e.g., based on the use of machine learning, natural language processing, and/or other techniques. In still other embodiments, a method is disclosed for providing smart citations to a testimony transcript, e.g., which track the location of semantic constructs within the transcript over the course of various modifications being made to the transcript.
    Type: Grant
    Filed: September 2, 2021
    Date of Patent: February 4, 2025
    Assignee: AUDAX PRIVATE DEBT LLC
    Inventors: Robert Ackerman, Anthony J. Vaglica, Holli Goldman, Amber Hickman, Walter Barrett, Cameron Turner, Shawn Rutledge
  • Patent number: 12216996
    Abstract: Embodiments are provided for generating a reasonable language model learning for text data in a knowledge graph in a computing system by a processor. One or more data sources and one or more triples may be analyzed from a knowledge graph. Training data having one or more candidate labels associated with one or more of the triples may be generated. One or more reasonable language models may be trained based on the training data.
    Type: Grant
    Filed: November 2, 2021
    Date of Patent: February 4, 2025
    Assignee: International Business Machines Corporation
    Inventors: Thanh Lam Hoang, Dzung Tien Phan, Gabriele Picco, Lam Minh Nguyen, Vanessa Lopez Garcia
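The step of turning knowledge-graph triples into labeled training data might look like the following verbalization sketch. The format of the triples and labels is an assumption for illustration only.

```python
# Hypothetical sketch: verbalize (subject, relation, object) triples
# into (text, candidate-label) pairs that could seed model training.

def triples_to_training_data(triples):
    """Turn each triple into a sentence labeled with its relation."""
    examples = []
    for subj, rel, obj in triples:
        text = f"{subj} {rel.replace('_', ' ')} {obj}."
        examples.append({"text": text, "candidate_label": rel})
    return examples

data = triples_to_training_data([("Dublin", "capital_of", "Ireland")])
print(data[0]["text"])  # Dublin capital of Ireland.
```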
  • Patent number: 12219154
Abstract: In some embodiments, an exemplary inventive system for improving computer speed and accuracy of automatic speech transcription includes at least components of: a computer processor configured to perform: generating a recognition model specification for a plurality of distinct speech-to-text transcription engines; where each distinct speech-to-text transcription engine corresponds to a respective distinct speech recognition model; receiving at least one audio recording representing a speech of a person; segmenting the audio recording into a plurality of audio segments; determining a respective distinct speech-to-text transcription engine to transcribe a respective audio segment; receiving, from the respective transcription engine, a hypothesis for the respective audio segment; accepting the hypothesis to remove a need to submit the respective audio segment to another distinct speech-to-text transcription engine, resulting in the improved computer speed and the accuracy of automatic speech transcription and gen…
    Type: Grant
    Filed: November 30, 2022
    Date of Patent: February 4, 2025
    Assignee: VOXSMART LIMITED
    Inventors: Tejas Shastry, Matthew Goldey, Svyat Vergun
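The routing-and-acceptance idea in this abstract can be sketched as follows. Everything here (the spec format, the confidence threshold, the fallback policy) is an assumption made for illustration, not the claimed method.

```python
# Hypothetical sketch: send each audio segment to one engine chosen by
# a model spec; accept its hypothesis when the confidence clears a
# threshold, avoiding a call to any other engine.

def pick_engine(segment, spec):
    """Choose an engine name by a segment attribute (e.g. domain)."""
    return spec.get(segment["domain"], spec["default"])

def transcribe(segments, spec, engines, threshold=0.8):
    """engines: name -> callable(segment) -> (hypothesis, confidence)."""
    out = []
    for seg in segments:
        name = pick_engine(seg, spec)
        hyp, conf = engines[name](seg)
        if conf >= threshold:              # accept: skip other engines
            out.append(hyp)
        else:                              # fall back to default engine
            hyp, _ = engines[spec["default"]](seg)
            out.append(hyp)
    return out
```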
  • Patent number: 12217741
    Abstract: A method for implementing a privacy-preserving automatic speech recognition system using federated learning. The method includes receiving, from respective client devices, at a cloud server, local acoustic model weights for a neural network-based acoustic model of a local automatic speech recognition system running on the respective client devices, wherein the local acoustic model weights are generated at the respective client devices without labelled data, updating a global automatic speech recognition system based on (a) the local acoustic model weights received from the respective client devices and (b) global acoustic model weights of the global automatic speech recognition system derived from labelled data to obtain an updated global automatic speech recognition system, and sending the updated global automatic speech recognition system to the respective client devices to operate as a new local automatic speech recognition system.
    Type: Grant
    Filed: May 19, 2021
    Date of Patent: February 4, 2025
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Sylvain Le Groux, Erwan Barry Tarik Zerhouni
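The server-side update described here blends client weights with the server's labelled-data weights. A minimal FedAvg-style sketch, with the blending fraction and flat weight vectors as illustrative assumptions:

```python
# Hypothetical sketch: average client acoustic-model weights, then
# blend with the global model's labelled-data weights.

def fedavg(client_weights, global_weights, global_frac=0.5):
    """Blend averaged client weights with the global model's weights.

    client_weights: list of per-client weight vectors (lists of floats)
    global_weights: server weights trained on labelled data
    """
    n = len(client_weights)
    client_avg = [sum(ws) / n for ws in zip(*client_weights)]
    return [
        global_frac * g + (1 - global_frac) * c
        for g, c in zip(global_weights, client_avg)
    ]

print(fedavg([[0.0, 2.0], [2.0, 4.0]], [1.0, 1.0]))  # [1.0, 2.0]
```

The updated vector would then be sent back to every client as its new local model, closing the federated loop.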
  • Patent number: 12210826
    Abstract: A method of presenting prompt information by utilizing a neural network which includes a BERT model and a graph convolutional neural network (GCN), comprising: generating a first vector based on a combination of an entity, a context of the entity, a type of the entity and a part of speech of the context by using BERT model; generating a second vector based on each of predefined concepts by using BERT model; generating a third vector based on a graph which is generated based on the concepts and relationships thereamong, by using GCN; generating a fourth vector by concatenating the second and third vectors; calculating semantic similarity between the entity and each concept based on the first and fourth vectors; determining, based on the first vector and the semantic similarity, that the entity corresponds to one of the concepts; and generating the prompt information based on the determined concept.
    Type: Grant
    Filed: March 16, 2022
    Date of Patent: January 28, 2025
    Assignee: FUJITSU LIMITED
    Inventors: Yiling Cao, Zhongguang Zheng, Jun Sun
  • Patent number: 12211491
    Abstract: One or more computer processors obtain an initial subnetwork at a target sparsity and an initial pruning mask from a pre-trained self-supervised learning (SSL) speech model. The one or more computer processors finetune the initial subnetwork, comprising: the one or more computer processors zero out one or more masked weights in the initial subnetwork specified by the initial pruning mask; the one or more computer processors train a new subnetwork from the zeroed out subnetwork; the one or more computer processors prune one or more weights of lowest magnitude in the new subnetwork regardless of network structure to satisfy the target sparsity. The one or more computer processors classify an audio segment with the finetuned subnetwork.
    Type: Grant
    Filed: May 9, 2022
    Date of Patent: January 28, 2025
    Assignee: International Business Machines Corporation
    Inventors: Cheng-I Lai, Yang Zhang, Kaizhi Qian, Chuang Gan, James R. Glass, Alexander Haojan Liu
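The pruning step — dropping the lowest-magnitude weights regardless of network structure to hit a target sparsity — can be sketched on a flat weight list. This toy version ignores the mask bookkeeping and training loop of the actual method.

```python
# Hypothetical sketch: unstructured magnitude pruning to a target
# sparsity, zeroing the smallest-|w| fraction of weights.

def prune_to_sparsity(weights, sparsity):
    """Zero out the lowest-magnitude fraction of weights."""
    k = int(len(weights) * sparsity)          # number of weights to drop
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    dropped = set(order[:k])
    return [0.0 if i in dropped else w for i, w in enumerate(weights)]

print(prune_to_sparsity([0.5, -0.1, 2.0, 0.05], 0.5))  # [0.5, 0.0, 2.0, 0.0]
```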
  • Patent number: 12204867
    Abstract: Provided is a computer-implemented method, system, and computer program product for process mining asynchronous support conversations using attributed directly follows graphing. A processor may collect a plurality of conversation threads from an asynchronous data stream. The processor may label each utterance of a plurality of utterances from the plurality of conversation threads with an event label. The processor may analyze the event label for each utterance of the plurality of utterances. The processor may generate, based on the analyzing of the event label for each utterance, an attributed directly follows graph (DFG).
    Type: Grant
    Filed: March 22, 2022
    Date of Patent: January 21, 2025
    Assignee: International Business Machines Corporation
    Inventors: Sampath Dechu, Monika Gupta, Prerna Agarwal, Renuka Sindhgatta Rajan, Naveen Eravimangalath Purushothaman
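A directly-follows graph counts how often one event immediately follows another. A minimal sketch of that core construction, with the event labels invented for the example:

```python
# Hypothetical sketch: build a directly-follows graph (DFG) by counting
# immediate label-to-label transitions within each conversation thread.

from collections import Counter

def build_dfg(threads):
    """threads: list of event-label sequences; returns edge counts."""
    edges = Counter()
    for labels in threads:
        for a, b in zip(labels, labels[1:]):
            edges[(a, b)] += 1
    return edges

dfg = build_dfg([["greet", "ask", "resolve"], ["greet", "ask", "escalate"]])
print(dfg[("greet", "ask")])  # 2
```

The patented method additionally attributes the graph (hence "attributed DFG"); attaching per-edge metadata to these counts would be a natural extension of this structure.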
  • Patent number: 12198682
    Abstract: An example system includes a processor to receive a summary of a conversation to be generated. The processor can input the summary into a trained summary-grounded conversation generator. The processor can receive a generated conversation from the trained summary-grounded conversation generator.
    Type: Grant
    Filed: September 13, 2021
    Date of Patent: January 14, 2025
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Chulaka Gunasekara, Guy Feigenblat, Benjamin Sznajder, Sachindra Joshi
  • Patent number: 12198703
    Abstract: An audio signal encoding method and device are provided. The method and device are used to encode an audio signal to obtain a bitstream representing the analog audio signal, in which a proper bit allocation for spectral coefficients can be performed.
    Type: Grant
    Filed: February 16, 2022
    Date of Patent: January 14, 2025
    Assignee: Top Quality Telephony, LLC
    Inventors: Zexin Liu, Bin Wang, Lei Miao
  • Patent number: 12190893
    Abstract: A system for registering an individual's biometric data and then later verifying the identity of the individual using the previously registered biometric data makes use of either an audio communications channel or a messaging channel, both of which are accessed via an application programming interface (API). In some instances, spoken audio input is received from the individual and the spoken audio input is used to generate a voice print for the individual. In other instances, the biometric data could be image-based, such as facial images or an image of an individual's iris.
    Type: Grant
    Filed: June 11, 2021
    Date of Patent: January 7, 2025
    Assignee: Vonage Business Inc.
    Inventors: Mark Berkeland, Angel Esteban Garcia
  • Patent number: 12183344
    Abstract: Systems, apparatuses, methods, and computer program products are disclosed for predicting an entity and intent based on captured speech. An example method includes capturing speech and converting the speech to text. The example method further includes causing generation of one or more entities and one or more intents based on the speech and the text. The example method further includes determining a next action based on each of the one or more entities and each of the one or more intents.
    Type: Grant
    Filed: November 24, 2021
    Date of Patent: December 31, 2024
    Assignee: Wells Fargo Bank, N.A.
    Inventors: Vinothkumar Venkataraman, Rahul Ignatius, Naveen Gururaja Yeri, Paul Davis
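The final step — choosing a next action from the recognized entities and intents — can be sketched as a lookup with a fallback. The intent names, entity names, and actions below are invented for illustration.

```python
# Hypothetical sketch: map each recognized (intent, entity) pair to a
# next action, with a fallback when no mapping exists.

NEXT_ACTIONS = {
    ("check_balance", "savings"): "fetch_savings_balance",
    ("transfer", "checking"): "start_transfer_flow",
}

def next_action(intent, entity):
    return NEXT_ACTIONS.get((intent, entity), "route_to_agent")

print(next_action("transfer", "checking"))  # start_transfer_flow
print(next_action("transfer", "unknown"))   # route_to_agent
```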
  • Patent number: 12165645
Abstract: Systems, methods, and computer-readable media are disclosed for annotating content data such as video data with annotation data (e.g., images, emoji, memes, stylized text, sounds) in near real time. Example methods may include determining transcribed text from the content data, associating annotation data with the transcribed text, and annotating the content data with some or all transcribed text and annotation data. Example methods may further include editing the annotated content data to generate modified annotated content data and sending the annotated content data and/or modified annotated content data to a device.
    Type: Grant
    Filed: May 28, 2020
    Date of Patent: December 10, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Blair Harold Beebe, Peter Chin, Maksim Surguy, Chris Aaron Edmonds, Christina Siegfried, Kwan Ting Lee, Darvin Vida
  • Patent number: 12159120
    Abstract: A translation method includes acquiring an image, where the image includes a text to be translated; splitting the text to be translated in the image and acquiring a plurality of target objects, where each of the plurality of target objects includes a word or a phrase of the text to be translated; receiving an input operation for the plurality of target objects, acquiring an object to be translated among the plurality of target objects, and translating the object to be translated.
    Type: Grant
    Filed: March 19, 2021
    Date of Patent: December 3, 2024
    Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
    Inventors: Shaoting Yi, Yongjia Yu
  • Patent number: 12154552
    Abstract: A natural language understanding (NLU) system generates in-place annotations for natural language utterances or other types of time-based media based on stand-off annotations. The in-place annotations are associated with particular sub-sequences of an annotation, which provides richer information than stand-off annotations, which are associated only with an utterance as a whole. To generate the in-place annotations for an utterance, the NLU system applies an encoder network and a decoder network to obtain attention weights for the various tokens within the utterance. The NLU system disqualifies tokens of the utterance based on their corresponding attention weights, and selects highest-scoring contiguous sequences of tokens between the disqualified tokens. In-place annotations are associated with the selected sequences.
    Type: Grant
    Filed: August 31, 2021
    Date of Patent: November 26, 2024
    Assignee: Interactions LLC
    Inventors: Brian David Lester, Srinivas Bangalore
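The span-selection step — disqualify tokens by attention weight, then keep the best contiguous run between disqualified tokens — can be sketched directly. The cutoff value and scoring (sum of weights) are assumptions for the example; the patent obtains the weights from an encoder-decoder network.

```python
# Hypothetical sketch: drop tokens whose attention weight falls below a
# cutoff, then select the remaining contiguous run with the highest
# total weight.

def best_span(tokens, weights, cutoff=0.1):
    runs, cur = [], []
    for tok, w in zip(tokens, weights):
        if w < cutoff:                 # disqualified token breaks the run
            if cur:
                runs.append(cur)
            cur = []
        else:
            cur.append((tok, w))
    if cur:
        runs.append(cur)
    if not runs:
        return []
    best = max(runs, key=lambda r: sum(w for _, w in r))
    return [tok for tok, _ in best]

print(best_span(["book", "a", "flight", "to", "Boston"],
                [0.3, 0.05, 0.4, 0.05, 0.5]))  # ['Boston']
```

An in-place annotation would then be attached to the returned token span rather than to the utterance as a whole.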
  • Patent number: 12148426
    Abstract: Embodiments of the disclosure generally relate to a dialog system allowing for automatically reactivating a speech acquiring mode after the dialog system delivers a response to a user request. The reactivation parameters, such as a delay, depend on a number of predetermined factors and conversation scenarios. The embodiments further provide for a method of operating of the dialog system. An exemplary method comprises the steps of: activating a speech acquiring mode, receiving a first input of a user, deactivating the speech acquiring mode, obtaining a first response associated with the first input, delivering the first response to the user, determining that a conversation mode is activated, and, based on the determination, automatically re-activating the speech acquiring mode within a first predetermined time period after delivery of the first response to the user.
    Type: Grant
    Filed: May 18, 2022
    Date of Patent: November 19, 2024
    Assignee: GOOGLE LLC
    Inventors: Ilya Gennadyevich Gelfenbeyn, Artem Goncharuk, Pavel Aleksandrovich Sirotin
  • Patent number: 12148444
    Abstract: Methods, systems, and computer program products for generating, from an input character sequence, an output sequence of audio data representing the input character sequence. The output sequence of audio data includes a respective audio output sample for each of a number of time steps. One example method includes, for each of the time steps: generating a mel-frequency spectrogram for the time step by processing a representation of a respective portion of the input character sequence using a decoder neural network; generating a probability distribution over a plurality of possible audio output samples for the time step by processing the mel-frequency spectrogram for the time step using a vocoder neural network; and selecting the audio output sample for the time step from the possible audio output samples in accordance with the probability distribution.
    Type: Grant
    Filed: April 5, 2021
    Date of Patent: November 19, 2024
    Assignee: Google LLC
    Inventors: Yonghui Wu, Jonathan Shen, Ruoming Pang, Ron J. Weiss, Michael Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, Russell John Wyatt Skerry-Ryan, Ryan M. Rifkin, Ioannis Agiomyrgiannakis
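The two-stage pipeline described here — a decoder network produces a mel-frequency spectrogram frame per time step, a vocoder network turns each frame into a distribution over audio samples, and one sample is selected per step — can be sketched with toy stand-ins for the neural networks. Both `decoder` and `vocoder` below are fabricated placeholders, not the patented models.

```python
# Hypothetical sketch of the per-time-step loop: decoder -> mel frame,
# vocoder -> sample distribution, then sample selection.

import random

def decoder(chars, t):
    """Toy stand-in: pretend mel frame derived from the characters."""
    return [(ord(c) + t) % 7 for c in chars]

def vocoder(mel_frame):
    """Toy stand-in: probability distribution over 3 sample values."""
    raw = [1, sum(mel_frame) % 5 + 1, 2]
    total = sum(raw)
    return [r / total for r in raw]

def synthesize(chars, steps, seed=0):
    rng = random.Random(seed)
    samples = []
    for t in range(steps):
        probs = vocoder(decoder(chars, t))
        samples.append(rng.choices([0, 1, 2], weights=probs)[0])
    return samples

audio = synthesize("hi", steps=4)
print(len(audio))  # 4
```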
  • Patent number: 12141542
    Abstract: Methods and servers for training a translation model for translation between a rare language from a group and a target language. The method includes acquiring an actual example of translation and using a transliteration function for generating a synthetic actual example of translation. The method includes acquiring a sentence in the target language, generating an artificial translation of that sentence using back-translation, and thereby generating a given artificial example of translation. The method includes generating a synthetic artificial example based on the given artificial example. The method includes training the translation model based on the synthetic actual example of translation and the synthetic artificial example of translation.
    Type: Grant
    Filed: December 17, 2021
    Date of Patent: November 12, 2024
    Assignee: Y.E. Hub Armenia LLC
    Inventors: Anton Aleksandrovich Dvorkovich, Roman Olegovich Peshkurov
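The back-translation step — generating an artificial rare-language side for a target-language sentence — can be sketched as follows. The reverse model here is a trivial placeholder; in the patent it is a trained target-to-rare translation model.

```python
# Hypothetical sketch: build artificial parallel examples by translating
# target-language sentences back into the rare language.

def back_translate(target_sentences, reverse_model):
    """reverse_model: target text -> synthetic rare-language text."""
    return [(reverse_model(s), s) for s in target_sentences]

# Toy stand-in for a real target->rare translator.
toy_reverse = lambda s: "[rare] " + s.lower()
pairs = back_translate(["Hello world"], toy_reverse)
print(pairs[0])  # ('[rare] hello world', 'Hello world')
```

The resulting (synthetic source, real target) pairs would then be combined with transliterated actual examples to train the final translation model.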
  • Patent number: 12131750
Abstract: A method for enhancing detection of synthetic voice data is provided that includes converting, by an electronic device, monophonic voice data into stereophonic voice data. The stereophonic voice data includes a first channel signal and a second channel signal. Moreover, the method includes decomposing, by a trained machine learning model, the stereophonic voice data into a mid-signal and a side signal. The method also includes determining, in the decomposed signals, artifacts indicative of synthetic generation, calculating, based on the determined artifacts, a probability score reflecting the likelihood the monophonic voice data was synthetically generated, and comparing the probability score against a threshold value. When the probability score satisfies the threshold value, there is a high likelihood that the monophonic voice data includes synthetic artifacts, and an alert is generated indicating the monophonic voice data is potentially fraudulent.
    Type: Grant
    Filed: May 10, 2024
    Date of Patent: October 29, 2024
    Assignee: Daon Technology
    Inventors: Raphael A. Rodriguez, Olena Mizynchuk, Davyd Mizynchuk
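The mid/side decomposition this method builds on is the standard one: the mid signal carries content shared by both channels, the side signal carries inter-channel differences. A minimal sketch on raw sample lists (the patent performs the decomposition with a trained model):

```python
# Hypothetical sketch: classic mid/side decomposition of a stereo pair,
# mid = (L + R) / 2 and side = (L - R) / 2.

def mid_side(left, right):
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side

mid, side = mid_side([1.0, 0.5], [0.5, 0.5])
print(mid, side)  # [0.75, 0.5] [0.25, 0.0]
```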