Patents Examined by Shaun Roberts
-
Patent number: 11664021
Abstract: A method of biasing speech recognition includes receiving audio data encoding an utterance and obtaining a set of one or more biasing phrases corresponding to a context of the utterance. Each biasing phrase in the set of one or more biasing phrases includes one or more words. The method also includes processing, using a speech recognition model, acoustic features derived from the audio data and grapheme and phoneme data derived from the set of one or more biasing phrases to generate an output of the speech recognition model. The method also includes determining a transcription for the utterance based on the output of the speech recognition model.
Type: Grant
Filed: December 9, 2021
Date of Patent: May 30, 2023
Assignee: Google LLC
Inventors: Rohit Prakash Prabhavalkar, Golan Pundak, Tara N. Sainath, Antoine Jean Bruguier
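The abstract describes feeding grapheme and phoneme data derived from contextual biasing phrases into the recognition model alongside acoustic features, but the listing gives no implementation detail. The sketch below is only an illustration of how a set of bias phrases might be turned into padded grapheme-ID and phoneme-ID matrices that a model could consume; the character inventory, the toy phoneme lexicon, and the padding scheme are assumptions, not the patented method.

```python
import numpy as np

# Toy phoneme lexicon; a real system would use a pronunciation dictionary or a G2P model.
PHONEME_LEXICON = {
    "play": ["P", "L", "EY"],
    "jazz": ["JH", "AE", "Z"],
    "navigate": ["N", "AE", "V", "AH", "G", "EY", "T"],
    "home": ["HH", "OW", "M"],
}

def encode_bias_phrases(phrases):
    """Convert bias phrases into padded grapheme-ID and phoneme-ID matrices."""
    grapheme_vocab = {c: i + 1 for i, c in enumerate(sorted({c for p in phrases for c in p}))}
    phoneme_vocab = {}
    grapheme_seqs, phoneme_seqs = [], []
    for phrase in phrases:
        grapheme_seqs.append([grapheme_vocab[c] for c in phrase])
        phones = []
        for word in phrase.split():
            for ph in PHONEME_LEXICON.get(word, []):
                phoneme_vocab.setdefault(ph, len(phoneme_vocab) + 1)
                phones.append(phoneme_vocab[ph])
        phoneme_seqs.append(phones)

    def pad(seqs):
        width = max(len(s) for s in seqs)
        out = np.zeros((len(seqs), width), dtype=np.int64)  # 0 = padding ID
        for row, seq in enumerate(seqs):
            out[row, : len(seq)] = seq
        return out

    return pad(grapheme_seqs), pad(phoneme_seqs)

if __name__ == "__main__":
    graphemes, phonemes = encode_bias_phrases(["play jazz", "navigate home"])
    print(graphemes.shape, phonemes.shape)  # one padded row per bias phrase
```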
-
Patent number: 11664040
Abstract: An apparatus for processing an audio signal includes an audio signal analyzer and a filter. The audio signal analyzer is configured to analyze an audio signal to determine a plurality of noise suppression filter values for a plurality of bands of the audio signal, wherein the analyzer is configured to determine a noise suppression filter value so that a noise suppression filter value is greater than or equal to a minimum noise suppression filter value and so that the minimum noise suppression value depends on a characteristic of the audio signal. The filter is configured for filtering the audio signal, wherein the filter is adjusted based on the noise suppression filter values.
Type: Grant
Filed: March 23, 2021
Date of Patent: May 30, 2023
Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
Inventors: Anthony Lombard, Bernhard Birzer, Dirk Mahne, Edwin Mabande, Fabian Kuech, Emanuel Habets, Paolo Annibale
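A minimal NumPy sketch of the general idea of a per-band suppression gain that is clipped to a minimum value, where the minimum itself depends on a characteristic of the signal. Here the characteristic is assumed to be the band's estimated SNR, and the Wiener-style gain and the SNR-to-floor mapping are illustrative choices, not the patented method.

```python
import numpy as np

def suppression_gains(noisy_power, noise_power):
    """Per-band noise-suppression gains with a signal-dependent lower bound.

    noisy_power, noise_power: per-band power estimates (1-D arrays).
    """
    snr = np.maximum(noisy_power - noise_power, 1e-12) / np.maximum(noise_power, 1e-12)
    gain = snr / (1.0 + snr)                # Wiener-style attenuation per band

    # Signal-dependent floor: suppress less (higher minimum gain) in bands
    # where the signal clearly dominates the noise.  The mapping is illustrative.
    min_gain = np.where(snr > 10.0, 0.5, 0.1)
    return np.maximum(gain, min_gain)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noisy = rng.uniform(0.5, 4.0, size=16)
    noise = rng.uniform(0.1, 1.0, size=16)
    gains = suppression_gains(noisy, noise)
    filtered_bands = gains * noisy          # apply the adjusted filter per band
    print(np.round(gains, 2))
```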
-
Patent number: 11646047
Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where the generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.
Type: Grant
Filed: May 23, 2022
Date of Patent: May 9, 2023
Assignee: Dolby International AB
Inventor: Lars Villemoes
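The abstract mentions complex-valued analysis samples with a phase and a magnitude, and a subband transposition factor Q. A rough sketch of the basic phase-multiplication operation behind harmonic transposition is shown below: each synthesis sample keeps the analysis sample's magnitude and has its phase scaled by Q. Time stretching (factor S), the filterbank itself, and all cross-product refinements are omitted; this is an assumption-laden illustration, not the claimed system.

```python
import numpy as np

def transpose_subband(analysis_samples, Q=2.0):
    """Phase-multiplication transposition of complex subband samples.

    Keeps each sample's magnitude and multiplies its phase by the
    transposition factor Q; the stretch factor S is not handled here.
    """
    magnitude = np.abs(analysis_samples)
    phase = np.angle(analysis_samples)
    return magnitude * np.exp(1j * Q * phase)

if __name__ == "__main__":
    t = np.arange(64)
    subband = np.exp(1j * 0.2 * t)            # toy complex analysis subband signal
    synthesis = transpose_subband(subband, Q=2.0)
    # Phase of each synthesis sample is twice the analysis phase.
    print(np.round(np.angle(synthesis[1:5]) / np.angle(subband[1:5]), 2))
```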
-
Patent number: 11636858
Abstract: A language proficiency analyzer automatically evaluates a person's language proficiency by analyzing that person's oral communications with another person. The analyzer first enhances the quality of an audio recording of a conversation between the two people using a neural network that automatically detects loss features in the audio and adds those loss features back into the audio. The analyzer then performs a textual and audio analysis on the improved audio. Through textual analysis, the analyzer uses a multi-attention network to determine how focused one person is on the other and/or how pleased one person is with the other. Through audio analysis, the analyzer uses a neural network to determine how well one person pronounced words during the conversation.
Type: Grant
Filed: October 12, 2021
Date of Patent: April 25, 2023
Assignee: Bank of America Corporation
Inventors: Madhusudhanan Krishnamoorthy, Harikrishnan Rajeev
-
Patent number: 11626115
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
Type: Grant
Filed: January 24, 2022
Date of Patent: April 11, 2023
Assignee: GOOGLE LLC
Inventors: Barnaby James, Bo Wang, Sunil Vemuri, David Schairer, Ulas Kirazci, Ertan Dogrultan, Petar Aleksic
-
Patent number: 11626101
Abstract: Systems and methods are described for processing and interpreting audible commands spoken in one or more languages. Speech recognition systems disclosed herein may be used as a stand-alone speech recognition system or comprise a portion of another content consumption system. A requesting user may provide audio input (e.g., command data) to the speech recognition system via a computing device to request an entertainment system to perform one or more operational commands. The speech recognition system may analyze the audio input across a variety of linguistic models, and may parse the audio input to identify a plurality of phrases and corresponding action classifiers. In some embodiments, the speech recognition system may utilize the action classifiers and other information to determine the one or more identified phrases that appropriately match the desired intent and operational command associated with the user's spoken command.
Type: Grant
Filed: October 28, 2021
Date of Patent: April 11, 2023
Assignee: Comcast Cable Communications, LLC
Inventors: George Thomas Des Jardins, Vikrant Sagar
-
Patent number: 11626108
Abstract: A method of operating a customer utterance analysis system includes obtaining a subset of utterances from among a first set of utterances. The method includes encoding, by a sentence encoder, the subset of utterances into multi-dimensional vectors. The method includes generating reduced-dimensionality vectors by reducing a dimensionality of the multi-dimensional vectors. Each vector of the reduced-dimensionality vectors corresponds to an utterance from among the subset of utterances. The method includes performing clustering on the reduced-dimensionality vectors. The method includes, based on the clustering performed on the reduced-dimensionality vectors, arranging the subset of utterances into clusters. The method includes obtaining labels for at least two clusters from among the clusters. The method includes generating training data based on the obtained labels. The method includes training a neural network model to predict an intent of an utterance based on the training data.
Type: Grant
Filed: September 25, 2020
Date of Patent: April 11, 2023
Assignee: TD Ameritrade IP Company, Inc.
Inventors: Abhilash Krishnankutty Nair, Amaris Yuseon Sim, Dayanand Narregudem, Drew David Riassetto, Logan Sommers Ahlstrom, Nafiseh Saberian, Stephen Filios, Ravindra Reddy Tappeta Venkata
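The pipeline above (encode, reduce dimensionality, cluster, label, train) can be sketched end to end with scikit-learn. In this illustration TF-IDF stands in for the sentence encoder, PCA for the dimensionality reduction, and KMeans for the clustering; the real system's encoder, reduction method, and parameters are not disclosed in this listing.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans

utterances = [
    "I forgot my password", "reset my password please",
    "what is my account balance", "show me my balance",
    "transfer money to savings", "move funds to my savings account",
]

# Stand-in for the sentence encoder: one multi-dimensional vector per utterance.
vectors = TfidfVectorizer().fit_transform(utterances).toarray()

# Reduce dimensionality, then cluster the reduced-dimensionality vectors.
reduced = PCA(n_components=3).fit_transform(vectors)
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(reduced)

# Arrange utterances into clusters; a human would then label at least two of
# them (e.g. "password_reset", "check_balance") to build intent training data.
clusters = {}
for utterance, label in zip(utterances, labels):
    clusters.setdefault(label, []).append(utterance)
for label, members in clusters.items():
    print(label, members)
```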
-
Patent number: 11621011
Abstract: Described herein is a method of decoding an audio or speech signal, the method including the steps of: (a) receiving, by a decoder, a coded bitstream including the audio or speech signal and conditioning information; (b) providing, by a bitstream decoder, decoded conditioning information in a format associated with a first bitrate; (c) converting, by a converter, the decoded conditioning information from the format associated with the first bitrate to a format associated with a second bitrate; and (d) providing, by a generative neural network, a reconstruction of the audio or speech signal according to a probabilistic model conditioned by the conditioning information in the format associated with the second bitrate. Further described are an apparatus for decoding an audio or speech signal, a respective encoder, a system of the encoder and the apparatus for decoding an audio or speech signal, as well as a respective computer program product.
Type: Grant
Filed: October 29, 2019
Date of Patent: April 4, 2023
Assignee: Dolby International AB
Inventors: Janusz Klejsa, Per Hedelin
-
Patent number: 11620990
Abstract: A method for optimizing speech recognition includes receiving a first acoustic segment characterizing a hotword detected by a hotword detector in streaming audio captured by a user device, extracting one or more hotword attributes from the first acoustic segment, and adjusting, based on the one or more hotword attributes extracted from the first acoustic segment, one or more speech recognition parameters of an automated speech recognition (ASR) model. After adjusting the speech recognition parameters of the ASR model, the method also includes processing, using the ASR model, a second acoustic segment to generate a speech recognition result. The second acoustic segment characterizes a spoken query/command that follows the first acoustic segment in the streaming audio captured by the user device.
Type: Grant
Filed: December 11, 2020
Date of Patent: April 4, 2023
Assignee: Google LLC
Inventors: Matthew Sharifi, Aleksandar Kracun
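A small sketch of the adjustment step described above: extract simple attributes from the acoustic segment containing the hotword and map them to recognition parameters for the query that follows. The attributes chosen here (RMS level, segment duration) and the parameter mapping (beam size, endpointing timeout) are illustrative assumptions, not the attributes or parameters named in the patent.

```python
import numpy as np

def hotword_attributes(segment, sample_rate=16000):
    """Extract simple attributes from the acoustic segment that contained the hotword."""
    rms = float(np.sqrt(np.mean(segment ** 2)))
    duration = len(segment) / sample_rate
    return {"rms": rms, "duration_s": duration}

def adjust_asr_parameters(attrs):
    """Map hotword attributes to speech-recognition parameters (illustrative mapping)."""
    params = {"beam_size": 8, "endpoint_silence_ms": 500}
    if attrs["rms"] < 0.01:            # quiet / far-field speaker: search harder
        params["beam_size"] = 16
    if attrs["duration_s"] > 0.8:      # slow speaker: wait longer before endpointing
        params["endpoint_silence_ms"] = 900
    return params

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    segment = 0.005 * rng.standard_normal(16000)   # 1 s of quiet audio
    print(adjust_asr_parameters(hotword_attributes(segment)))
```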
-
Patent number: 11600281
Abstract: There is disclosed inter alia an apparatus for spatial audio signal encoding comprising means for receiving for each time frequency block of a sub band of an audio frame a spatial audio parameter comprising an azimuth and an elevation; determining a first distortion measure for the audio frame by determining a first distance measure for each time frequency block and summing the first distance measure for each time frequency block; determining a second distortion measure for the audio frame by determining a second distance measure for each time frequency block and summing the second distance measure for each time frequency block, and selecting either the first quantization scheme or the second quantization scheme for quantising the elevation and the azimuth for all time frequency blocks of the sub band of the audio frame, wherein the selecting is dependent on the first and second distortion measures.
Type: Grant
Filed: September 20, 2019
Date of Patent: March 7, 2023
Assignee: Nokia Technologies Oy
Inventor: Adriana Vasilache
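The selection logic can be illustrated with a short sketch: compute a per-block distance for each candidate quantization scheme, sum the distances over the frame, and keep the scheme with the lower total distortion for the whole sub band. The uniform quantizers, the step sizes, and the absolute-difference distance used below are placeholders, not the schemes or measures defined in the patent.

```python
import numpy as np

def quantize(values, step):
    """Uniform quantizer with the given step size (degrees)."""
    return np.round(values / step) * step

def frame_distortion(az, el, az_step, el_step):
    """Sum of per-time-frequency-block distances for one quantization scheme."""
    d_az = np.abs(az - quantize(az, az_step))
    d_el = np.abs(el - quantize(el, el_step))
    return float(np.sum(d_az + d_el))

if __name__ == "__main__":
    rng = np.random.default_rng(2)
    azimuth = rng.uniform(-180, 180, size=20)     # one value per time frequency block
    elevation = rng.uniform(-90, 90, size=20)

    d1 = frame_distortion(azimuth, elevation, az_step=10.0, el_step=10.0)  # scheme 1
    d2 = frame_distortion(azimuth, elevation, az_step=5.0, el_step=22.5)   # scheme 2
    chosen = 1 if d1 <= d2 else 2
    print(f"scheme 1: {d1:.1f}, scheme 2: {d2:.1f} -> use scheme {chosen} for the sub band")
```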
-
Patent number: 11594230
Abstract: Methods, systems, apparatus, including computer programs encoded on computer storage medium, to facilitate language independent-speaker verification. In one aspect, a method includes actions of receiving, by a user device, audio data representing an utterance of a user. Other actions may include providing, to a neural network stored on the user device, input data derived from the audio data and a language identifier. The neural network may be trained using speech data representing speech in different languages or dialects. The method may include additional actions of generating, based on output of the neural network, a speaker representation and determining, based on the speaker representation and a second representation, that the utterance is an utterance of the user. The method may provide the user with access to the user device based on determining that the utterance is an utterance of the user.
Type: Grant
Filed: May 4, 2021
Date of Patent: February 28, 2023
Assignee: Google LLC
Inventors: Ignacio Lopez Moreno, Li Wan, Quan Wang
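A minimal sketch of the verification step: derive a speaker representation from the utterance plus a language identifier, then compare it against an enrolled representation. The random projection standing in for the trained neural network, the one-hot language conditioning, and the cosine-similarity threshold are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(3)
PROJECTION = rng.standard_normal((45, 64))    # stand-in for the trained neural network

def speaker_representation(features, language_id, num_languages=5):
    """Map audio-derived features plus a language identifier to a speaker embedding."""
    one_hot = np.zeros(num_languages)
    one_hot[language_id] = 1.0
    x = np.concatenate([features, one_hot])    # 40 audio dims + 5 language dims
    emb = x @ PROJECTION
    return emb / np.linalg.norm(emb)

def verify(utterance_features, enrolled_embedding, language_id, threshold=0.7):
    """Accept the speaker if cosine similarity to the enrolled embedding is high enough."""
    candidate = speaker_representation(utterance_features, language_id)
    return float(candidate @ enrolled_embedding) >= threshold

if __name__ == "__main__":
    enrolled = speaker_representation(rng.standard_normal(40), language_id=2)
    test_features = rng.standard_normal(40)
    print(verify(test_features, enrolled, language_id=2))
```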
-
Patent number: 11580981
Abstract: An in-vehicle apparatus is connectable to a device that includes a voice assistant function. The in-vehicle apparatus includes: a voice detector that performs voice recognition of an audio signal input from a microphone and that controls functions of the in-vehicle apparatus based on a result of the voice recognition; and an interface that communicates with the device. When being informed of a detection of a predetermined word in the audio signal as the result of the voice recognition of the audio signal performed by the voice detector, the interface sends to the device, not via the voice detector, the audio signal input from the microphone. The predetermined word is for activating the voice assistant function of the device.
Type: Grant
Filed: March 3, 2021
Date of Patent: February 14, 2023
Assignee: DENSO TEN Limited
Inventors: Katsuaki Hikima, Daisuke Yamasaki, Futoshi Kosuga
-
Patent number: 11568858
Abstract: A computer-implemented method of building a multilingual acoustic model for automatic speech recognition in a low resource setting includes training a multilingual network on a set of training languages with an original transcribed training data to create a baseline multilingual acoustic model. Transliteration of transcribed training data is performed by processing through the multilingual network a plurality of multilingual data types from the set of languages, and outputting a pool of transliterated data. A filtering metric is applied to the pool of transliterated data output to select one or more portions of the transliterated data for retraining of the acoustic model. Data augmentation is performed by adding one or more selected portions of the output transliterated data back to the original transcribed training data to update training data. The training of a new multilingual acoustic model through the multilingual network is performed using the updated training data.
Type: Grant
Filed: October 17, 2020
Date of Patent: January 31, 2023
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Samuel Thomas, Kartik Audhkhasi, Brian E. D. Kingsbury
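The filter-then-augment step reads naturally as a small data-selection loop. In the sketch below the filtering metric is assumed to be a per-utterance decoding confidence threshold; the pool contents, the confidence values, and the threshold are hypothetical, and the actual metric used by the patented method is not given in this listing.

```python
# Hypothetical pool of transliterated hypotheses: (utterance_id, transliteration, confidence).
transliterated_pool = [
    ("utt01", "namaste duniya", 0.92),
    ("utt02", "bonjour le monde", 0.41),
    ("utt03", "hola mundo", 0.87),
]

original_training_data = [
    ("utt00", "hello world"),
]

def filter_transliterations(pool, min_confidence=0.8):
    """Filtering metric: keep only transliterations the model decoded confidently."""
    return [(uid, text) for uid, text, conf in pool if conf >= min_confidence]

# Data augmentation: add the selected transliterated portion back to the original data,
# then retrain the multilingual acoustic model on the updated training data.
updated_training_data = original_training_data + filter_transliterations(transliterated_pool)
print(updated_training_data)
```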
-
Patent number: 11568888
Abstract: A terminal control method, a terminal and a non-transitory computer-readable storage medium are provided. The terminal control method includes: receiving, by a microphone, a detection audio signal emitted from a speaker and having a frequency within a pre-set detection frequency range; acquiring actual audio parameters of the detection audio signal when being received by the microphone, and original audio parameters of the detection audio signal when being emitted from the speaker; determining a relative state between the microphone and the speaker according to the actual audio parameters and the original audio parameters; determining a terminal control operation to be performed, according to the relative state and a pre-set correspondence between relative states and terminal control operations; and performing the determined terminal control operation on a terminal where the microphone is located.
Type: Grant
Filed: June 3, 2020
Date of Patent: January 31, 2023
Assignee: ZTE CORPORATION
Inventors: Shaowu Shen, Liting Liu
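The decision chain (compare actual versus original parameters, classify a relative state, look up an operation) can be illustrated with a few lines of Python. The level-attenuation criterion, the state names, and the operations table below are assumptions chosen for the example; the patent does not specify them in this listing.

```python
def relative_state(original_level_db, actual_level_db, attenuation_threshold_db=20.0):
    """Classify the microphone/speaker relative state from the detection-signal levels."""
    attenuation = original_level_db - actual_level_db
    return "covered" if attenuation >= attenuation_threshold_db else "unobstructed"

# Pre-set correspondence between relative states and terminal control operations.
CONTROL_OPERATIONS = {
    "covered": "mute_ringer",
    "unobstructed": "keep_ringing",
}

if __name__ == "__main__":
    state = relative_state(original_level_db=70.0, actual_level_db=42.0)
    print(state, "->", CONTROL_OPERATIONS[state])
```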
-
Patent number: 11568878
Abstract: Techniques disclosed herein are directed towards streaming keyphrase detection which can be customized to detect one or more particular keyphrases, without requiring retraining of any model(s) for those particular keyphrase(s). Many implementations include processing audio data using a speaker separation model to generate separated audio data which isolates an utterance spoken by a human speaker from one or more additional sounds not spoken by the human speaker, and processing the separated audio data using a text independent speaker identification model to determine whether a verified and/or registered user spoke a spoken utterance captured in the audio data. Various implementations include processing the audio data and/or the separated audio data using an automatic speech recognition model to generate a text representation of the utterance.
Type: Grant
Filed: April 16, 2021
Date of Patent: January 31, 2023
Assignee: GOOGLE LLC
Inventors: Rajeev Rikhye, Quan Wang, Yanzhang He, Qiao Liang, Ian C. McGraw
-
Patent number: 11562758
Abstract: An encoder operable to filter audio signals into a plurality of frequency band components, generate quantized digital components for each band, identify a potential for pre-echo events within the generated quantized digital components, generate an approximate signal by decoding the quantized digital components using inverse pulse code modulation, generate an error signal by comparing the approximate signal with the sampled audio signal, and process the error signal and quantized digital components. The encoder is operable to process the error signal by processing delayed audio signals and Q band values, determining the potential for pre-echo events from the Q band values, and determining scale factors and MDCT block sizes for the potential for pre-echo events.
Type: Grant
Filed: March 29, 2022
Date of Patent: January 24, 2023
Assignee: IMMERSION NETWORKS, INC.
Inventors: James David Johnston, Stephen Daniel White, King Wei Hor, Barry M. Genova
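Pre-echo handling in transform coders generally comes down to detecting a sharp energy rise inside a block and switching to shorter transforms so quantization error stays confined in time. The sketch below shows that general idea only; the sub-block energy-ratio criterion and the thresholds are illustrative assumptions, not the Q-band-value procedure the abstract refers to.

```python
import numpy as np

def choose_mdct_block_size(block, num_subblocks=8, transient_ratio=8.0):
    """Pick long or short MDCT blocks based on a simple pre-echo (transient) check."""
    sub = block.reshape(num_subblocks, -1)
    energy = np.sum(sub ** 2, axis=1) + 1e-12
    # A large jump in sub-block energy signals a transient where pre-echo could occur.
    if np.max(energy[1:] / energy[:-1]) > transient_ratio:
        return "short"     # several short transforms confine the quantization error in time
    return "long"

if __name__ == "__main__":
    rng = np.random.default_rng(4)
    quiet = 0.01 * rng.standard_normal(1024)
    attack = quiet.copy()
    attack[768:] += 0.5 * rng.standard_normal(256)   # sharp onset late in the block
    print(choose_mdct_block_size(quiet), choose_mdct_block_size(attack))
```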
-
Patent number: 11562764
Abstract: An apparatus for generating a bandwidth enhanced audio signal from an input audio signal having an input audio signal frequency range includes: a raw signal generator configured for generating a raw signal having an enhancement frequency range, wherein the enhancement frequency range is not included in the input audio signal frequency range; a neural network processor configured for generating a parametric representation for the enhancement frequency range using the input audio frequency range of the input audio signal and a trained neural network; and a raw signal processor for processing the raw signal using the parametric representation for the enhancement frequency range to obtain a processed raw signal having frequency components in the enhancement frequency range, wherein the processed raw signal or the processed raw signal and the input audio signal frequency range of the input audio signal represent the bandwidth enhanced audio signal.
Type: Grant
Filed: April 17, 2020
Date of Patent: January 24, 2023
Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
Inventors: Konstantin Schmidt, Christian Uhle, Bernd Edler
-
Patent number: 11562737
Abstract: Speech recognition may be improved by generating and using a topic specific language model. A topic specific language model may be created by performing an initial pass on an audio signal using a generic or basis language model. A speech recognition device may then determine topics relating to the audio signal based on the words identified in the initial pass and retrieve a corpus of text relating to those topics. Using the retrieved corpus of text, the speech recognition device may create a topic specific language model. In one example, the speech recognition device may adapt or otherwise modify the generic language model based on the retrieved corpus of text.
Type: Grant
Filed: December 27, 2019
Date of Patent: January 24, 2023
Assignee: TIVO CORPORATION
Inventors: David F. Houghton, Seth Michael Murray, Sibley Verbeck Simon
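The adaptation idea (mix a generic language model with statistics from a topic corpus retrieved after the first pass) can be shown with a toy unigram model. Real systems would use n-gram or neural language models and a real retrieval step; the interpolation weight, the corpora, and the unigram formulation here are assumptions for illustration only.

```python
from collections import Counter

def unigram_model(corpus):
    """Maximum-likelihood unigram probabilities from a list of sentences."""
    counts = Counter(word for sentence in corpus for word in sentence.lower().split())
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}

def adapt(generic_lm, topic_corpus, weight=0.5):
    """Interpolate the generic model with a topic-specific model (illustrative adaptation)."""
    topic_lm = unigram_model(topic_corpus)
    vocab = set(generic_lm) | set(topic_lm)
    return {w: (1 - weight) * generic_lm.get(w, 0.0) + weight * topic_lm.get(w, 0.0)
            for w in vocab}

if __name__ == "__main__":
    generic = unigram_model(["the game was long", "the weather is nice"])
    # First-pass transcript suggested a baseball topic, so a baseball corpus was retrieved.
    topic_specific = adapt(generic, ["the pitcher threw a strike", "the batter hit a home run"])
    print(sorted(topic_specific.items(), key=lambda kv: -kv[1])[:5])
```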
-
Patent number: 11562757
Abstract: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block are combined using frequency-domain linear predictive coding (LPC), generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
Type: Grant
Filed: July 15, 2021
Date of Patent: January 24, 2023
Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang, Jin Soo Choi
-
Patent number: 11557287
Abstract: Provided is a system which allows a learner who is a non-native speaker of a given language to intuitively improve pronunciation of the language. A pronunciation conversion apparatus includes a conversion section which converts a first feature value corresponding to a first speech signal obtained when a first speaker who speaks a given language as his/her native language speaks another language such that the first feature value approaches a second feature value corresponding to a second speech signal obtained when a second speaker who speaks the other language as his/her native language speaks the other language. Each of the first feature value and the second feature value is a feature value capable of representing a difference in pronunciation, and a speech signal obtained from the first feature value after the conversion is presented to the first speaker.
Type: Grant
Filed: April 9, 2019
Date of Patent: January 17, 2023
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventor: Sadao Hiroya