Patents Examined by Shaun Roberts
  • Patent number: 12380273
    Abstract: A trigger token correlates with a need for maintenance of a system. A computer-implemented method comprises: receiving a logfile of the system, wherein the logfile includes a plurality of event descriptions, and a subset of the plurality of event descriptions is assigned a time stamp; receiving a point in time of a technical defect of the system; extracting tokens from the subset of the event descriptions based on a token category, wherein each of the extracted tokens is contained as a character string in the subset of the plurality of event descriptions; determining the trigger token from the extracted tokens based on a correlation of time stamps assigned to the extracted tokens with the point in time of the technical defect of the system; and provisioning the trigger token.
    Type: Grant
    Filed: November 7, 2022
    Date of Patent: August 5, 2025
    Assignee: Siemens Healthineers AG
    Inventor: Tobias Hipp
  • Patent number: 12380909
    Abstract: A device to perform speech enhancement includes one or more processors configured to process image data to detect at least one of an emotion, a speaker characteristic, or a noise type. The one or more processors are also configured to generate context data based at least in part on the at least one of the emotion, the speaker characteristic, or the noise type. The one or more processors are further configured to obtain input spectral data based on an input signal. The input signal represents sound that includes speech. The one or more processors are also configured to process, using a multi-encoder transformer, the input spectral data and the context data to generate output spectral data that represents a speech enhanced version of the input signal.
    Type: Grant
    Filed: June 14, 2023
    Date of Patent: August 5, 2025
    Assignee: QUALCOMM Incorporated
    Inventors: Kyungguen Byun, Shuhua Zhang, Lae-Hoon Kim, Erik Visser, Sunkuk Moon, Vahid Montazeri
  • Patent number: 12380880
    Abstract: An end-to-end automatic speech recognition (ASR) system can be constructed by fusing a first ASR model with a transformer. The input of the transformer is a learned layer generated by the first ASR model. The fused ASR model and transformer can be treated as a single end-to-end model and trained as a single model. In some embodiments, the end-to-end speech recognition system can be trained using a teacher-student training technique by selectively truncating portions of the first ASR model and/or the transformer components and selectively freezing various layers during the training passes.
    Type: Grant
    Filed: April 3, 2023
    Date of Patent: August 5, 2025
    Assignee: Deepgram, Inc.
    Inventors: Andrew Nathan Seagraves, Deepak Subburam, Adam Joseph Sypniewski, Scott Ivan Stephenson, Jacob Edward Cutter, Michael Joseph Sypniewski, Daniel Lewis Shafer
  • Patent number: 12374345
    Abstract: A stereo signal encoding method includes: obtaining a residual signal encoding parameter of a current frame of a stereo signal based on downmixed signal energy and residual signal energy of each of M sub-bands of the current frame, where the residual signal encoding parameter indicates whether to encode residual signals of the M sub-bands; determining whether to encode the residual signals based on the residual signal encoding parameter; and encoding the residual signals when it is determined that the residual signals need to be encoded.
    Type: Grant
    Filed: April 3, 2024
    Date of Patent: July 29, 2025
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Bin Wang, Zexin Liu, Haiting Li
  • Patent number: 12374352
    Abstract: Methods, systems and apparatuses for computer-generated visualization of speech are described herein. An example method of computer-generated visualization of speech including at least one segment includes: generating a graphical representation of an object corresponding to a segment of the speech; and displaying the graphical representation of the object on a screen of a computing device. Generating the graphical representation includes: representing a duration of the respective segment by a length of the object and representing intensity of the respective segment by a width of the object; and placing, in the graphical representation, a space between adjacent objects.
    Type: Grant
    Filed: August 17, 2023
    Date of Patent: July 29, 2025
    Assignee: SomniQ, Inc.
    Inventors: Rikko Sakaguchi, Hidenori Ishikawa
  • Patent number: 12347433
    Abstract: An electronic device stores a voice assistant library for execution on the electronic device based on the electronic device having a first device type. The electronic device receives a verbal input from a user. It extracts request information from the verbal input by processing the verbal input using the voice assistant library executing on the device. It transmits a request to a remote system. The electronic device receives a response to the request. The response is generated by the remote system. The electronic device performs an operation in accordance with the response by one or more voice-processing modules of the configured voice assistant library.
    Type: Grant
    Filed: January 26, 2024
    Date of Patent: July 1, 2025
    Assignee: Google LLC
    Inventors: Kenneth Mixter, Raunaq Shah
  • Patent number: 12340822
    Abstract: A method of audio content identification includes using a two-stage classifier. The first stage includes previously-existing classifiers and the second stage includes a new classifier. The outputs of the first and second stages calculated over different time periods are combined to generate a steering signal. The final classification results from a combination of the steering signal and the outputs of the first and second stages. In this manner, a new classifier may be added without disrupting existing classifiers.
    Type: Grant
    Filed: August 18, 2021
    Date of Patent: June 24, 2025
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Guiping Wang, Lie Lu
  • Patent number: 12340812
    Abstract: Controlling a concealment method for a lost audio frame associated with a received audio signal is provided. At least one bin vector of a spectral representation for at least one tone is obtained, wherein the at least one bin vector includes three consecutive bin values for the at least one tone. Whether each of the three consecutive bin values has a complex value or a real value is determined. Responsive to the determination, the three consecutive bin values are processed to estimate a frequency of the at least one tone based on whether each bin value has a complex value or a real value.
    Type: Grant
    Filed: April 26, 2024
    Date of Patent: June 24, 2025
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventor: Martin Sehlstedt
  • Patent number: 12334066
    Abstract: A method for generating emotionally intelligent responses to information seeking questions includes receiving audio data corresponding to a query spoken by a user and captured by an assistant-enabled device associated with the user, and processing, using a speech recognition model, the audio data to determine a transcription of the query. The method also includes performing query interpretation on the transcription of the query to identify an emotional state of the user that spoke the query, and an action to perform. The method also includes obtaining a response preamble based on the emotional state of the user and performing the identified action to obtain information responsive to the query. The method further includes generating a response including the obtained response preamble followed by the information responsive to the query.
    Type: Grant
    Filed: March 18, 2022
    Date of Patent: June 17, 2025
    Assignee: Google LLC
    Inventors: Madelaine Plauché, Kate Beryl Berman
  • Patent number: 12293771
    Abstract: Systems and methods for equalizing audio during a collaboration session in a heterogenous computing platform are described. In an illustrative, non-limiting embodiment, an Information Handling System (IHS) may include: a heterogeneous computing platform comprising a plurality of devices, and a memory coupled to the heterogeneous computing platform, where the memory comprises a plurality of sets of firmware instructions, where each set of firmware instructions, upon execution by a respective device, enables the respective device to provide a corresponding firmware service, and where at least one of the plurality of devices operates as an orchestrator configured to: receive a policy from an Information Technology Decision Maker (ITDM) or Original Equipment Manufacturer (OEM), and select an audio equalization setting usable during a collaboration session based, at least in part, upon the policy.
    Type: Grant
    Filed: September 6, 2022
    Date of Patent: May 6, 2025
    Assignee: Dell Products, L.P.
    Inventors: Daniel L. Hamlin, Srikanth Kondapi, Todd Erick Swierk
  • Patent number: 12288566
    Abstract: A device capable of using data from multiple sensors to determine an estimated position/direction of a user with respect to the device. The device may use estimated position data, along with confidence data, that originated from a plurality of sensors to fuse the data to determine the user's estimated position and comprehensive confidence of the estimated position. The system may use the location information to perform beamforming/beam steering and/or other downstream operations using the comprehensive estimated position.
    Type: Grant
    Filed: June 27, 2022
    Date of Patent: April 29, 2025
    Assignee: Amazon Technologies, Inc.
    Inventors: Anshuman Ganguly, Srivatsan Kandadai, Trausti Thor Kristjansson, Wontak Kim
  • Patent number: 12254871
    Abstract: A method of operating a customer utterance analysis system includes obtaining a subset of utterances from among a first set of utterances. The method includes encoding, by a sentence encoder, the subset of utterances into multi-dimensional vectors. The method includes generating reduced-dimensionality vectors by reducing a dimensionality of the multi-dimensional vectors. Each vector of the reduced-dimensionality vectors corresponds to an utterance from among the subset of utterances. The method includes performing clustering on the reduced-dimensionality vectors. The method includes, based on the clustering performed on the reduced-dimensionality vectors, arranging the subset of utterances into clusters. The method includes obtaining labels for a least two clusters from among the clusters. The method includes generating training data based on the obtained labels. The method includes training a neural network model to predict an intent of an utterance based on the training data.
    Type: Grant
    Filed: March 14, 2023
    Date of Patent: March 18, 2025
    Assignee: CHARLES SCHWAB & CO., INC.
    Inventors: Abhilash Krishnankutty Nair, Amaris Yuseon Sim, Dayanand Narregudem, Drew David Riassetto, Logan Sommers Ahlstrom, Nafiseh Saberian, Stephen Filios, Ravindra Reddy Tappeta Venkata
  • Patent number: 12248547
    Abstract: Provided is to prevent a false determination due to an attachment condition of an apparatus that transmits and receives an acoustic signal, and perform accurate personal authentication. A personal authentication device includes: a personal authentication means that authenticates an individual by using first information at least including an acoustic characteristic calculated from an acoustic signal propagating through the head of the user, which is detected by an apparatus being attached on a head of a user for transmitting and receiving the acoustic signal, and a feature amount extracted from the acoustic characteristic; an attachment trouble rule storage means that stores an attachment trouble rule for detecting an attachment trouble with the apparatus; and an attachment trouble detection means that detects a trouble with an attachment state of the apparatus when the first information satisfies the attachment trouble rule.
    Type: Grant
    Filed: October 6, 2023
    Date of Patent: March 11, 2025
    Assignee: NEC CORPORATION
    Inventors: Takayuki Arakawa, Takafumi Koshinaka
  • Patent number: 12248548
    Abstract: Provided is to prevent a false determination due to an attachment condition of an apparatus that transmits and receives an acoustic signal, and perform accurate personal authentication. A personal authentication device includes: a personal authentication means that authenticates an individual by using first information at least including an acoustic characteristic calculated from an acoustic signal propagating through the head of the user, which is detected by an apparatus being attached on a head of a user for transmitting and receiving the acoustic signal, and a feature amount extracted from the acoustic characteristic; an attachment trouble rule storage means that stores an attachment trouble rule for detecting an attachment trouble with the apparatus; and an attachment trouble detection means that detects a trouble with an attachment state of the apparatus when the first information satisfies the attachment trouble rule.
    Type: Grant
    Filed: October 6, 2023
    Date of Patent: March 11, 2025
    Assignee: NEC CORPORATION
    Inventors: Takayuki Arakawa, Takafumi Koshinaka
  • Patent number: 12248550
    Abstract: Provided is to prevent a false determination due to an attachment condition of an apparatus that transmits and receives an acoustic signal, and perform accurate personal authentication. A personal authentication device includes: a personal authentication means that authenticates an individual by using first information at least including an acoustic characteristic calculated from an acoustic signal propagating through the head of the user, which is detected by an apparatus being attached on a head of a user for transmitting and receiving the acoustic signal, and a feature amount extracted from the acoustic characteristic; an attachment trouble rule storage means that stores an attachment trouble rule for detecting an attachment trouble with the apparatus; and an attachment trouble detection means that detects a trouble with an attachment state of the apparatus when the first information satisfies the attachment trouble rule.
    Type: Grant
    Filed: October 6, 2023
    Date of Patent: March 11, 2025
    Assignee: NEC CORPORATION
    Inventors: Takayuki Arakawa, Takafumi Koshinaka
  • Patent number: 12230280
    Abstract: A method, decoder, and program code for controlling a concealment method for a lost audio frame is provided. A first audio frame and a second audio frame of the received audio signal are decoded to obtain modified discrete cosine transform (MDCT) coefficients. Values of a first spectral shape based upon the MDCT coefficients decoded from the first audio frame decoded and values of a second spectral shape based upon MDCT coefficients decoded from the second audio frame decoded are determined, the spectral shapes each comprising a number of sub-bands. The values of the spectral shapes and frame energies of the first audio frame and second audio frame are transformed into representations of FFT based spectral analyses. A transient condition is detected based on the representations of the FFTs. Responsive to detecting the transient condition, the concealment method is modified by selectively adjusting a spectrum magnitude of a substitution frame spectrum.
    Type: Grant
    Filed: November 30, 2023
    Date of Patent: February 18, 2025
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Martin Sehlstedt, Jonas Svedberg
  • Patent number: 12217759
    Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
    Type: Grant
    Filed: February 6, 2024
    Date of Patent: February 4, 2025
    Assignee: GOOGLE LLC
    Inventors: Barnaby James, Bo Wang, Sunil Vemuri, David Schairer, Ulas Kirazci, Ertan Dogrultan, Petar Aleksic
  • Patent number: 12205606
    Abstract: An improved concept for coding sample values of a spectral envelope is obtained by combining spectrotemporal prediction on the one hand and context-based entropy coding the residuals, on the other hand, while particularly determining the context for a current sample value dependent on a measure of a deviation between a pair of already coded/decoded sample values of the spectral envelope in a spectrotemporal neighborhood of the current sample value. The combination of the spectrotemporal prediction on the one hand and the context-based entropy coding of the prediction residuals with selecting the context depending on the deviation measure on the other hand harmonizes with the nature of spectral envelopes.
    Type: Grant
    Filed: September 11, 2023
    Date of Patent: January 21, 2025
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Florin Ghido, Andreas Niedermeier
  • Patent number: 12205607
    Abstract: Described herein is a method for generating a modified bitstream on a source device, wherein the method includes the steps of: a) receiving, by a receiver, a bitstream including coded media data; b) generating, by an embedder, payload of additional media data and embedding the payload in the bitstream for obtaining, as an output from the embedder, a modified bitstream including the coded media data and the payload of the additional media data; and d) outputting the modified bitstream to a sink device. Described is further a method for processing said modified bitstream on a sink device. Described are moreover a respective source device and sink device as well as a system of a source device and a sink device and respective computer program products.
    Type: Grant
    Filed: August 13, 2020
    Date of Patent: January 21, 2025
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Christof Fersch, Daniel Fischer, Leon Terentiv, Gregory John McGarry
  • Patent number: 12198731
    Abstract: Systems and methods are provided to implement and facilitate cross-fading, interstitials and other effects/processing of two or more media elements in a personalized media delivery service to experience consistent high quality. The effects or crossfade processing may occur on the broadcast/publisher/server-side, but may be personalized to a specific user, allowing a personalized experience for each user, where the processing burden is minimized on the downstream side/client device. This approach enables a consistent user experience, independent of client device capabilities. A large-scale personalized content delivery service may be implemented by limiting the processing to the first and last chunks of any file. In exemplary embodiments, this type of processing may easily be accommodated in cloud computing technology, where first and last files are extracted and processed within the cloud to meet the required load.
    Type: Grant
    Filed: November 20, 2023
    Date of Patent: January 14, 2025
    Assignee: Sirius XM Radio Inc.
    Inventors: Raymond Lowe, Christopher Ward