Patents Examined by Leshui Zhang

Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals

Patent number: 12374342

Abstract: A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to render a plurality of decoded audio signals, which are obtained on the basis of the encoded representation, in dependence on one or more rendering parameters, to obtain a plurality of rendered audio signals. The multi-channel audio decoder is configured to derive one or more decorrelated audio signals from the rendered audio signals, and to combine the rendered audio signals, or a scaled version thereof, with the one or more decorrelated audio signals, to obtain the output audio signals. A multi-channel audio encoder provides a decorrelation method parameter to control an audio decoder.

Type: Grant

Filed: August 9, 2018

Date of Patent: July 29, 2025

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Disch, Harald Fuchs, Oliver Hellmuth, Juergen Herre, Adrian Murtaza, Jouni Paulus, Falko Ridderbusch, V, Leon Terentiv

Encoding and decoding methods, and encoding and decoding apparatuses for stereo signal

Patent number: 12361953

Abstract: This disclosure provides an encoding method and an encoder for a multi-channel signal. The encoding method includes: obtaining a first inter-channel time difference (ITD) of a current frame of a multi-channel signal that includes an initial left channel signal and an initial right channel signal; obtaining a second ITD of the current frame based on the first ITD and a third ITD of a previous frame of the multi-channel signal; performing delay alignment on the initial left channel signal and the initial right channel signal based on the second ITD, to obtain an aligned left channel signal and an aligned right channel signal; and encoding the aligned left channel signal and the aligned right channel signal.
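The two steps in this abstract (deriving a second ITD from the current estimate and the previous frame's ITD, then delay-aligning the channels) can be illustrated with a minimal Python sketch. The abstract does not say how the two ITD values are combined, so the weighted average in `smooth_itd`, the sign convention (a positive ITD means the right channel lags the left), and both function names are assumptions made only for illustration.

```python
def smooth_itd(first_itd, prev_itd, weight=0.5):
    """Derive the 'second ITD' from the current estimate and the previous frame.

    Assumption: a simple weighted average; the abstract gives no formula.
    """
    return round(weight * prev_itd + (1.0 - weight) * first_itd)


def delay_align(left, right, itd):
    """Delay the leading channel by |itd| samples so both channels line up.

    Assumed convention: itd > 0 means the right channel lags the left.
    """
    shift = abs(itd)
    pad = [0.0] * shift
    if itd > 0:
        # Left leads, so delay the left channel and keep the frame length.
        return (pad + list(left))[:len(left)], list(right)
    if itd < 0:
        # Right leads, so delay the right channel instead.
        return list(left), (pad + list(right))[:len(right)]
    return list(left), list(right)


# Hypothetical frame: the right channel is the left channel delayed by 2 samples.
left = [1.0, 0.0, 0.0, 0.0, 0.0, 0.0]
right = [0.0, 0.0, 1.0, 0.0, 0.0, 0.0]
second_itd = smooth_itd(first_itd=2, prev_itd=2)
aligned_left, aligned_right = delay_align(left, right, second_itd)
```

After alignment the two channels coincide sample for sample, which is the condition under which a downstream stereo encoder would see the highest inter-channel correlation.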

Type: Grant

Filed: July 12, 2023

Date of Patent: July 15, 2025

Assignee: Huawei Technologies Co., Ltd.

Inventors: Eyal Shlomot, Haiting Li, Bin Wang

Systems and methods for indicating communication efficiency or compliance with ATC phraseology

Patent number: 12347422

Abstract: Systems and methods for indicating communication effectiveness with air traffic control (ATC) are disclosed. The method includes: receiving a transcribed message containing a plurality of words used by an ownship flight crew member in a communication directed to ATC; determining a message intent of the transcribed message from the words used in the communication; identifying a plurality of ideal words that should be used for an ideal message having the same message intent as the transcribed message; comparing the words used in the communication with the words that should have been used in the ideal message; determining based on the comparing whether the words used in the communication conformed to ATC standard phraseology (e.g., ICAO Pilot communication vocabulary); generating an indicator for flight crew that indicates whether the words used in the communication conformed to ATC standard phraseology; and signaling an aircraft display device to display the indicator.

Type: Grant

Filed: January 12, 2022

Date of Patent: July 1, 2025

Assignee: HONEYWELL INTERNATIONAL INC.

Inventors: Naveen Venkatesh Prasad Nama, Chaya Garg, Vasantha Paulraj, Gobinathan Baladhandapani, Hariharan Saptharishi, Sivakumar Kanagarajan

Integration of high frequency audio reconstruction techniques

Patent number: 12340819

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: December 16, 2024

Date of Patent: June 24, 2025

Assignee: DOLBY INTERNATIONAL AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand

System and/or method for machine learning using student prediction model

Patent number: 12340309

Abstract: Disclosed are a system, method and apparatus to generate service codes based, at least in part, on electronic documents.

Type: Grant

Filed: May 23, 2022

Date of Patent: June 24, 2025

Assignee: Akasa, Inc.

Inventors: Byung-Hak Kim, Hariraam Varun Ganapathi

Integration of high frequency audio reconstruction techniques

Patent number: 12334102

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: December 16, 2024

Date of Patent: June 17, 2025

Assignee: DOLBY INTERNATIONAL AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand

Directional voice sensing using coherent optical detection

Patent number: 12334096

Abstract: An electronic device includes a microphone, an array of coherent optical emitters, an array of balanced coherent optical vibration sensors, and a processor. Each balanced coherent optical vibration sensor in the array of balanced coherent optical vibration sensors is paired with a coherent optical emitter in the array of coherent optical emitters. The processor is configured to analyze a set of waveforms acquired by the array of balanced coherent optical vibration sensors; identify, using the analysis of the set of waveforms, a set of one or more voices in a field of view; and adjust an output of the microphone to accentuate a particular voice in the set of one or more voices.

Type: Grant

Filed: October 3, 2023

Date of Patent: June 17, 2025

Assignee: Apple Inc.

Inventors: Eran Tal, Ariel Lipson

Training method, text translation method, electronic device, and storage medium

Patent number: 12333266

Abstract: A training method, a text translation method, an electronic device, and a storage medium, which relate to a field of artificial intelligence, in particular to fields of natural language processing and deep learning technologies.

Type: Grant

Filed: November 8, 2022

Date of Patent: June 17, 2025

Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Xiyang Wang, Ruiqing Zhang, Zhongjun He, Zhi Li, Hua Wu

Downmixed signal calculation method and apparatus

Patent number: 12327567

Abstract: This application discloses a downmixed signal calculation method and apparatus. The method includes: when a current frame or a previous frame of the current frame of a stereo signal is not a switching frame and a residual signal in the current frame or the previous frame does not need to be encoded, obtaining a second downmixed signal in the current frame and a downmix compensation factor of the current frame; correcting the second downmixed signal in the current frame based on the downmix compensation factor of the current frame, to obtain a first downmixed signal in the current frame; and determining the first downmixed signal in the current frame as a downmixed signal in the current frame in a preset frequency band.

Type: Grant

Filed: November 29, 2023

Date of Patent: June 10, 2025

Assignee: Huawei Technologies Co., Ltd.

Inventors: Haiting Li, Zexin Liu, Bin Wang

Multichannel audio coding

Patent number: 12300254

Abstract: In multichannel audio coding, improved computational efficiency is achieved by computing comparison parameters for ITD compensation between any two channels in the frequency domain for a parametric audio encoder. This may mitigate negative effects on encoder parameter estimates.

Type: Grant

Filed: September 8, 2023

Date of Patent: May 13, 2025

Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Jan Büthe, Eleni Fotopoulou, Srikanth Korse, Pallavi Maben, Markus Multrus, Franz Reutelhuber

Integration of high frequency audio reconstruction techniques

Patent number: 12300263

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: December 16, 2024

Date of Patent: May 13, 2025

Assignee: DOLBY INTERNATIONAL AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand

Method and apparatus for training text classification model

Patent number: 12271701

Abstract: This disclosure relates to a method and an apparatus for training a text classification model. The method may include determining a semantic representation of a training sample using the text classification model and determining a predicted classification result of the training sample based on the semantic representation. The method may further include generating an adversarial sample corresponding to the training sample based on the training sample and perturbation information and determining a semantic representation of the adversarial sample corresponding to the training sample using the text classification model.

Type: Grant

Filed: September 20, 2022

Date of Patent: April 8, 2025

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Yao Qiu, Jinchao Zhang, Jie Zhou, Cheng Niu

Database systems with automated structural metadata assignment

Patent number: 12248754

Abstract: Database systems and methods are provided for assigning structural metadata to records and creating automations using the structural metadata. One method of assigning structural metadata to a record associated with a conversation involves obtaining a plurality of utterances associated with the conversation, identifying, from among the plurality of utterances, a representative utterance for semantic content of the conversation, assigning the conversation to a group of semantically similar conversations based on the representative utterance, and automatically updating the record associated with the conversation at a database system to include metadata identifying the group of semantically similar conversations.

Type: Grant

Filed: September 19, 2022

Date of Patent: March 11, 2025

Inventors: Yixin Mao, Zachary Alexander, Tian Xie, Wenhao Liu

Apparatus and method for encoding or decoding a multichannel signal using a side gain and a residual gain

Patent number: 12243541

Abstract: An apparatus for encoding a multi-channel signal having at least two channels, has: a downmixer for calculating a downmix signal from the multi-channel signal; a parameter calculator for calculating a side gain from a first channel of the at least two channels and a second channel of the at least two channels and for calculating a residual gain from the first channel and the second channel; and an output interface for generating an output signal, the output signal having information on the downmix signal, and on the side gain and the residual gain.
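As a rough illustration of the three components named in this abstract (downmixer, parameter calculator, output), the Python sketch below computes a mid downmix, a side gain, and a residual gain for one stereo frame. The abstract gives no formulas, so the definitions used here are assumptions: the side gain is taken as the projection of the side signal onto the downmix, and the residual gain as the energy ratio of what that projection leaves over. The function names are hypothetical.

```python
import math


def dot(a, b):
    """Inner product of two equal-length sample sequences."""
    return sum(x * y for x, y in zip(a, b))


def encode_gains(left, right):
    """Return (downmix, side_gain, residual_gain) for one frame.

    Assumed definitions, not taken from the patent text:
      downmix       = (left + right) / 2
      side          = (left - right) / 2
      side_gain     = <side, downmix> / <downmix, downmix>
      residual_gain = ||side - side_gain * downmix|| / ||downmix||
    """
    mid = [(l + r) / 2.0 for l, r in zip(left, right)]
    side = [(l - r) / 2.0 for l, r in zip(left, right)]
    energy = dot(mid, mid)
    if energy == 0.0:
        # Silent downmix: no meaningful gains can be derived.
        return mid, 0.0, 0.0
    side_gain = dot(side, mid) / energy
    residual = [s - side_gain * m for s, m in zip(side, mid)]
    residual_gain = math.sqrt(dot(residual, residual) / energy)
    return mid, side_gain, residual_gain


# Identical channels: the side signal vanishes, so both gains are zero.
_, g_same, r_same = encode_gains([1.0, 2.0, 3.0], [1.0, 2.0, 3.0])
# Signal only in the left channel: the side signal equals the downmix.
_, g_left, r_left = encode_gains([2.0, 4.0], [0.0, 0.0])
```

Under these assumed definitions, the side gain captures the part of the side signal that can be predicted from the downmix, and the residual gain summarizes how much energy the prediction misses.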

Type: Grant

Filed: August 10, 2022

Date of Patent: March 4, 2025

Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.

Inventors: Jan Buethe, Guillaume Fuchs, Wolfgang Jaegers, Franz Reutelhuber, Juergen Herre, Eleni Fotopoulou, Markus Multrus, Srikanth Korse

Method for processing an audio signal, signal processing unit, binaural renderer, audio encoder and audio decoder

Patent number: 12238508

Abstract: A method for processing an audio signal in accordance with a room impulse response is described. The audio signal is processed with an early part of the room impulse response separate from a late reverberation of the room impulse response, wherein the processing of the late reverberation includes generating a scaled reverberated signal, the scaling being dependent on the audio signal. The processed early part of the audio signal and the scaled reverberated signal are combined.

Type: Grant

Filed: January 22, 2024

Date of Patent: February 25, 2025

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Simone Neukam, Jan Plogsties

Using structured audio output to detect playback and/or to adapt to misaligned playback in wireless speakers

Patent number: 12236951

Abstract: Implementations are directed to determining an audio delay, of a computing device, by causing an audio data stream to be transmitted to the computing device via a wireless communication channel. The computing device causes audio output generated using the audio data stream to be rendered via speaker(s). The rendered audio output is captured via microphone(s), and the audio delay is determined by comparing the captured audio output with the audio data stream. A delay audio segment can be appended to an additional audio data stream transmitted to the computing device, where the length of the delay audio segment is determined using the audio delay. A noise reduction technique can additionally or alternatively be adapted based on the audio delay. Implementations are additionally or alternatively directed to determining whether an audio data stream, transmitted to a computing device for rendering through speaker(s) driven by the computing device, is actually being rendered.
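One common way to compare a captured signal with the transmitted stream is a cross-correlation search over candidate lags. The Python sketch below uses that approach as an assumption, since the abstract does not name the comparison technique. The function names, the `max_lag` parameter, and the choice of silence for the delay segment are all hypothetical.

```python
def estimate_delay(reference, captured, max_lag):
    """Return the lag (in samples) at which `captured` best matches `reference`.

    Assumption: plain cross-correlation over lags 0..max_lag; the abstract
    only says the two signals are compared.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(max_lag + 1):
        # Correlate the reference with the capture shifted back by `lag`.
        score = sum(r * c for r, c in zip(reference, captured[lag:]))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def delay_segment(delay_samples):
    """Build a delay audio segment whose length equals the measured delay.

    Assumption: the segment is silence.
    """
    return [0.0] * delay_samples


# Hypothetical capture: the microphone hears the stream 3 samples late.
stream = [0.0, 1.0, 2.0, 1.0, 0.0, -1.0, -2.0, -1.0]
captured = [0.0, 0.0, 0.0] + stream
delay = estimate_delay(stream, captured, max_lag=5)
segment = delay_segment(delay)
```

A real system would also have to cope with noise, speaker coloration, and sample-rate mismatch, none of which this sketch attempts to model.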

Type: Grant

Filed: August 14, 2023

Date of Patent: February 25, 2025

Assignee: GOOGLE LLC

Inventors: Nathaniel Nesiba, Xiang Cao

Post-parse semantic analyzer

Patent number: 12229516

Abstract: A data processing system receives a natural language communication including an ordered sequence of a plurality of word spellings of a natural human language. A processor of the data processing system parses the plurality of word spellings of the natural language communication utilizing constraint-based parsing to identify a plurality of satisfied parsing constraints. The processor performs semantic analysis based on the plurality of satisfied parsing constraints. Performing semantic analysis includes obtaining at least mid-level comprehension of the natural language communication by identifying in the natural language communication utilizing constraints at least one of the following set: a clausal structure within the natural language communication, a sentence structure of a sentence in the natural language communication, an implied topic of the natural language communication, and a classical linguistic role in the natural language communication.

Type: Grant

Filed: June 19, 2023

Date of Patent: February 18, 2025

Inventor: Thomas A. Visel

Centrally controlling communication at a venue

Patent number: 12229471

Abstract: One example may include a method that includes initiating an audio recording to capture audio data, comparing the audio data received from a microphone of a mobile device to an audio data range, determining whether the audio data is above an optimal level based on a result of the comparison, and queuing the audio data in an audio data queue when the audio data is above the optimal level.
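The compare-then-queue flow in this abstract can be sketched in a few lines of Python. The abstract does not say how the level of the audio data is measured, so the RMS measure, the threshold value, and the names `rms_level` and `maybe_queue` are assumptions chosen for illustration.

```python
import math
from collections import deque


def rms_level(samples):
    """Root-mean-square level of a block of samples (assumed level measure)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def maybe_queue(samples, audio_queue, optimal_level=0.1):
    """Queue the block only when its level is above the optimal level.

    `optimal_level` is a hypothetical threshold, not a value from the patent.
    Returns True when the block was queued.
    """
    if rms_level(samples) > optimal_level:
        audio_queue.append(samples)
        return True
    return False


audio_queue = deque()
loud_queued = maybe_queue([0.5, -0.5, 0.5, -0.5], audio_queue)
quiet_queued = maybe_queue([0.01, -0.01, 0.01, -0.01], audio_queue)
```

Only the louder block ends up in the queue; the quieter one is dropped without being stored.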

Type: Grant

Filed: August 30, 2022

Date of Patent: February 18, 2025

Assignee: Biamp Systems, LLC

Inventors: Nicholas William Metzar, Richard S. Juszkiewicz, Matthew V. Kotvis, Jason E. Damori

Stereo signal processing method and apparatus

Patent number: 12230283

Abstract: A stereo signal processing method includes performing delay estimation on a stereo signal of a current frame to determine an inter-channel time difference of the current frame, identifying that a sign of the inter-channel time difference of the current frame is different from a sign of an inter-channel time difference of a previous frame of the current frame, performing delay alignment processing on the first-channel signal of the current frame based on the inter-channel time difference of the current frame, and performing delay alignment processing on the second-channel signal of the current frame based on the inter-channel time difference of the previous frame.

Type: Grant

Filed: August 14, 2023

Date of Patent: February 18, 2025

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Eyal Shlomot, Haiting Li, Lei Miao

Method and apparatus for event extraction and extraction model training, device and medium

Patent number: 12223268

Abstract: A method for event extraction according to the disclosure includes: processing an object text using a preset extraction model to determine event information of the object text; wherein the event information includes an event element, and an event type and a role corresponding to the event element; and the extraction model includes a classification layer and an output layer; the classification layer is configured to determine a token attribute of a token in the object text; the token attribute includes whether the token is a start token of the event element of any event type and any role, and whether the token is an end token of the event element of any event type and any role; and the output layer is configured to determine the event element according to the token attribute of the token, and determine the event type and the role corresponding to the event element.

Type: Grant

Filed: September 28, 2020

Date of Patent: February 11, 2025

Assignee: BOE TECHNOLOGY GROUP CO., LTD.

Inventors: Bingqian Wang, Shaoxun Su, Tianxin Liang
