Patents Examined by Leshui Zhang
-
Patent number: 12374342
Abstract: A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to render a plurality of decoded audio signals, which are obtained on the basis of the encoded representation, in dependence on one or more rendering parameters, to obtain a plurality of rendered audio signals. The multi-channel audio decoder is configured to derive one or more decorrelated audio signals from the rendered audio signals, and to combine the rendered audio signals, or a scaled version thereof, with the one or more decorrelated audio signals, to obtain the output audio signals. A multi-channel audio encoder provides a decorrelation method parameter to control an audio decoder.
Type: Grant
Filed: August 9, 2018
Date of Patent: July 29, 2025
Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
Inventors: Sascha Disch, Harald Fuchs, Oliver Hellmuth, Juergen Herre, Adrian Murtaza, Jouni Paulus, Falko Ridderbusch, V, Leon Terentiv
-
Patent number: 12361953
Abstract: This disclosure provides an encoding method and an encoder for a multi-channel signal. The encoding method includes: obtaining a first ITD of a current frame of a multi-channel signal that includes an initial left channel signal and an initial right channel signal; obtaining a second ITD of the current frame based on the first ITD and a third ITD of a previous frame of the multi-channel signal; performing delay alignment on the initial left channel signal and the initial right channel signal based on the second ITD, to obtain an aligned left channel signal and an aligned right channel signal; and encoding the aligned left channel signal and the aligned right channel signal.
Type: Grant
Filed: July 12, 2023
Date of Patent: July 15, 2025
Assignee: Huawei Technologies Co., Ltd.
Inventors: Eyal Shlomot, Haiting Li, Bin Wang
-
Patent number: 12347422
Abstract: Systems and methods for indicating communication effectiveness with air traffic control (ATC) are disclosed. The method includes: receiving a transcribed message containing a plurality of words used by an ownship flight crew member in a communication directed to ATC; determining a message intent of the transcribed message from the words used in the communication; identifying a plurality of ideal words that should be used for an ideal message having the same message intent as the transcribed message; comparing the words used in the communication with the words that should have been used in the ideal message; determining based on the comparing whether the words used in the communication conformed to ATC standard phraseology (e.g., ICAO Pilot communication vocabulary); generating an indicator for flight crew that indicates whether the words used in the communication conformed to ATC standard phraseology; and signaling an aircraft display device to display the indicator.
Type: Grant
Filed: January 12, 2022
Date of Patent: July 1, 2025
Assignee: HONEYWELL INTERNATIONAL INC.
Inventors: Naveen Venkatesh Prasad Nama, Chaya Garg, Vasantha Paulraj, Gobinathan Baladhandapani, Hariharan Saptharishi, Sivakumar Kanagarajan
-
Patent number: 12340819
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
Type: Grant
Filed: December 16, 2024
Date of Patent: June 24, 2025
Assignee: DOLBY INTERNATIONAL AB
Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
-
Patent number: 12340309
Abstract: Disclosed are a system, method and apparatus to generate service codes based, at least in part, on electronic documents.
Type: Grant
Filed: May 23, 2022
Date of Patent: June 24, 2025
Assignee: Akasa, Inc.
Inventors: Byung-Hak Kim, Hariraam Varun Ganapathi
-
Patent number: 12334102
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
Type: Grant
Filed: December 16, 2024
Date of Patent: June 17, 2025
Assignee: DOLBY INTERNATIONAL AB
Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
-
Patent number: 12334096
Abstract: An electronic device includes a microphone, an array of coherent optical emitters, an array of balanced coherent optical vibration sensors, and a processor. Each balanced coherent optical vibration sensor in the array of balanced coherent optical vibration sensors is paired with a coherent optical emitter in the array of coherent optical emitters. The processor is configured to analyze a set of waveforms acquired by the array of balanced coherent optical vibration sensors; identify, using the analysis of the set of waveforms, a set of one or more voices in a field of view; and adjust an output of the microphone to accentuate a particular voice in the set of one or more voices.
Type: Grant
Filed: October 3, 2023
Date of Patent: June 17, 2025
Assignee: Apple Inc.
Inventors: Eran Tal, Ariel Lipson
-
Patent number: 12333266
Abstract: A training method, a text translation method, an electronic device, and a storage medium, which relate to a field of artificial intelligence, in particular to fields of natural language processing and deep learning technologies.
Type: Grant
Filed: November 8, 2022
Date of Patent: June 17, 2025
Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Inventors: Xiyang Wang, Ruiqing Zhang, Zhongjun He, Zhi Li, Hua Wu
-
Patent number: 12327567
Abstract: This application discloses a downmixed signal calculation method and apparatus. The method includes: when a current frame or a previous frame of the current frame of a stereo signal is not a switching frame and a residual signal in the current frame or the previous frame does not need to be encoded, obtaining a second downmixed signal in the current frame and a downmix compensation factor of the current frame, correcting the second downmixed signal in the current frame based on the downmix compensation factor of the current frame, to obtain the first downmixed signal in the current frame and determining the first downmixed signal in the current frame as a downmixed signal in the current frame in a preset frequency band.
Type: Grant
Filed: November 29, 2023
Date of Patent: June 10, 2025
Assignee: Huawei Technologies Co., Ltd.
Inventors: Haiting Li, Zexin Liu, Bin Wang
-
Patent number: 12300254
Abstract: In multichannel audio coding, improved computational efficiency is achieved by computing comparison parameters for ITD compensation between any two channels in the frequency domain for a parametric audio encoder. This may mitigate negative effects on encoder parameter estimates.
Type: Grant
Filed: September 8, 2023
Date of Patent: May 13, 2025
Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
Inventors: Jan Büthe, Eleni Fotopoulou, Srikanth Korse, Pallavi Maben, Markus Multrus, Franz Reutelhuber
-
Patent number: 12300263
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
Type: Grant
Filed: December 16, 2024
Date of Patent: May 13, 2025
Assignee: DOLBY INTERNATIONAL AB
Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
-
Patent number: 12271701
Abstract: This disclosure relates to a method and an apparatus for training a text classification model. The method may include determining a semantic representation of the training sample using the text classification model and determining a predicted classification result of the training sample based on the semantic representation. The method may further include generating an adversarial sample corresponding to the training sample based on the training sample and perturbation information and determining a semantic representation of the adversarial sample corresponding to the training sample using the text classification model.
Type: Grant
Filed: September 20, 2022
Date of Patent: April 8, 2025
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventors: Yao Qiu, Jinchao Zhang, Jie Zhou, Cheng Niu
-
Patent number: 12248754
Abstract: Database systems and methods are provided for assigning structural metadata to records and creating automations using the structural metadata. One method of assigning structural metadata to a record associated with a conversation involves obtaining a plurality of utterances associated with the conversation, identifying, from among the plurality of utterances, a representative utterance for semantic content of the conversation, assigning the conversation to a group of semantically similar conversations based on the representative utterance, and automatically updating the record associated with the conversation at a database system to include metadata identifying the group of semantically similar conversations.
Type: Grant
Filed: September 19, 2022
Date of Patent: March 11, 2025
Inventors: Yixin Mao, Zachary Alexander, Tian Xie, Wenhao Liu
-
Patent number: 12243541
Abstract: An apparatus for encoding a multi-channel signal having at least two channels, has: a downmixer for calculating a downmix signal from the multi-channel signal; a parameter calculator for calculating a side gain from a first channel of the at least two channels and a second channel of the at least two channels and for calculating a residual gain from the first channel and the second channel; and an output interface for generating an output signal, the output signal having information on the downmix signal, and on the side gain and the residual gain.
Type: Grant
Filed: August 10, 2022
Date of Patent: March 4, 2025
Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
Inventors: Jan Buethe, Guillaume Fuchs, Wolfgang Jaegers, Franz Reutelhuber, Juergen Herre, Eleni Fotopoulou, Markus Multrus, Srikanth Korse
-
Patent number: 12238508
Abstract: A method for processing an audio signal in accordance with a room impulse response is described. The audio signal is processed with an early part of the room impulse response separate from a late reverberation of the room impulse response, wherein the processing of the late reverberation includes generating a scaled reverberated signal, the scaling being dependent on the audio signal. The processed early part of the audio signal and the scaled reverberated signal are combined.
Type: Grant
Filed: January 22, 2024
Date of Patent: February 25, 2025
Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
Inventors: Simone Neukam, Jan Plogsties
-
Patent number: 12236951
Abstract: Implementations are directed to determining an audio delay, of a computing device, by causing an audio data stream to be transmitted to the computing device via a wireless communication channel. The computing device causes audio output generated using the audio data stream to be rendered via speaker(s). The rendered audio output is captured via microphone(s), and the audio delay determined by comparing the captured audio output with the audio data stream. A delay audio segment can be appended to an additional audio data stream transmitted to the computing device, where the length of the delay audio segment is determined using the audio delay. A noise reduction technique can additionally or alternatively be adapted based on the audio delay. Implementations are additionally or alternatively directed to determining if an audio data stream, transmitted to a computing device for rendering through speaker(s) driven by the computing device, is actually being rendered.
Type: Grant
Filed: August 14, 2023
Date of Patent: February 25, 2025
Assignee: GOOGLE LLC
Inventors: Nathaniel Nesiba, Xiang Cao
-
Patent number: 12229516
Abstract: A data processing system receives a natural language communication including an ordered sequence of a plurality of word spellings of a natural human language. A processor of the data processing system parses the plurality of word spellings of the natural language communication utilizing constraint-based parsing to identify a plurality of satisfied parsing constraints. The processor performs semantic analysis based on the plurality of satisfied parsing constraints. Performing semantic analysis includes obtaining at least mid-level comprehension of the natural language communication by identifying in the natural language communication utilizing constraints at least one of the following set: a clausal structure within the natural language communication, a sentence structure of a sentence in the natural language communication, an implied topic of the natural language communication, and a classical linguistic role in the natural language communication.
Type: Grant
Filed: June 19, 2023
Date of Patent: February 18, 2025
Inventor: Thomas A. Visel
-
Patent number: 12229471
Abstract: One example may include a method that includes initiating an audio recording to capture audio data, comparing the audio data received from a microphone of a mobile device to an audio data range, determining whether the audio data is above an optimal level based on a result of the comparison, and queuing the audio data in an audio data queue when the audio data is above the optimal level.
Type: Grant
Filed: August 30, 2022
Date of Patent: February 18, 2025
Assignee: Biamp Systems, LLC
Inventors: Nicholas William Metzar, Richard S. Juszkiewicz, Matthew V. Kotvis, Jason E. Damori
-
Patent number: 12230283
Abstract: A stereo signal processing method includes performing delay estimation on a stereo signal of a current frame to determine an inter-channel time difference of the current frame, identifying that a sign of the inter-channel time difference of the current frame is different from a sign of an inter-channel time difference of a previous frame of the current frame, performing delay alignment processing on the first-channel signal of the current frame based on the inter-channel time difference of the current frame, and performing delay alignment processing on the second-channel signal of the current frame based on the inter-channel time difference of the previous frame.
Type: Grant
Filed: August 14, 2023
Date of Patent: February 18, 2025
Assignee: HUAWEI TECHNOLOGIES CO., LTD.
Inventors: Eyal Shlomot, Haiting Li, Lei Miao
-
Patent number: 12223268
Abstract: A method for event extraction according to the disclosure includes: processing an object text using a preset extraction model to determine event information of the object text; wherein the event information includes an event element, and an event type and a role corresponding to the event element; and the extraction model includes a classification layer and an output layer; the classification layer is configured to determine a token attribute of a token in the object text; the token attribute includes whether the token is a start token of the event element of any event type and any role, and whether the token is an end token of the event element of any event type and any role; and the output layer is configured to determine the event element according to the token attribute of the token, and determine the event type and the role corresponding to the event element.
Type: Grant
Filed: September 28, 2020
Date of Patent: February 11, 2025
Assignee: BOE TECHNOLOGY GROUP CO., LTD.
Inventors: Bingqian Wang, Shaoxun Su, Tianxin Liang