Patents Examined by Bryan S Blankenagel
-
Patent number: 11966707
Abstract: A quantum-enhanced system and method for natural language processing (NLP) that generates a word embedding on a hybrid quantum-classical computer. A training set is provided on the classical computer, wherein the training set provides at least one pair of words and at least one binary value indicating the correlation between the pair of words. The quantum computer generates quantum state representations for each word in the pair. The quantum component evaluates the quantum correlation between the quantum state representations of the word pair using an engineered likelihood function and Bayesian inference. The word embedding is then trained on the quantum computer using an error function containing the binary value and the quantum correlation.
Type: Grant
Filed: January 13, 2022
Date of Patent: April 23, 2024
Assignee: Zapata Computing, Inc.
Inventor: Yudong Cao
-
Patent number: 11961528
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data, and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
Type: Grant
Filed: July 24, 2023
Date of Patent: April 16, 2024
Assignee: Dolby International AB
Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
-
Patent number: 11948550
Abstract: Techniques for real-time accent conversion are described herein. An example computing device receives an indication of a first accent and a second accent. The computing device further receives, via at least one microphone, speech content having the first accent. The computing device is configured to derive, using a first machine-learning algorithm trained with audio data including the first accent, a linguistic representation of the received speech content having the first accent. Based on the derived linguistic representation, the computing device is configured to synthesize, using a second machine-learning algorithm trained with (i) audio data comprising the first accent and (ii) audio data including the second accent, audio data representative of the received speech content having the second accent.
Type: Grant
Filed: August 27, 2021
Date of Patent: April 2, 2024
Assignee: SANAS.AI INC.
Inventors: Maxim Serebryakov, Shawn Zhang
-
Patent number: 11948577
Abstract: Certain aspects of the disclosure are directed to apparatuses and methods for analyzing digital voice data in a data-communication system. A specific aspect is directed to a data-communication apparatus that includes a data-communication server and processing circuitry in communication therewith. The data-communication server interfaces with a plurality of remotely-situated client entities for providing data communication services.
Type: Grant
Filed: February 28, 2019
Date of Patent: April 2, 2024
Assignee: 8x8, Inc.
Inventors: Zhishen Liu, Bryan R. Martin
-
Patent number: 11942100
Abstract: Techniques for encoding audio data with metadata are described. In an example, a device receives audio data corresponding to audio detected by a microphone and receives metadata associated with the audio. The device generates encoded data based at least in part on encoding the audio data with the metadata. The encoding involves replacing a portion of the audio data with the metadata, such that the encoded data includes the metadata and a remaining portion of the audio data. The device sends the encoded data to an audio processing application.
Type: Grant
Filed: April 4, 2022
Date of Patent: March 26, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Aditya Sharadchandra Joshi, Carlo Murgia, Michael Thomas Peterson
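The replace-a-portion encoding above can be sketched in a few lines. The layout below is hypothetical (the abstract does not say where in the frame the metadata goes or how its length is signaled); here the metadata is length-prefixed and overwrites the leading bytes of the frame:

```python
def embed_metadata(audio_bytes: bytes, metadata: bytes) -> bytes:
    """Replace the leading portion of an audio frame with length-prefixed
    metadata, keeping the remaining audio bytes (hypothetical layout)."""
    header = len(metadata).to_bytes(2, "big") + metadata
    if len(header) > len(audio_bytes):
        raise ValueError("metadata larger than audio frame")
    # Encoded frame = metadata header + remaining portion of the audio data.
    return header + audio_bytes[len(header):]

def extract_metadata(encoded: bytes) -> tuple[bytes, bytes]:
    """Recover the metadata and the remaining audio portion."""
    n = int.from_bytes(encoded[:2], "big")
    return encoded[2:2 + n], encoded[2 + n:]
```

Note that the frame length is unchanged, which is the point of replacing audio bytes rather than appending to them.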
-
Patent number: 11929085
Abstract: Described herein is a method of low-bitrate coding of audio data and generating enhancement metadata for controlling audio enhancement of the low-bitrate coded audio data at a decoder side, including the steps of: (a) core encoding original audio data at a low bitrate to obtain encoded audio data; (b) generating enhancement metadata to be used for controlling a type and/or amount of audio enhancement at the decoder side after core decoding the encoded audio data; and (c) outputting the encoded audio data and the enhancement metadata. Also described are an encoder configured to perform said method, a method for generating enhanced audio data from low-bitrate coded audio data based on enhancement metadata, and a decoder configured to perform that method.
Type: Grant
Filed: August 29, 2019
Date of Patent: March 12, 2024
Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
Inventors: Arijit Biswas, Jia Dai, Aaron Steven Master
-
Patent number: 11922933
Abstract: A voice processing method and device include: obtaining a probability value that an audio signal, representing sound collected by a first microphone on the near-end side, includes a person's voice; determining a gain of the audio signal based on the determined probability value; processing the audio signal based on the determined gain; and sending the processed audio signal to the far-end side.
Type: Grant
Filed: June 2, 2020
Date of Patent: March 5, 2024
Assignee: YAMAHA CORPORATION
Inventor: Tetsuto Kawai
-
Patent number: 11922963
Abstract: Systems and methods are provided for generating and operating a speech enhancement model optimized for generating noise-suppressed speech outputs for improved human listening and live captioning. A computing system obtains a speech enhancement model trained on a first training dataset to generate noise-suppressed speech outputs, and an automatic speech recognition model trained on a second training dataset to generate transcription labels for spoken language utterances. A third training dataset comprising a set of spoken language utterances is applied to the speech enhancement model to obtain a first noise-suppressed speech output, which is then applied to the automatic speech recognition model to generate a noise-suppressed transcription output for the set of spoken language utterances.
Type: Grant
Filed: May 26, 2021
Date of Patent: March 5, 2024
Assignee: Microsoft Technology Licensing, LLC
Inventors: Xiaofei Wang, Sefik Emre Eskimez, Min Tang, Hemin Yang, Zirun Zhu, Zhuo Chen, Huaming Wang, Takuya Yoshioka
-
Patent number: 11915714
Abstract: Methods for modifying audio data include operations for accessing audio data having a first prosody, receiving a target prosody differing from the first prosody, and computing acoustic features representing samples. Computing respective acoustic features for a sample includes computing a pitch feature as a quantized pitch value of the sample by assigning a pitch value, of the target prosody or the audio data, to at least one of a set of pitch bins having equal widths in cents. Computing the respective acoustic features further includes computing a periodicity feature from the audio data. The respective acoustic features for the sample include the pitch feature, the periodicity feature, and other acoustic features. A neural vocoder is applied to the acoustic features to pitch-shift and time-stretch the audio data from the first prosody toward the target prosody.
Type: Grant
Filed: December 21, 2021
Date of Patent: February 27, 2024
Assignees: Adobe Inc., Northwestern University
Inventors: Maxwell Morrison, Juan Pablo Caceres Chomali, Zeyu Jin, Nicholas Bryan, Bryan A. Pardo
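The equal-width-in-cents pitch binning can be illustrated with a short sketch. The range limits and bin count (`fmin_hz`, `fmax_hz`, `n_bins`) are assumptions for illustration; the abstract only requires that the bins have equal widths in cents:

```python
import numpy as np

def quantize_pitch_cents(f0_hz, fmin_hz=50.0, fmax_hz=550.0, n_bins=256):
    """Assign pitch values (Hz) to equal-width bins in cents.

    fmin_hz, fmax_hz, and n_bins are hypothetical parameter choices.
    """
    # Convert Hz to cents above the minimum pitch (1 octave = 1200 cents).
    cents = 1200.0 * np.log2(np.asarray(f0_hz, dtype=float) / fmin_hz)
    total_cents = 1200.0 * np.log2(fmax_hz / fmin_hz)
    bin_width = total_cents / n_bins
    # Clip out-of-range pitches and assign each sample to a bin index.
    return np.clip(np.floor(cents / bin_width), 0, n_bins - 1).astype(int)
```

Equal widths in cents make the quantization perceptually uniform: each bin covers the same musical interval rather than the same number of Hz.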
-
Patent number: 11892859
Abstract: A drone system is configured to capture an audio stream that includes voice commands from an operator, to process the audio stream for identification of the voice commands, and to perform operations based on the identified voice commands. The drone system can identify a particular voice stream in the audio stream as an operator voice, and perform the command recognition with respect to the operator voice to the exclusion of other voice streams present in the audio stream. The drone can include a directional camera that is automatically and continuously focused on the operator to capture a video stream usable in disambiguation of different voice streams captured by the drone.
Type: Grant
Filed: July 28, 2022
Date of Patent: February 6, 2024
Assignee: Snap Inc.
Inventors: David Meisenholder, Steven Horowitz
-
Patent number: 11887618
Abstract: A call audio mixing processing method is provided. In the method, call audio streams from terminals of call members participating in a call are obtained. Voice analysis is performed on the call audio streams to determine voice activity corresponding to each of the terminals. The voice activity of the terminals indicates the activity levels of the call members participating in the call. According to the voice activity of the terminals, respective voice adjustment parameters corresponding to the terminals are determined. According to the respective voice adjustment parameters corresponding to the terminals, the call audio streams of the terminals are adjusted. Further, mixing processing is performed on the adjusted call audio streams to obtain a mixed audio stream.
Type: Grant
Filed: April 18, 2022
Date of Patent: January 30, 2024
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventor: Junbin Liang
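A minimal sketch of that mixing flow, with assumed specifics: voice activity is approximated here by RMS energy and the adjustment parameter is a simple proportional gain, neither of which is prescribed by the abstract:

```python
import numpy as np

def mix_call_streams(streams):
    """streams: dict of terminal_id -> 1-D float array of call audio.

    Hypothetical sketch: activity = RMS energy, gain = proportional share.
    """
    # 1. Voice analysis: estimate per-terminal voice activity (RMS here).
    activity = {t: float(np.sqrt(np.mean(s ** 2))) for t, s in streams.items()}
    total = sum(activity.values()) or 1.0  # avoid divide-by-zero on silence
    # 2. Derive a voice adjustment parameter (gain) per terminal.
    gains = {t: a / total for t, a in activity.items()}
    # 3. Adjust each stream and 4. mix into a single stream.
    mixed = sum(gains[t] * s for t, s in streams.items())
    return mixed, gains
```

Weighting by activity keeps dominant talkers prominent in the mix while attenuating mostly-silent (or mostly-noise) terminals.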
-
Patent number: 11854556
Abstract: Methods and apparatus are disclosed for supplementing partially readable and/or inaccurate codes. An example apparatus includes a watermark analyzer to select a first watermark and a second watermark decoded from media; a comparator to compare a first decoded timestamp of the first watermark to a second decoded timestamp of the second watermark; and a timestamp adjuster to adjust the second decoded timestamp based on the first decoded timestamp when at least a threshold number of symbols of the second decoded timestamp match corresponding symbols of the first decoded timestamp.
Type: Grant
Filed: November 14, 2022
Date of Patent: December 26, 2023
Assignee: The Nielsen Company (US), LLC
Inventors: David Gish, Jeremey M. Davis, Wendell D. Lynch, Christen V. Nielsen
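The threshold-match repair can be sketched on symbol strings. The `?` marker for unreadable symbols and the fill-in rule are assumptions for illustration; the abstract does not specify either:

```python
def repair_timestamp(first_ts: str, second_ts: str, threshold: int = 4) -> str:
    """Supplement a partially readable watermark timestamp.

    first_ts / second_ts: equal-length strings of decoded symbols, with
    '?' marking unreadable symbols (hypothetical convention). If at least
    `threshold` readable symbols of second_ts agree with first_ts, the
    unreadable symbols are filled in from first_ts.
    """
    matches = sum(1 for a, b in zip(first_ts, second_ts)
                  if b != "?" and a == b)
    if matches < threshold:
        return second_ts  # not enough agreement; leave the code alone
    return "".join(a if b == "?" else b
                   for a, b in zip(first_ts, second_ts))
```

The threshold guards against borrowing symbols from a watermark that actually carries a different timestamp.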
-
Patent number: 11848021
Abstract: An envelope sequence is provided that can improve approximation accuracy near peaks caused by the pitch period of an audio signal. A periodic-combined-envelope-sequence generation device according to the present invention takes, as an input audio signal, a time-domain audio digital signal in each frame, which is a predetermined time segment, and generates a periodic combined envelope sequence as an envelope sequence. The periodic-combined-envelope-sequence generation device comprises at least a spectral-envelope-sequence calculating part and a periodic-combined-envelope generating part. The spectral-envelope-sequence calculating part calculates a spectral envelope sequence of the input audio signal on the basis of time-domain linear prediction of the input audio signal.
Type: Grant
Filed: September 29, 2022
Date of Patent: December 19, 2023
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
-
Patent number: 11837246
Abstract: The present invention relates to transposing signals in time and/or frequency, and in particular to coding of audio signals. More particularly, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La that extracts a frame of the input signal, and an analysis transformation unit of order M that transforms the samples into M complex coefficients, where M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls generating a frame of the output signal.
Type: Grant
Filed: February 3, 2023
Date of Patent: December 5, 2023
Assignee: DOLBY INTERNATIONAL AB
Inventors: Per Ekstrand, Lars Villemoes
-
Patent number: 11830514
Abstract: A vehicle infotainment system that adds background sounds to an outgoing call on a mobile device. The infotainment system comprises: i) a database of selectable augmenting audio signals; and ii) audio processing circuitry configured to receive at a first input an uplink signal from the infotainment system and receive at a second input a selected augmenting audio signal. The audio processing circuitry adapts the spectrum of the selected augmenting audio signal to prevent it from masking the uplink signal, and combines the adapted augmenting audio signal and the uplink signal to produce an augmented uplink signal at an output.
Type: Grant
Filed: May 27, 2021
Date of Patent: November 28, 2023
Assignee: GM GLOBAL TECHNOLOGY OPERATIONS LLC
Inventors: Omer Tsimhoni, Eli Tzirkel-Hancock
-
Patent number: 11829868
Abstract: A feature value generation device includes a generator configured to digitize non-numerical text data items collected at a plurality of timings from a target of anomaly detection, to generate vectors whose elements are feature values corresponding to the digitized data items; a learning unit configured to learn the vectors during a learning period so as to output a learning result; and a detector configured to detect, during a test period, for each of the vectors generated by the generator, an anomaly based on said each of the vectors and the learning result.
Type: Grant
Filed: October 31, 2017
Date of Patent: November 28, 2023
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Yasuhiro Ikeda, Yusuke Nakano, Keishiro Watanabe, Keisuke Ishibashi, Ryoichi Kawahara
-
Patent number: 11823692
Abstract: Methods, devices, non-transitory computer-readable media, and systems are described for compressing audio data. The techniques involve obtaining a sequence of digitized samples of an audio signal, performing a transform using the sequence of digitized samples to generate a plurality of spectral lines, obtaining a group of spectral lines from the plurality of spectral lines, and quantizing the group of spectral lines to generate a group of quantized values. Quantizing the group of spectral lines may comprise performing a specialized rounding operation on a spectral line selected from the group, using the specialized rounding operation to force a group parity value, computed for the group of quantized values, to a predetermined parity value. One or more data frames based on the group of quantized values may be outputted.
Type: Grant
Filed: May 25, 2022
Date of Patent: November 21, 2023
Assignee: QUALCOMM Incorporated
Inventors: Richard Turner, Megan Lucy Taggart, Laurent Wojcieszak, Justin Hundt
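The parity-forcing quantization can be sketched as follows. The choice of which spectral line to re-round (the one with the largest rounding error here, since it is cheapest to nudge) is an assumption; the abstract only says one selected line gets the specialized rounding:

```python
import numpy as np

def quantize_with_parity(lines, target_parity=0):
    """Quantize a group of spectral lines, forcing the group parity.

    Sketch: all lines are rounded normally; if the parity of the sum of
    quantized values differs from target_parity, the line with the
    largest rounding error is re-rounded the other way (assumed rule).
    """
    lines = np.asarray(lines, dtype=float)
    q = np.rint(lines).astype(int)
    if q.sum() % 2 != target_parity:
        err = lines - q
        i = int(np.argmax(np.abs(err)))      # cheapest line to nudge
        q[i] += 1 if err[i] > 0 else -1      # move toward the true value
    return q
```

Forcing a known parity onto each group lets the decoder use the parity as an implicit check bit without spending any extra bits in the frame.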
-
Patent number: 11817103
Abstract: Provided is a pattern recognition apparatus that provides classification robustness against any kind of domain variability. The pattern recognition apparatus 500, based on a Neural Network (NN), includes: NN training unit 501, which trains an NN model to generate NN parameters based on at least one first feature vector and at least one domain vector indicating one of the subsets in a specific domain, wherein the first feature vector is extracted from each of the subsets and the domain vector indicates an identifier corresponding to each of the subsets; and NN verification unit 502, which verifies a pair of second feature vectors in the specific domain to output whether the pair indicates the same individual or not, based on a target domain vector and the NN parameters.
Type: Grant
Filed: September 15, 2017
Date of Patent: November 14, 2023
Assignee: NEC CORPORATION
Inventors: Qiongqiong Wang, Takafumi Koshinaka
-
Patent number: 11817113
Abstract: To filter unwanted sounds from a conference call, a voice profile of a first user is generated based on a first voice signal captured by a media device during a first conference call. The voice profile may be generated by identifying a base frequency of the first voice signal and determining a plurality of voice characteristics, such as pitch, intonation, accent, loudness, and speech rate. These data may be stored in association with the first user. During a second conference call, a second voice signal captured by the media device is analyzed to determine, based on the voice profile of the first user, whether the second voice signal includes the voice of a second user. If so, the second voice signal is prevented from being transmitted into the conference call. A voice profile of the second user may be generated from the second voice signal for future use.
Type: Grant
Filed: September 9, 2020
Date of Patent: November 14, 2023
Assignee: Rovi Guides, Inc.
Inventors: Rajendran Pichaimurthy, Madhusudhan Seetharam
-
Patent number: 11816577
Abstract: Generally, the present disclosure is directed to systems and methods that generate augmented training data for machine-learned models via application of one or more augmentation techniques to audiographic images that visually represent audio signals. In particular, the present disclosure provides a number of novel augmentation operations which can be performed directly upon the audiographic image (e.g., as opposed to the raw audio data) to generate augmented training data that results in improved model performance. As an example, the audiographic images can be or include one or more spectrograms or filter bank sequences.
Type: Grant
Filed: September 28, 2021
Date of Patent: November 14, 2023
Assignee: GOOGLE LLC
Inventors: Daniel Sung-Joon Park, Quoc Le, William Chan, Ekin Dogus Cubuk, Barret Zoph, Yu Zhang, Chung-Cheng Chiu
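One family of operations that can be applied directly to the spectrogram image is masking, sketched below. The mask widths and the choice of masking as the example operation are assumptions for illustration; the abstract covers image-domain augmentation generally:

```python
import numpy as np

def augment_spectrogram(spec, max_freq_mask=8, max_time_mask=16, rng=None):
    """Apply simple masking augmentations directly to an audiographic
    image (a 2-D spectrogram: frequency rows x time columns).

    max_freq_mask / max_time_mask are hypothetical parameters.
    """
    rng = rng or np.random.default_rng(0)  # seeded default for repeatability
    out = spec.copy()
    n_freq, n_time = out.shape
    # Frequency mask: zero out a band of consecutive frequency rows.
    f = rng.integers(0, max_freq_mask + 1)
    f0 = rng.integers(0, n_freq - f + 1)
    out[f0:f0 + f, :] = 0.0
    # Time mask: zero out a span of consecutive time columns.
    t = rng.integers(0, max_time_mask + 1)
    t0 = rng.integers(0, n_time - t + 1)
    out[:, t0:t0 + t] = 0.0
    return out
```

Operating on the image rather than the raw waveform means the augmentation costs one array write per mask and needs no re-synthesis of audio.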