Variable Rate Or Variable Quality Codecs, E.g., Scalable Representation Encoding, Etc. (epo) Patents (Class 704/E19.044)

Information processing device and method, and program

Patent number: 12198704

Abstract: The present technology relates to an information processing device and method and a program that make it possible to reduce the total number of objects while the influence on the sound quality is suppressed. The information processing device includes a pass-through object selection unit configured to acquire data of L objects and select, from the L objects, M pass-through objects whose data is to be outputted as it is, and an object generation unit configured to generate, on the basis of the data of multiple non-pass-through objects that are not the pass-through objects among the L objects, the data of N new objects, N being smaller than (L−M). The present technology can be applied to an information processing device.

Type: Grant

Filed: November 6, 2019

Date of Patent: January 14, 2025

Assignee: Sony Group Corporation

Inventors: Yuki Yamamoto, Toru Chinen, Minoru Tsuji, Yoshiaki Oikawa
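
The object-count reduction in the abstract of patent 12198704 can be illustrated with a minimal sketch. The abstract does not say how pass-through objects are chosen or how the new objects are generated, so both rules here are assumptions made only for illustration: objects are ranked by signal energy, and the non-pass-through objects are mixed round-robin into N new objects. The function name `reduce_objects` is hypothetical.

```python
def reduce_objects(objects, m, n):
    """Reduce L audio objects to M pass-through objects plus N new objects.

    objects: list of L equal-length lists of samples.
    Returns (pass_through, new_objects), so M + N objects in total.
    """
    l = len(objects)
    if not (0 <= m < l and 0 < n < l - m):
        raise ValueError("need 0 <= M < L and 0 < N < L - M")

    # Assumption: the highest-energy objects are the ones passed through.
    energy = [sum(s * s for s in obj) for obj in objects]
    order = sorted(range(l), key=lambda i: energy[i], reverse=True)
    keep = sorted(order[:m])   # indices of pass-through objects
    rest = sorted(order[m:])   # indices of non-pass-through objects

    # Pass-through data is output unchanged.
    pass_through = [objects[i] for i in keep]

    # Assumption: each remaining object is summed into one of N new objects.
    length = len(objects[0])
    new_objects = [[0.0] * length for _ in range(n)]
    for k, i in enumerate(rest):
        target = new_objects[k % n]
        for t, sample in enumerate(objects[i]):
            target[t] += sample
    return pass_through, new_objects
```

With L = 5, M = 2 and N = 2, the output carries 4 objects instead of 5, and the condition N < L − M (2 < 3) holds.
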
Encoding method and apparatus

Patent number: 12198703

Abstract: An audio signal encoding method and device are provided. The method and device are used to encode an audio signal to obtain a bitstream representing the analog audio signal, in which a proper bit allocation for spectral coefficients can be performed.

Type: Grant

Filed: February 16, 2022

Date of Patent: January 14, 2025

Assignee: Top Quality Telephony, LLC

Inventors: Zexin Liu, Bin Wang, Lei Miao
Audio coder window and transform implementations

Patent number: 12175989

Abstract: An audio signal encoding method is provided comprising: receiving first and second audio signal frames; processing a second portion of the first audio signal frame and a first portion of the second audio signal frame using an orthogonal transformation to determine in part a first intermediate encoding result; and processing the first intermediate encoding result using an orthogonal transformation to determine a set of spectral coefficients that corresponds to at least a portion of the first audio signal frame.

Type: Grant

Filed: September 19, 2023

Date of Patent: December 24, 2024

Assignee: DTS, Inc.

Inventors: Michael M. Goodwin, Antonius Kalker, Albert Chau
Device to process sample using a time-windowed transform function to generate spectral data and to use combined magnitude and phase spectrograms

Patent number: 12104955

Abstract: A method of signal processing includes receiving samples of a signal and processing the samples using a time-windowed transform function to generate spectral data corresponding to each time window. The method includes generating first spectrogram data based on magnitudes of the spectral data and generating second spectrogram data based on phase differences of the spectral data. The method further includes combining the first spectrogram data and the second spectrogram data to generate a combined spectrogram and processing the combined spectrogram to generate output.

Type: Grant

Filed: February 4, 2022

Date of Patent: October 1, 2024

Assignee: THE BOEING COMPANY

Inventors: Ely V. Soto, Aleksandar Jovancevic
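
The first steps in the abstract of patent 12104955 (windowed transform, magnitude spectrogram, phase-difference spectrogram, combination) can be sketched as follows. This is an illustration under assumptions, not the patented method: the Hann window, the plain DFT, the choice to take phase differences between consecutive windows, and the pairing of the two values per bin are all choices made here. The final step of the abstract, processing the combined spectrogram to generate output, is not shown.

```python
import cmath
import math

def wrap_phase(angle):
    """Wrap an angle in radians to the interval [-pi, pi)."""
    return (angle + math.pi) % (2 * math.pi) - math.pi

def windowed_spectra(samples, win_len, hop):
    """Hann-windowed DFT of each time window (non-negative frequency bins only)."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / win_len) for n in range(win_len)]
    spectra = []
    for start in range(0, len(samples) - win_len + 1, hop):
        seg = [samples[start + n] * window[n] for n in range(win_len)]
        bins = [
            sum(seg[n] * cmath.exp(-2j * math.pi * k * n / win_len) for n in range(win_len))
            for k in range(win_len // 2 + 1)
        ]
        spectra.append(bins)
    return spectra

def combined_spectrogram(samples, win_len=64, hop=32):
    """Return, per window and per bin, a (magnitude, phase_difference) pair."""
    spectra = windowed_spectra(samples, win_len, hop)
    if not spectra:
        raise ValueError("signal is shorter than one window")
    combined = []
    prev_phase = None
    for bins in spectra:
        phase = [cmath.phase(c) for c in bins]
        if prev_phase is None:
            # Assumption: the first window has no predecessor, so use zero.
            dphi = [0.0] * len(bins)
        else:
            dphi = [wrap_phase(p - q) for p, q in zip(phase, prev_phase)]
        combined.append([(abs(c), d) for c, d in zip(bins, dphi)])
        prev_phase = phase
    return combined
```

For a sinusoid with a whole number of cycles per hop, the phase difference at its frequency bin stays near zero from window to window, while the magnitude peaks at that bin.
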
Microcontroller for recording and storing physiological data

Patent number: 12089964

Abstract: A microcontroller for recording and storing physiological data includes an analog-to-digital converter for converting analog physiological sensor signals to digital signals, a sample buffer for holding a temporal sequence of the digital signals, a central processing unit (CPU), and a non-volatile memory. The non-volatile memory includes (i) a code storage encoding machine-readable data compression instructions that, when executed by the CPU, control the CPU to (a) transform the temporal sequence of the digital signals to produce transformed physiological data characterized by a set of transformation coefficients and (b) compress the set of transformation coefficients to generate compressed physiological data, and (ii) a data storage configured to contain several different instances of the compressed physiological data respectively associated with several different instances of the temporal sequence of the digital signals.

Type: Grant

Filed: August 23, 2019

Date of Patent: September 17, 2024

Inventors: Ryan Halter, Gunnar C. Pope
Concept for combined dynamic range compression and guided clipping prevention for audio devices

Patent number: 12051432

Abstract: The invention provides a concept for combined dynamic range compression and guided clipping prevention for audio devices. An audio decoder for decoding an audio bitstream and a metadata bitstream related to the audio bitstream according to the concept includes an audio processing chain including a plurality of adjustment stages including a dynamic range control stage for adjusting a dynamic range of the audio output signal and a guided clipping prevention stage for preventing clipping of the audio output signal; and a metadata decoder configured to receive the metadata bitstream and to extract dynamic range control gain sequences and guided clipping prevention gain sequences from the metadata bitstream, at least a part of the dynamic range control gain sequences being supplied to the dynamic range control stage, and at least a part of the guided clipping prevention gain sequences being supplied to the guided clipping prevention stage.

Type: Grant

Filed: December 29, 2022

Date of Patent: July 30, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Fabian Kuech, Christian Uhle, Michael Kratschmer, Bernhard Neugebauer, Michael Meier, Arne Borsum
Embedded networked deep learning for implanted medical devices

Patent number: 11969266

Abstract: A deep learning medical device implantable in a body is provided. The device includes a processing and communication unit and a sensing and actuation unit. The processing and communication unit includes a deep learning module including a neural network trained to process the input samples, received from the sensing and actuation unit, through a plurality of layers to classify physiological parameters and provide classification results. A communication interface in communication with the deep learning module receives the classification results for ultrasonic transmission through biological tissue. Methods of sensing and classifying physiological parameters of a body and methods of embedding deep learning into an implantable medical device are also provided.

Type: Grant

Filed: February 16, 2021

Date of Patent: April 30, 2024

Assignee: Northeastern University

Inventors: Daniel Uvaydov, Raffaele Guida, Francesco Restuccia, Tommaso Melodia
Audio coder window and transform implementations

Patent number: 11894004

Abstract: An audio signal encoding method is provided comprising: receiving first and second audio signal frames; processing a second portion of the first audio signal frame and a first portion of the second audio signal frame using an orthogonal transformation to determine in part a first intermediate encoding result; and processing the first intermediate encoding result using an orthogonal transformation to determine a set of spectral coefficients that corresponds to at least a portion of the first audio signal frame.

Type: Grant

Filed: November 13, 2020

Date of Patent: February 6, 2024

Assignee: DTS, Inc.

Inventors: Michael M. Goodwin, Antonius Kalker, Albert Chau
Method of encoding and decoding data transferred via a communications link

Patent number: 11855711

Abstract: A communication system uses multiple communications links, preferably links that use different communications media. The multiple communications links may include a high latency/high bandwidth link using a fiber-optic cable configured to carry large volumes of data but having a high latency. The communications links may also include a low latency/low bandwidth link implemented using skywave propagation of radio waves and configured to carry smaller volumes of triggering data with a lower latency across a substantial portion of the earth's surface. The triggering data may be sent in a data stream as data frames without headers, security information, or error checking codes. The two communications links may be used together to coordinate various activities such as the buying and selling of financial instruments.

Type: Grant

Filed: February 14, 2022

Date of Patent: December 26, 2023

Assignee: Skywave Networks LLC

Inventor: Kevin J. Babich
Selection and signaling of motion vector (MV) precisions

Patent number: 11843800

Abstract: An encoder comprising a processor configured to obtain candidate motion vectors (MVs) corresponding to neighboring blocks of a current block, the neighboring blocks neighboring the current block; obtain precisions of the candidate MVs; round the precisions to a target precision based on a rounding scheme; round the candidate MVs based on the target precision; perform pruning of the candidate MVs; generate a candidate list based on the rounding of the candidate MVs and the pruning; select one of the candidate MVs from the candidate list for encoding the current block; and encode an MV candidate index corresponding to the one of the candidate MVs that was selected in a bitstream.

Type: Grant

Filed: December 13, 2021

Date of Patent: December 12, 2023

Assignee: Huawei Technologies Co., Ltd.

Inventors: Shan Liu, Wei Wang
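
The candidate-list construction in the abstract of patent 11843800 (round candidate MVs to a target precision, prune, select, signal an index) can be sketched as below. Several details are assumptions made for illustration and are not taken from the patent: MVs are stored in quarter-pel units, a precision is expressed as a step size in those units (1 = quarter-pel, 2 = half-pel, 4 = integer-pel), the target precision is the coarsest one among the candidates, pruning removes exact duplicates, and selection minimizes the sum of absolute differences to the actual MV. All function names are hypothetical.

```python
def round_to_step(value, step):
    """Round an integer to the nearest multiple of step (ties away from zero)."""
    sign = -1 if value < 0 else 1
    return sign * ((abs(value) + step // 2) // step) * step

def build_candidate_list(candidates, max_size=5):
    """candidates: list of ((mv_x, mv_y), step) taken from neighboring blocks."""
    # Assumption: the target precision is the coarsest step among the candidates.
    target = max(step for _, step in candidates)
    rounded = [
        (round_to_step(x, target), round_to_step(y, target))
        for (x, y), _ in candidates
    ]
    # Pruning: drop candidates that became identical after rounding.
    pruned = []
    for mv in rounded:
        if mv not in pruned:
            pruned.append(mv)
    return target, pruned[:max_size]

def select_index(candidate_list, actual_mv):
    """Pick the index of the candidate closest to the actual MV of the block."""
    ax, ay = actual_mv
    return min(
        range(len(candidate_list)),
        key=lambda i: abs(candidate_list[i][0] - ax) + abs(candidate_list[i][1] - ay),
    )
```

Only the selected index would be written to the bitstream; the decoder is assumed to rebuild the same list from the same neighbors.
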
System and method for processing audio data into a plurality of frequency components

Patent number: 11823691

Abstract: An encoder operable to filter audio signals into a plurality of frequency band components, generate quantized digital components for each band, identify a potential for pre-echo events within the generated quantized digital components, generate an approximate signal by decoding the quantized digital components using inverse pulse code modulation, generate an error signal by comparing the approximate signal with the sampled audio signal, and process the error signal and quantized digital components. The encoder is operable to process the error signal by processing delayed audio signals and Q band values, determining the potential for pre-echo events from the Q band values, and determining scale factors and MDCT block sizes for the potential for pre-echo events.

Type: Grant

Filed: January 23, 2023

Date of Patent: November 21, 2023

Assignee: IMMERSION NETWORKS, INC.

Inventors: James David Johnston, Stephen Daniel White, King Wei Hor, Barry M. Genova
Encoder, decoder and method for encoding and decoding audio content using parameters for enhancing a concealment

Patent number: 11735196

Abstract: Described are an encoder for coding speech-like content and/or general audio content, wherein the encoder is configured to embed, at least in some frames, parameters in a bitstream, which parameters enhance a concealment in case an original frame is lost, corrupted or delayed, and a decoder for decoding speech-like content and/or general audio content, wherein the decoder is configured to use parameters which are sent later in time to enhance a concealment in case an original frame is lost, corrupted or delayed, as well as a method for encoding and a method for decoding.

Type: Grant

Filed: December 18, 2020

Date of Patent: August 22, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Jérémie Lecomte, Benjamin Schubert, Michael Schnabel, Martin Dietz
Content adaptive encoding

Patent number: 11729418

Abstract: Techniques are described for adaptively encoding different portions of media content based on content. Characteristics of GOPs of media content can be determined and used to set encoding parameters for the GOPs. The GOPs can be encoded such that one GOP is encoded differently than another GOP if they have different characteristics.

Type: Grant

Filed: July 7, 2021

Date of Patent: August 15, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Charles Benjamin Franklin Waggoner, Marc Joliveau, Srikanth Kiran Kotagiri, Yongjun Wu, Yang Yang
Apparatuses and methods for enhanced speech recognition in variable environments

Patent number: 11631421

Abstract: Systems, apparatuses, and methods are described to increase a signal-to-noise ratio difference between a main channel and reference channel. The increased signal-to-noise ratio difference is accomplished with an adaptive threshold for a desired voice activity detector (DVAD) and shaping filters. The DVAD includes averaging an output signal of a reference microphone channel to provide an estimated average background noise level. A threshold value is selected from a plurality of threshold values based on the estimated average background noise level. The threshold value is used to detect desired voice activity on a main microphone channel.

Type: Grant

Filed: October 18, 2015

Date of Patent: April 18, 2023

Assignee: SOLOS TECHNOLOGY LIMITED

Inventors: Dashen Fan, Xi Chen, Hua Bao
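
The adaptive-threshold idea in the abstract of patent 11631421 can be sketched as follows: average the reference channel to estimate the background noise level, pick one threshold from a set of threshold values according to that level, and apply it to the main channel. The table layout, the use of mean absolute amplitude as the level measure, and the direct comparison of the main-channel level against the threshold are assumptions for illustration; the shaping filters mentioned in the abstract are not shown.

```python
def mean_level(samples):
    """Mean absolute amplitude of a block of samples."""
    return sum(abs(s) for s in samples) / len(samples)

def select_threshold(noise_level, table):
    """table: (upper_noise_bound, threshold) pairs sorted by ascending bound."""
    for bound, threshold in table:
        if noise_level <= bound:
            return threshold
    raise ValueError("table must end with a bound of float('inf')")

def detect_voice(main, reference, table):
    """Return True if desired voice activity is detected on the main channel."""
    # The reference microphone is assumed to pick up mostly background noise.
    noise_level = mean_level(reference)
    threshold = select_threshold(noise_level, table)
    return mean_level(main) > threshold

# Hypothetical table: a louder background selects a higher threshold.
THRESHOLDS = [(0.01, 0.05), (0.1, 0.2), (float("inf"), 0.5)]
```

For example, a reference level of 0.02 selects the threshold 0.2, so a main-channel level of 0.3 counts as voice and a level of 0.1 does not.
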
Data preprocessing and data augmentation in frequency domain

Patent number: 11570477

Abstract: Methods and systems are provided for implementing preprocessing operations and augmentation operations upon image datasets transformed to frequency domain representations, including decoding images of an image dataset to generate a frequency domain representation of the image dataset; performing a resizing operation based on resizing factors on the image dataset in a frequency domain representation; performing a reshaping operation based on reshaping factors on the image dataset in a frequency domain representation; and performing a cropping operation on the image dataset in a frequency domain representation. The methods and systems may further include performing an augmentation operation on the image dataset in a frequency domain representation. Methods and systems of the present disclosure may free learning models from computational overhead caused by transforming image datasets into frequency domain representations.

Type: Grant

Filed: December 31, 2019

Date of Patent: January 31, 2023

Assignee: Alibaba Group Holding Limited

Inventors: Kai Xu, Fei Sun, Minghai Qin, Yen-kuang Chen
Concept for combined dynamic range compression and guided clipping prevention for audio devices

Patent number: 11551703

Abstract: The invention provides a concept for combined dynamic range compression and guided clipping prevention for audio devices. An audio decoder for decoding an audio bitstream and a metadata bitstream related to the audio bitstream according to the concept includes an audio processing chain including a plurality of adjustment stages including a dynamic range control stage for adjusting a dynamic range of the audio output signal and a guided clipping prevention stage for preventing clipping of the audio output signal; and a metadata decoder configured to receive the metadata bitstream and to extract dynamic range control gain sequences and guided clipping prevention gain sequences from the metadata bitstream, at least a part of the dynamic range control gain sequences being supplied to the dynamic range control stage, and at least a part of the guided clipping prevention gain sequences being supplied to the guided clipping prevention stage.

Type: Grant

Filed: February 11, 2021

Date of Patent: January 10, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Fabian Kuech, Christian Uhle, Michael Kratschmer, Bernhard Neugebauer, Michael Meier, Arne Borsum
Apparatus and method for selecting one of a first encoding algorithm and a second encoding algorithm

Patent number: 11521631

Abstract: An apparatus for selecting one of a first encoding algorithm having a first characteristic and a second encoding algorithm having a second characteristic for encoding a portion of an audio signal to obtain an encoded version of the portion of the audio signal has a first estimator for estimating a first quality measure for the portion of the audio signal, which is associated with the first encoding algorithm, without actually encoding and decoding the portion of the audio signal using the first encoding algorithm. A second estimator is provided for estimating a second quality measure for the portion of the audio signal, which is associated with the second encoding algorithm, without actually encoding and decoding the portion of the audio signal using the second encoding algorithm. The apparatus has a controller for selecting the first or second encoding algorithms based on a comparison between the first and second quality measures.

Type: Grant

Filed: March 31, 2020

Date of Patent: December 6, 2022

Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Emmanuel Ravelli, Stefan Doehla, Guillaume Fuchs, Eleni Fotopoulou, Christian Helmrich
Audio coding method based on spectral recovery scheme

Patent number: 11508386

Abstract: An inventive concept relates to an audio coding method to which CNN-based frequency spectrum recovery is applied. Only a part of the frequency spectral coefficients generated in transform coding is transmitted to a decoder, and the decoder recovers the frequency spectral coefficients that were not transmitted. Furthermore, the signs of the frequency spectral coefficients are transmitted from an encoder to the decoder depending on a sign transmission rule.

Type: Grant

Filed: April 8, 2020

Date of Patent: November 22, 2022

Assignees: Electronics and Telecommunications Research Institute, Kwangwoon University Industry-Academic Collaboration Foundation

Inventors: Hochong Park, Seung Kwon Beack, Jongmo Sung, Seong-Hyeon Shin, Mi Suk Lee, Tae Jin Lee, Jin Soo Choi
Encoding method, electronic device, and program product

Patent number: 11431353

Abstract: An encoding method includes: receiving configuration data related to encoding with a predetermined encoding mode; determining an encoding strategy based on the configuration data, wherein the encoding strategy includes parameters associated with encoding the data on an entity; and causing the data to be encoded on the entity based on the encoding strategy.

Type: Grant

Filed: May 17, 2021

Date of Patent: August 30, 2022

Assignee: EMC IP Holding Company LLC

Inventors: Zhenzhen Lin, Si Chen, Anzhou Hou
Hierarchical decorrelation of multichannel audio

Patent number: 11380342

Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.

Type: Grant

Filed: February 3, 2020

Date of Patent: July 5, 2022

Assignee: GOOGLE LLC

Inventors: Minyue Li, Willem Bastiaan Kleijn, Jan Skoglund
Parametric multi-channel audio representation

Patent number: 8498422

Abstract: Multi-channel audio signals are coded into a monaural audio signal and information that allows the multi-channel audio signal to be recovered from the monaural audio signal and the information. The information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multi-channel audio signal. The second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region. The information is multi-layered, enabling a scaling of the decoding quality versus bit rate.

Type: Grant

Filed: April 22, 2003

Date of Patent: July 30, 2013

Assignee: Koninklijke Philips N.V.

Inventors: Arnoldus Werner Johannes Oomen, Erik Gosuinus Petrus Schuijers, Dirk Jeroen Breebaart, Steven Leonardus Josephus Dimphina Elisabeth Van De Par
Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data

Patent number: 8046235

Abstract: An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.

Type: Grant

Filed: September 7, 2010

Date of Patent: October 25, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Miyoung Kim, Sangwook Kim, Dohyung Kim, Shihwa Lee, Junghoe Kim
TRANSMITTER AND RECEIVER FOR SPEECH CODING AND DECODING BY USING ADDITIONAL BIT ALLOCATION METHOD

Publication number: 20110040557

Abstract: The present invention relates to a transmitter and a receiver for speech coding and decoding by using an additional bit allocation method. The transmitter and the receiver according to the present invention realize a voice communication service of high quality by using additional bits permitted in system requirements while using a conventional speech coder as it is. In addition, the transmitter and the receiver according to the present invention have an advantage in that they enable insertion of additional quantization blocks while not changing the structure of the conventional standard speech coder, since they allocate additional bits by applying a multi-stage quantization procedure not in a speech signal domain but in a parameter domain.

Type: Application

Filed: October 29, 2010

Publication date: February 17, 2011

Applicant: Electronics and Telecommunications Research Institute

Inventors: Ho-Sang SUNG, Dae-Hwan Hwang, Dae-Hee Youn, Hong-Goo Kang, Young-Cheol Park, Ki-Seung Lee, Sung-Kyo Jung, Kyung-Tae Kim
Optimized multiple coding method

Patent number: 7792679

Abstract: The invention relates to the compression coding of digital signals such as multimedia signals (audio or video), and more particularly to a method for multiple coding, wherein several encoders, each comprising a series of functional blocks, receive an input signal in parallel. Accordingly, a method is provided in which a) the functional blocks forming each encoder are identified, along with one or several functions carried out by each block, b) functions which are common to various encoders are itemized, and c) said common functions are carried out once and for all, for at least a part of all of the encoders, within at least one same calculation module.

Type: Grant

Filed: November 24, 2004

Date of Patent: September 7, 2010

Assignee: France Telecom

Inventors: David Virette, Claude Lamblin, Abdellatif Benjelloun Touimi
SCALABLE LOSSLESS AUDIO CODEC AND AUTHORING TOOL

Publication number: 20100082352

Abstract: An audio codec losslessly encodes audio data into a sequence of analysis windows in a scalable bitstream. This is suitably done by separating the audio data into MSB and LSB portions and encoding each with a different lossless algorithm. An authoring tool compares the buffered payload to an allowed payload for each window and selectively scales the losslessly encoded audio data, suitably the LSB portion, in the non-conforming windows to reduce the encoded payload, hence buffered payload. This approach satisfies the media bit rate and buffer capacity constraints without having to filter the original audio data, reencode or otherwise disrupt the lossless bitstream.

Type: Application

Filed: November 5, 2009

Publication date: April 1, 2010

Inventor: Zoran Fejzo
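
The MSB/LSB separation described in the abstract of publication 20100082352 can be illustrated with a short sketch on integer PCM samples. It shows only the split, the lossless recombination, and one possible way of scaling the LSB portion; the lossless entropy coding of each portion and the authoring tool's buffer model are not shown. The function names, the fixed `lsb_bits` width, and the zeroing of low-order bits as the scaling operation are assumptions for illustration.

```python
def split_msb_lsb(samples, lsb_bits):
    """Split signed integer samples into an MSB part and an LSB part."""
    mask = (1 << lsb_bits) - 1
    msb = [s >> lsb_bits for s in samples]  # arithmetic shift keeps the sign
    lsb = [s & mask for s in samples]       # always in 0 .. 2**lsb_bits - 1
    return msb, lsb

def merge_msb_lsb(msb, lsb, lsb_bits):
    """Recombine the two parts; exact if the LSB part was not scaled."""
    return [(m << lsb_bits) | l for m, l in zip(msb, lsb)]

def scale_lsb(lsb, lsb_bits, drop_bits):
    """Assumed scaling step: zero the drop_bits lowest bits of the LSB part."""
    keep_mask = ((1 << lsb_bits) - 1) & ~((1 << drop_bits) - 1)
    return [l & keep_mask for l in lsb]
```

Under these assumptions, a window whose LSB part is left untouched decodes bit-exactly, while a window scaled with `drop_bits = 4` decodes with an error of less than 16 per sample and needs fewer bits for its LSB part.
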
LOW COMPLEXITY DECODER FOR COMPLEX TRANSFORM CODING OF MULTI-CHANNEL SOUND

Publication number: 20080319739

Abstract: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.

Type: Application

Filed: June 22, 2007

Publication date: December 25, 2008

Applicant: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Adaptive media encoding and decoding equipment

Patent number: 7437285

Abstract: An encoding unit encodes data at a first rate during an initial predetermined section of uncompressed data (a), and encodes data at a second rate after the initial predetermined section (where the first rate < the second rate), and the encoded data are stored in a storage. The stored data are read out from the storage to a network at a transmission rate equal to the transmission rate of the network. A decoding unit decodes received data at the first rate during an initial predetermined section, and decodes received data at the second rate after the initial predetermined section. For finite contents, receiving completion and decoding completion of the received data are simultaneous. In accordance with the present invention, while tolerance with respect to data incoming fluctuations is ensured, playback of dynamic images, music, and the like can be started immediately after receiving.

Type: Grant

Filed: May 1, 2002

Date of Patent: October 14, 2008

Assignee: KDDI Corporation

Inventors: Shigeyuki Sakazawa, Yasutoshi Watanabe, Yasuhiro Takishima, Masahiro Wada
METHODS AND APPARATUS FOR VOICE ACTIVITY DETECTION

Publication number: 20080133226

Abstract: Methods and apparatus for voice activity detection are disclosed.

Type: Application

Filed: September 20, 2007

Publication date: June 5, 2008

Applicant: Spreadtrum Communications Corporation

Inventors: Heyun Huang, Tan Li, Fu-Huei Lin
Scalable lossless audio codec and authoring tool

Publication number: 20080021712

Abstract: An audio codec losslessly encodes audio data into a sequence of analysis windows in a scalable bitstream. This is suitably done by separating the audio data into MSB and LSB portions and encoding each with a different lossless algorithm. An authoring tool compares the buffered payload to an allowed payload for each window and selectively scales the losslessly encoded audio data, suitably the LSB portion, in the non-conforming windows to reduce the encoded payload, hence buffered payload. This approach satisfies the media bit rate and buffer capacity constraints without having to filter the original audio data, reencode or otherwise disrupt the lossless bitstream.

Type: Application

Filed: August 14, 2007

Publication date: January 24, 2008

Inventor: Zoran Fejzo