Variable Rate Or Variable Quality Codecs, E.g., Scalable Representation Encoding, Etc. (epo) Patents (Class 704/E19.044)
  • Patent number: 11969266
    Abstract: A deep learning medical device implantable in a body is provided. The device includes a processing and communication unit and a sensing and actuation unit. The processing and communication unit includes a deep learning module including a neural network trained to process the input samples, received from the sensing and actuation unit, through a plurality of layers to classify physiological parameters and provide classification results. A communication interface in communication with the deep learning module receives the classification results for ultrasonic transmission through biological tissue. Methods of sensing and classifying physiological parameters of a body and methods of embedding deep learning into an implantable medical device are also provided.
    Type: Grant
    Filed: February 16, 2021
    Date of Patent: April 30, 2024
    Assignee: Northeastern University
    Inventors: Daniel Uvaydov, Raffaele Guida, Francesco Restuccia, Tommaso Melodia
  • Patent number: 11894004
    Abstract: An audio signal encoding method is provided comprising: receiving first and second audio signal frames; processing a second portion of the first audio signal frame and a first portion of the second audio signal frame using an orthogonal transformation to determine in part a first intermediate encoding result; and processing the first intermediate encoding result using an orthogonal transformation to determine a set of spectral coefficients that corresponds to at least a portion of the first audio signal frame.
    Type: Grant
    Filed: November 13, 2020
    Date of Patent: February 6, 2024
    Assignee: DTS, Inc.
    Inventors: Michael M. Goodwin, Antonius Kalker, Albert Chau
  • Patent number: 11855711
    Abstract: A communication system uses multiple communications links, preferably links that use different communications media. The multiple communications links may include a high latency/high bandwidth link using a fiber-optic cable configured to carry large volumes of data but having a high latency. The communications links may also include a low latency/low bandwidth link implemented using skywave propagation of radio waves and configured to carry smaller volumes of triggering data with a lower latency across a substantial portion of the earth's surface. The triggering data may be sent in a data stream as data frames without headers, security information, or error checking codes. The two communications links may be used together to coordinate various activities such as the buying and selling of financial instruments.
    Type: Grant
    Filed: February 14, 2022
    Date of Patent: December 26, 2023
    Assignee: Skywave Networks LLC
    Inventor: Kevin J. Babich
  • Patent number: 11843800
    Abstract: An encoder comprising a processor configured to obtain candidate motion vectors (MVs) corresponding to neighboring blocks of a current block, the neighboring blocks neighboring the current block; obtain precisions of the candidate MVs; round the precisions to a target precision based on a rounding scheme; round the candidate MVs based on the target precision; perform pruning of the candidate MVs; generate a candidate list based on the rounding of the candidate MVs and the pruning; select one of the candidate MVs from the candidate list for encoding the current block; and encode an MV candidate index corresponding to the one of the candidate MVs that was selected in a bitstream.
    Type: Grant
    Filed: December 13, 2021
    Date of Patent: December 12, 2023
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Shan Liu, Wei Wang
  • Patent number: 11823691
    Abstract: An encoder operable to filter audio signals into a plurality of frequency band components, generate quantized digital components for each band, identify a potential for pre-echo events within the generated quantized digital components, generate an approximate signal by decoding the quantized digital components using inverse pulse code modulation, generate an error signal by comparing the approximate signal with the sampled audio signal, and process the error signal and quantized digital components. The encoder operable to process the error signal by processing delayed audio signals and Q band values, determining the potential for pre-echo events from the Q band values, and determining scale factors and MDCT block sizes for the potential for pre-echo events.
    Type: Grant
    Filed: January 23, 2023
    Date of Patent: November 21, 2023
    Assignee: IMMERSION NETWORKS, INC.
    Inventors: James David Johnston, Stephen Daniel White, King Wei Hor, Barry M. Genova
  • Patent number: 11735196
    Abstract: Described are an encoder for coding speech-like content and/or general audio content, wherein the encoder is configured to embed, at least in some frames, parameters in a bitstream, which parameters enhance a concealment in case an original frame is lost, corrupted or delayed, and a decoder for decoding speech-like content and/or general audio content, wherein the decoder is configured to use parameters which are sent later in time to enhance a concealment in case an original frame is lost, corrupted or delayed, as well as a method for encoding and a method for decoding.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: August 22, 2023
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Jérémie Lecomte, Benjamin Schubert, Michael Schnabel, Martin Dietz
  • Patent number: 11729418
    Abstract: Techniques are described for adaptive encoding different portions of media content based on content. Characteristics of GOPs of media content can be determined and used to set encoding parameters for the GOs. The GOPs can be encoded such that one GOP is encoded differently than another GOP if they have different characteristics.
    Type: Grant
    Filed: July 7, 2021
    Date of Patent: August 15, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Charles Benjamin Franklin Waggoner, Marc Joliveau, Srikanth Kiran Kotagiri, Yongjun Wu, Yang Yang
  • Patent number: 11631421
    Abstract: Systems, apparatuses, and methods are described to increase a signal-to-noise ratio difference between a main channel and reference channel. The increased signal-to-noise ratio difference is accomplished with an adaptive threshold for a desired voice activity detector (DVAD) and shaping filters. The DVAD includes averaging an output signal of a reference microphone channel to provide an estimated average background noise level. A threshold value is selected from a plurality of threshold values based on the estimated average background noise level. The threshold value is used to detect desired voice activity on a main microphone channel.
    Type: Grant
    Filed: October 18, 2015
    Date of Patent: April 18, 2023
    Assignee: SOLOS TECHNOLOGY LIMITED
    Inventors: Dashen Fan, Xi Chen, Hua Bao
  • Patent number: 11570477
    Abstract: Methods and systems are provided for implementing preprocessing operations and augmentation operations upon image datasets transformed to frequency domain representations, including decoding images of an image dataset to generate a frequency domain representation of the image dataset; performing a resizing operation based on resizing factors on the image dataset in a frequency domain representation; performing a reshaping operation based on reshaping factors on the image dataset in a frequency domain representation; and performing a cropping operation on the image dataset in a frequency domain representation. The methods and systems may further include performing an augmentation operation on the image dataset in a frequency domain representation. Methods and systems of the present disclosure may free learning models from computational overhead caused by transforming image datasets into frequency domain representations.
    Type: Grant
    Filed: December 31, 2019
    Date of Patent: January 31, 2023
    Assignee: Alibaba Group Holding Limited
    Inventors: Kai Xu, Fei Sun, Minghai Qin, Yen-kuang Chen
  • Patent number: 11551703
    Abstract: The invention provides a concept for combined dynamic range compression and guided clipping prevention for audio devices. An audio decoder for decoding an audio bitstream and a metadata bitstream related to the audio bitstream according to the concept includes an audio processing chain including a plurality of adjustment stages including a dynamic range control stage for adjusting a dynamic range of the audio output signal and a guided clipping prevention stage for preventing clipping of the audio output signal; and a metadata decoder configured to receive the metadata bitstream and to extract dynamic range control gain sequences and guided clipping prevention gain sequences from the metadata bitstream, at least a part of the dynamic range control gain sequences being supplied to the dynamic range control stage, and at least a part of the guided clipping prevention gain sequences being supplied to the guided clipping prevention stage.
    Type: Grant
    Filed: February 11, 2021
    Date of Patent: January 10, 2023
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Fabian Kuech, Christian Uhle, Michael Kratschmer, Bernhard Neugebauer, Michael Meier, Arne Borsum
  • Patent number: 11521631
    Abstract: An apparatus for selecting one of a first encoding algorithm having a first characteristic and a second encoding algorithm having a second characteristic for encoding a portion of an audio signal to obtain an encoded version of the portion of the audio signal has a first estimator for estimating a first quality measure for the portion of the audio signal, which is associated with the first encoding algorithm, without actually encoding and decoding the portion of the audio signal using the first encoding algorithm. A second estimator is provided for estimating a second quality measure for the portion of the audio signal, which is associated with the second encoding algorithm, without actually encoding and decoding the portion of the audio signal using the second encoding algorithm. The apparatus has a controller for selecting the first or second encoding algorithms based on a comparison between the first and second quality measures.
    Type: Grant
    Filed: March 31, 2020
    Date of Patent: December 6, 2022
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Stefan Doehla, Guillaume Fuchs, Eleni Fotopoulou, Christian Helmrich
  • Patent number: 11508386
    Abstract: An inventive concept relates to an audio coding method to which CNN-based frequency spectrum recovery is applied. An inventive concept transmits a part of frequency spectral coefficients generated in transform coding to a decoder and the decoder recovers the frequency spectral coefficient not transmitted. Furthermore, the signs of frequency spectral coefficient are transmitted from an encoder to the decoder depending on a sign transmission rule.
    Type: Grant
    Filed: April 8, 2020
    Date of Patent: November 22, 2022
    Assignees: Electronics and Telecommunications Research Institute, Kwangwoon University Industry-Academic Collaboration Foundation
    Inventors: Hochong Park, Seung Kwon Beack, Jongmo Sung, Seong-Hyeon Shin, Mi Suk Lee, Tae Jin Lee, Jin Soo Choi
  • Patent number: 11431353
    Abstract: An encoding method includes: receiving configuration data related to encoding with a predetermined encoding mode; determining an encoding strategy based on the configuration data, wherein the encoding strategy includes parameters associated with encoding the data on an entity; and causing the data to be encoded on the entity based on the encoding strategy.
    Type: Grant
    Filed: May 17, 2021
    Date of Patent: August 30, 2022
    Assignee: EMC IP Holding Company LLC
    Inventors: Zhenzhen Lin, Si Chen, Anzhou Hou
  • Patent number: 11380342
    Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.
    Type: Grant
    Filed: February 3, 2020
    Date of Patent: July 5, 2022
    Assignee: GOOGLE LLC
    Inventors: Minyue Li, Willem Bastiaan Kleijn, Jan Skoglund
  • Patent number: 8498422
    Abstract: Multi-channel audio signals are coded into a monaural audio signal and information allowing to recover the multi-channel audio signal from the monaural audio signal and the information. The information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multi-channel audio signal. The second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region. The information is multi-layered enabling a scaling of the decoding quality versus bit rate.
    Type: Grant
    Filed: April 22, 2003
    Date of Patent: July 30, 2013
    Assignee: Koninklijke Philips N.V.
    Inventors: Arnoldus Werner Johannes Oomen, Erik Gosuinus Petrus Schuijers, Dirk Jeroen Breebaart, Steven Leonardus Josephus Dimphina Elisabeth Van De Par
  • Patent number: 8046235
    Abstract: An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.
    Type: Grant
    Filed: September 7, 2010
    Date of Patent: October 25, 2011
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Miyoung Kim, Sangwook Kim, Dohyung Kim, Shihwa Lee, Junghoe Kim
  • Publication number: 20110040557
    Abstract: The present invention relates to a transmitter and a receiver for speech coding and decoding by using an additional bit allocation method. The transmitter and the receiver according to the present invention realize a voice communication service of high quality by using additional bits permitted in system requirements while using a conventional speech coder as it is. In addition, the transmitter and the receiver according to the present invention have an advantage in that they enable insertion of additional quantization blocks while not changing the structure of the conventional standard speech coder, since they allocate additional bits by applying a multi-stage quantization procedure not in a speech signal domain but in a parameter domain.
    Type: Application
    Filed: October 29, 2010
    Publication date: February 17, 2011
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Ho-Sang SUNG, Dae-Hwan Hwang, Dae-Hee Youn, Hong-Goo Kang, Young-Cheol Park, Ki-Seung Lee, Sung-Kyo Jung, Kyung-Tae Kim
  • Patent number: 7792679
    Abstract: The invention relates to the compression coding of digital signals such as multimedia signals (audio or video), and more particularly a method for multiple coding, wherein several encoders each comprising a series of functional blocks receive an input signal in parallel. Accordingly, a method is provided in which, a) the functional blocks forming each encoder are identified, along with one or several functions carried out of each block, b) functions which are common to various encoders are itemized and c) said common functions are carried out definitively for a part of at least all of the encoders within at least one same calculation module.
    Type: Grant
    Filed: November 24, 2004
    Date of Patent: September 7, 2010
    Assignee: France Telecom
    Inventors: David Virette, Claude Lamblin, Abdellatif Benjelloun Touimi
  • Publication number: 20100082352
    Abstract: An audio codec losslessly encodes audio data into a sequence of analysis windows in a scalable bitstream. This is suitably done by separating the audio data into MSB and LSB portions and encoding each with a different lossless algorithm. An authoring tool compares the buffered payload to an allowed payload for each window and selectively scales the losslessly encoded audio data, suitably the LSB portion, in the non-conforming windows to reduce the encoded payload, hence buffered payload. This approach satisfies the media bit rate and buffer capacity constraints without having to filter the original audio data, reencode or otherwise disrupt the lossless bitstream.
    Type: Application
    Filed: November 5, 2009
    Publication date: April 1, 2010
    Inventor: Zoran Fejzo
  • Publication number: 20080319739
    Abstract: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.
    Type: Application
    Filed: June 22, 2007
    Publication date: December 25, 2008
    Applicant: Microsoft Corporation
    Inventors: Sanjeev Mehrotra, Wei-Ge Chen
  • Patent number: 7437285
    Abstract: An encoding unit encodes data at a first rate during an initial predetermined section of uncompressed data (a), and encodes data at a second rate after the initial predetermined section (where the first rate<the second rate), and encoded data are stored in a storage. The storage is read out the stored data to a network at a transmission rate equal to the transmission rate of the network. A decoding unit decodes received data at the first rate during an initial predetermined section, and decodes received data at a second rate after the initial predetermined section. For finite contents, receiving completion and decoding completion of the received data are simultaneous. In accordance with the present invention, while tolerance with respect to data incoming fluctuations is ensured, playback of dynamic images and music and the like can be immediately started after receiving.
    Type: Grant
    Filed: May 1, 2002
    Date of Patent: October 14, 2008
    Assignee: KDDI Corporation
    Inventors: Shigeyuki Sakazawa, Yasutoshi Watanabe, Yasuhiro Takishima, Masahiro Wada
  • Publication number: 20080133226
    Abstract: Methods and apparatus for voice activity detection are disclosed.
    Type: Application
    Filed: September 20, 2007
    Publication date: June 5, 2008
    Applicant: Spreadtrum Communications Corporation
    Inventors: Heyun Huang, Tan Li, Fu-Huei Lin
  • Publication number: 20080021712
    Abstract: An audio codec losslessly encodes audio data into a sequence of analysis windows in a scalable bitstream. This is suitably done by separating the audio data into MSB and LSB portions and encoding each with a different lossless algorithm. An authoring tool compares the buffered payload to an allowed payload for each window and selectively scales the losslessly encoded audio data, suitably the LSB portion, in the non-conforming windows to reduce the encoded payload, hence buffered payload. This approach satisfies the media bit rate and buffer capacity constraints without having to filter the original audio data, reencode or otherwise disrupt the lossless bitstream.
    Type: Application
    Filed: August 14, 2007
    Publication date: January 24, 2008
    Inventor: Zoran Fejzo