Linear Prediction Patents (Class 704/219)
  • Patent number: 11183201
    Abstract: A system and method for transferring a voice from one body of recordings to other recordings.
    Type: Grant
    Filed: May 12, 2020
    Date of Patent: November 23, 2021
    Inventor: John Alexander Angland
  • Patent number: 11176955
    Abstract: An audio signal transmission device for encoding an audio signal includes an audio encoding unit that encodes an audio signal and a side information encoding unit that calculates and encodes side information from a look-ahead signal. An audio signal receiving device for decoding an audio code and outputting an audio signal includes: an audio code buffer that detects packet loss based on a received state of an audio packet, an audio parameter decoding unit that decodes an audio code when an audio packet is correctly received, a side information decoding unit that decodes a side information code when an audio packet is correctly received, a side information accumulation unit that accumulates side information obtained by decoding a side information code, an audio parameter missing processing unit that outputs an audio parameter upon detection of audio packet loss, and an audio synthesis unit that synthesizes decoded audio from the audio parameter.
    Type: Grant
    Filed: December 17, 2019
    Date of Patent: November 16, 2021
    Assignee: NTT DOCOMO, INC.
    Inventors: Kimitaka Tsutsumi, Kei Kikuiri, Atsushi Yamaguchi
  • Patent number: 11170900
    Abstract: The invention relates to search for cases in a database. According to the proposed method and apparatus, similarity matching is performed between an input case and a set of cases in an initial search to receive similar cases by using a given matching criterion. Then statistics on image and/or non-image-based features associated with the similar cases are calculated and presented to the user with the similar cases. In a search refinement the similar cases are refined by additional features that are determined by the user based on the statistics. The search refinement can be iterative depending on the user's need.
    Type: Grant
    Filed: December 10, 2008
    Date of Patent: November 9, 2021
    Assignee: KONINKLIJKE PHILIPS N.V.
    Inventors: Lilla Boroczky, Lalitha Agnihotri, Luyin Zhao, Michael Chun-chieh Lee
  • Patent number: 11133014
    Abstract: A multi-channel signal encoding method and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial multi-channel parameter of the current frame, determining a difference parameter based on the initial multi-channel parameter of the current frame and multi-channel parameters of previous K frames of the current frame, where the difference parameter represents a difference between the initial multi-channel parameter of the current frame and the multi-channel parameters of the previous K frames, and K is an integer greater than or equal to one, determining a multi-channel parameter of the current frame based on the difference parameter and a characteristic parameter of the current frame, and encoding the multi-channel signal based on the multi-channel parameter of the current frame. Hence, the method and the encoder ensure better accuracy of inter-channel information of a multi-channel signal.
    Type: Grant
    Filed: February 11, 2019
    Date of Patent: September 28, 2021
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Zexin Liu, Xingtao Zhang, Haiting Li, Lei Miao
  • Patent number: 11120809
    Abstract: A coding method and a decoding method are provided which can use in combination a predictive coding and decoding method which is a coding and decoding method that can accurately express coefficients which are convertible into linear prediction coefficients with a small code amount and a coding and decoding method that can obtain correctly, by decoding, coefficients which are convertible into linear prediction coefficients of the present frame if a linear prediction coefficient code of the present frame is correctly input to a decoding device.
    Type: Grant
    Filed: July 31, 2019
    Date of Patent: September 14, 2021
    Assignee: Nippon Telegraph and Telephone Corporation
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 11107481
    Abstract: Systems and methods are described for concealing packet loss in a received audio stream. Packets of the audio stream may be received in a non-lapped transform domain format, where at least one packet is missing in the stream. The received packets are decoded, and each missing packet in the decoded stream is replaced by a reduced-energy signal block. Each reduced-energy signal block may also be modified at a beginning or ending boundary, and shifted such that a start or end of each missing packet does not coincide with a peak of a transform window of a lapped transform domain format. The raw audio signal may then be encoded into transform windows having the lapped transform domain format. Packet loss concealment may then be performed for selected transform windows that include modified reduced-energy blocks, either prior to transmission or after transmission by the receiving endpoint.
    Type: Grant
    Filed: April 9, 2019
    Date of Patent: August 31, 2021
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Raphael Marc Ullmann, Glenn N. Dickins
  • Patent number: 11062718
    Abstract: An encoding apparatus and a decoding apparatus in a transform between a Modified Discrete Cosine Transform (MDCT)-based coder and a different coder are provided. The encoding apparatus may encode additional information to restore an input signal encoded according to the MDCT-based coding scheme, when switching occurs between the MDCT-based coder and the different coder. Accordingly, an unnecessary bitstream may be prevented from being generated, and minimum additional information may be encoded.
    Type: Grant
    Filed: September 25, 2017
    Date of Patent: July 13, 2021
    Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
    Inventors: Seung Kwon Beack, Tae Jin Lee, Min Je Kim, Dae Young Jang, Kyeongok Kang, Jin Woo Hong, Ho Chong Park, Young-cheol Park
  • Patent number: 11032574
    Abstract: In a method of video decoding in a decoder, a merge candidate list of a current coding block is constructed for processing the current coding block with a triangular prediction mode (TPM). The merge candidate list can include merge candidates each having one or two motion vectors. Each motion vector can be associated with a first reference picture list or a second reference picture list. A first motion vector and a second motion vector are determined from the motion vectors of the merge candidates on the merge candidate list. The current block is processed with the TPM with the first and second motion vectors as two motion vector predictors (MVPs) of two triangular partitions of the current coding block.
    Type: Grant
    Filed: August 7, 2019
    Date of Patent: June 8, 2021
    Assignee: TENCENT AMERICA LLC
    Inventors: Meng Xu, Xiang Li, Shan Liu
  • Patent number: 10997511
    Abstract: Certain aspects involve optimizing neural networks or other models for assessing risks and generating explanatory data regarding predictor variables used in the model. In one example, a system identifies predictor variables. The system generates a neural network for determining a relationship between each predictor variable and a risk indicator. The system performs a factor analysis on the predictor variables to determine common factors. The system iteratively adjusts the neural network so that (i) a monotonic relationship exists between each common factor and the risk indicator and (ii) a respective variance inflation factor for each common factor is sufficiently low. Each variance inflation factor indicates multicollinearity among the common factors. The adjusted neural network can be used to generate explanatory indicating relationships between (i) changes in the risk indicator and (ii) changes in at least some common factors.
    Type: Grant
    Filed: October 21, 2020
    Date of Patent: May 4, 2021
    Assignee: EQUIFAX INC.
    Inventors: Matthew Turner, Michael McBurnett, Yafei Zhang
  • Patent number: 10991376
    Abstract: A method and apparatus for handling input Line Spectral Frequency, LSF, coefficients. The method comprises determining LSF residual coefficients as first compressed LSF coefficients subtracted from the input LSF coefficients, and transforming the LSF residual coefficients into a warped domain. One of a plurality of gain-shape coding schemes is applied on the transformed LSF residual coefficients in order to achieve gain-shape coded LSF residual coefficients, where the plurality of gain-shape coding schemes have mutually different trade-offs in one or more of gain resolution and shape resolution for one or more of the transformed LSF residual coefficients. A representation of the first compressed LSF coefficients, the gain-shape coded LSF residual coefficients, and information on the applied gain-shape coding scheme are transmitted over a communication channel to a decoder.
    Type: Grant
    Filed: November 28, 2017
    Date of Patent: April 27, 2021
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Jonas Svedberg, Stefan Bruhn, Martin Sehlstedt
  • Patent number: 10978070
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker diarization are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The actions further include determining that the audio data includes an utterance of a predefined hotword spoken by a first speaker. The actions further include identifying a first portion of the audio data that includes speech from the first speaker. The actions further include identifying a second portion of the audio data that includes speech from a second, different speaker. The actions further include transmitting the first portion of the audio data that includes speech from the first speaker and suppressing transmission of the second portion of the audio data that includes speech from the second, different speaker.
    Type: Grant
    Filed: August 27, 2019
    Date of Patent: April 13, 2021
    Inventors: Aleksandar Kracun, Richard Cameron Rose
  • Patent number: 10970993
    Abstract: A method for managing the assistance to a person in response to the emission of an alert includes emitting an alert from a piece of mobile equipment of a first user to a plurality of users; establishing a first two-way communication between the first equipment and a given terminal of the first set of an assisting user; automatic generating of a plurality of first notifications to a subset of terminals of the first set, each one of the notifications including at least one piece of data that identifies the assisting user; automatic generating of a plurality of second notifications to the second subset, each second notification including a status relative to the processing of the alert by the assisting user.
    Type: Grant
    Filed: March 15, 2019
    Date of Patent: April 6, 2021
    Assignee: HAREAU
    Inventor: Ferdinand Rousseau
  • Patent number: 10947594
    Abstract: The present invention relates to coding of audio signals, and in particular to high frequency reconstruction methods including a frequency domain harmonic transposer. A system and method for generating a high frequency component of a signal from a low frequency component of the signal is described.
    Type: Grant
    Filed: March 5, 2020
    Date of Patent: March 16, 2021
    Assignee: Dolby International AB
    Inventors: Lars Villemoes, Per Ekstrand
  • Patent number: 10909997
    Abstract: According to an aspect of the present invention an encoder for encoding an audio signal has an analyzer configured for deriving prediction coefficients and a residual signal from a frame of the audio signal. The encoder has a formant information calculator configured for calculating a speech related spectral shaping information from the prediction coefficients, a gain parameter calculator configured for calculating a gain parameter from an unvoiced residual signal and the spectral shaping information and a bitstream former configured for forming an output signal based on an information related to a voiced signal frame, the gain parameter or a quantized gain parameter and the prediction coefficients.
    Type: Grant
    Filed: July 8, 2019
    Date of Patent: February 2, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Guillaume Fuchs, Markus Multrus, Emmanuel Ravelli, Markus Schnell
  • Patent number: 10908670
    Abstract: A circuit for sound activity detection includes a transducer (106) adapted to generate an electrical signal based on detected sound; a variable gain amplifier adapted to amplify the electrical signal to generate an amplified electrical signal; a comparator adapted to compare the amplified electrical signal with at least one first threshold level to generate a comparison signal indicating comparator events; and a control circuit adapted to generate, based on the comparison signal, a gain control signal for controlling the gain of the variable gain amplifier, and a sound activity alert signal indicating the detection of sound activity.
    Type: Grant
    Filed: September 26, 2017
    Date of Patent: February 2, 2021
    Assignee: Dolphin Integration
    Inventor: Emmanuel Grand
  • Patent number: 10885922
    Abstract: A method includes decoding a low-band portion of an encoded mid channel to generate a decoded low-band mid channel. The method also includes filtering the decoded low-band mid channel according to one or more filter coefficients to generate a low-band filtered mid channel. The method also includes generating an inter-channel predicted signal based on the low-band filtered mid channel and the inter-channel prediction gain. The method further includes generating a low-band left channel and a low-band right channel based on an up-mix factor, the decoded low-band mid channel, and the inter-channel predicted signal.
    Type: Grant
    Filed: September 19, 2019
    Date of Patent: January 5, 2021
    Assignee: QUALCOMM Incorporated
    Inventors: Venkatraman Atti, Venkata Subrahmanyam Chandra Sekhar Chebiyyam, Daniel Jared Sinder
  • Patent number: 10840949
    Abstract: A method of redundantly encoding data includes formatting the data into columns and rows, and generating first and second sets of projections of the data using an encoding transform. For each set of projections generated, an encoding parameter of the encoding transform is set to a different value. The first and second sets of projections are stored as the encoded data. A decoding method reads settings including an indication of a number of data fragments. The number of data fragments is compared to a number of projections in a first set of projections of the encoded data in order to determine whether to use a first or a second decoding mode. The encoded data is then decoded according to the selected decoding mode and the result is outputted.
    Type: Grant
    Filed: January 18, 2019
    Date of Patent: November 17, 2020
    Assignee: ZEBWARE AB
    Inventor: Thomas Nilsson
  • Patent number: 10839794
    Abstract: The present disclosure provides a method and an apparatus for correcting an input speech based on artificial intelligence. The method includes: receiving a speech input by a user; performing recognition on the speech to obtain a current recognition text; obtaining at least one candidate phrase of a first phrase to be corrected in the current recognition text and displaying the at least one candidate phrase to the user; detecting a select operation of the user, the select operation being configured to select one of the at least one candidate phrase as a target candidate phrase; and correcting the first phrase in the current recognition text by using the target candidate phrase, to obtain a target recognition text.
    Type: Grant
    Filed: August 7, 2018
    Date of Patent: November 17, 2020
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventor: Kuai Li
  • Patent number: 10818313
    Abstract: A method for detecting an audio signal and an apparatus, where the method includes determining an input audio signal as a to-be-determined audio signal, determining an enhanced segmental signal-to-noise ratio (SSNR) of the audio signal, where the enhanced SSNR is greater than a reference SSNR, and comparing the enhanced SSNR with a voice activity detection (VAD) decision threshold to determine whether the audio signal is an active signal. Therefore, the method and the apparatus can accurately distinguish an active voice and an inactive voice.
    Type: Grant
    Filed: April 23, 2019
    Date of Patent: October 27, 2020
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Zhe Wang
  • Patent number: 10811019
    Abstract: A spectrum encoding method includes selecting an important spectral component in band units for a normalized spectrum and encoding information of the selected important spectral component for a band, based on a number, a position, a magnitude and a sign thereof. A spectrum decoding method includes obtaining from a bitstream, information about an important spectral component for a band of an encoded spectrum and decoding the obtained information of the important spectral component, based on a number, a position, a magnitude and a sign of the important spectral component.
    Type: Grant
    Filed: February 22, 2019
    Date of Patent: October 20, 2020
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Ho-sang Sung
  • Patent number: 10811022
    Abstract: A method and apparatus for performing coding and decoding for high-frequency bandwidth extension. The decoding apparatus may include: a mode checking unit to check mode information of each of frames included in a bitstream; a first core decoding unit to perform code excited linear prediction (CELP) decoding on a CELP coded frame, when a core coding mode of a low-frequency signal indicates a CELP coding mode; a first extension decoding unit to generate a decoded signal of a high-frequency band by using at least one of a result of the performing the CELP decoding and an excitation signal of the low-frequency signal; a second core decoding unit to perform audio decoding on an audio coded frame, when the core coding mode indicates an audio coding mode; and a second extension decoding unit to generate a decoded signal of the high-frequency band by performing frequency-domain (FD) extension decoding.
    Type: Grant
    Filed: October 18, 2019
    Date of Patent: October 20, 2020
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ki-hyun Choo, Eun-mi Oh, Ho-sang Sung
  • Patent number: 10778991
    Abstract: Encoding of a video file includes determining a plurality of scenes associated with a video file, and determining at least one group of pictures (GOP). Starting sequentially from a beginning frame of the video file, the system identifies a first GOP having a first encoding error characteristic. The system changes a bitrate allocation setting from a first setting to a second setting based on the encoding error characteristic. The system identifies a second frame having a second encoding error characteristic, and changes a second bitrate allocation setting from the second setting to a third setting based on the second encoding error characteristic. The system generates an encoded video file that includes an encoded plurality of scenes.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: September 15, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Amarsingh B Winston, Deepthi Nandakumar, Avisar Ten-Ami
  • Patent number: 10771621
    Abstract: Systems, methods, and devices are disclosed for detecting an active speaker in a two-way conference. Real time audio in one or more sub band domains are analyzed according to an echo cancellor model. Based on the analyzed real time audio, one or more audio metrics are determined from output from an acoustic echo cancellation linear filter. The one or more audio metrics are weighted based on a priority, and a speaker status is determined based on the weighted one or more audio metrics being analyzed according to an active speaker detection model. For an active speaker status, one or more residual echo or noise is removed from the real time audio based on the one or more audio metrics.
    Type: Grant
    Filed: April 2, 2018
    Date of Patent: September 8, 2020
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Fuling Liu, Eric Chen, Wei Li, Wei-Lien Hsu
  • Patent number: 10762908
    Abstract: An audio packet error concealment system includes an encoding unit for encoding an audio signal consisting of a plurality of frames, and an auxiliary information encoding unit for estimating and encoding auxiliary information about a temporal change of power of the audio signal. The auxiliary information is used in packet loss concealment in decoding of the audio signal. The auxiliary information about the temporal change of power may contain a parameter that functionally approximates a plurality of powers of subframes shorter than one frame, or may contain information about a vector obtained by vector quantization of a plurality of powers of subframes shorter than one frame.
    Type: Grant
    Filed: September 20, 2018
    Date of Patent: September 1, 2020
    Assignee: NTT DOCOMO, INC.
    Inventors: Kimitaka Tsutsumi, Kei Kikuiri
  • Patent number: 10742231
    Abstract: The present disclosure relates to a compression/encoding apparatus and method, a decoding apparatus and method, and a program that allow for provision of a lossless compression technology with higher compression ratio. A GOB data configuration section configures GOB data with a group of digital data that includes a plurality of blocks by treating a frame of delta-sigma-modulated digital data as a block. A table generation section generates a conversion table for encoding the GOB data. An encoding section compresses and encodes the digital data of each block included in the GOB data by using the conversion table. The present technology is applicable, for example, to audio signal compression and encoding, and so on.
    Type: Grant
    Filed: May 10, 2017
    Date of Patent: August 11, 2020
    Assignee: Sony Corporation
    Inventors: Takao Fukui, Toru Chinen
  • Patent number: 10734003
    Abstract: A linear prediction-based noise signal processing method, includes obtaining a linear prediction coefficient of the noise signal, filtering a signal derived from the noise signal based on the linear prediction coefficient in order to obtain a linear prediction residual signal, obtaining excitation energy of the linear prediction residual signal and a spectral envelope of the linear prediction residual signal, and the spectral envelope, the excitation energy and the linear prediction coefficient are encoded.
    Type: Grant
    Filed: October 23, 2018
    Date of Patent: August 4, 2020
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Zhe Wang
  • Patent number: 10735775
    Abstract: A method, electronic device, computer program product, system and circuit assembly are provided for allocating one or more redundant pictures by taking into consideration the information content of the primary pictures, with which the redundant pictures would be associated. In particular, primary pictures that are determined to be more sensitive to transmission loss or corruption may be allocated one or more redundant pictures, while those that are less sensitive may not be so allocated. By selectively allocating redundant pictures to only those primary pictures that are more sensitive, the method disclosed reduces the amount of overhead associated with redundant pictures and increases the coding efficiency, without sacrificing the integrity of the video data.
    Type: Grant
    Filed: March 19, 2019
    Date of Patent: August 4, 2020
    Assignee: Conversant Wireless Licensing S.a r.l.
    Inventors: Chunbo Zhu, Ye-Kui Wang, Houqiang Li
  • Patent number: 10734002
    Abstract: The invention provides methods and devices for outputting a stereo audio signal having a left channel and a right channel. The apparatus includes a demultiplexer, decoder, and upmixer. The upmixer is configured operate either in a prediction mode or a non-prediction mode based on a parameter encoded in the audio bitstream.
    Type: Grant
    Filed: October 4, 2019
    Date of Patent: August 4, 2020
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes
  • Patent number: 10734009
    Abstract: An envelope sequence is provided that can improve approximation accuracy near peaks caused by the pitch period of an audio signal. A periodic-combined-envelope-sequence generation device according to the present invention takes, as an input audio signal, a time-domain audio digital signal in each frame, which is a predetermined time segment, and generates a periodic combined envelope sequence as an envelope sequence. The periodic-combined-envelope-sequence generation device according to the present invention comprises at least a spectral-envelope-sequence calculating part and a periodic-combined-envelope generating part. The spectral-envelope-sequence calculating part calculates a spectral envelope sequence of the input audio signal on the basis of time-domain linear prediction of the input audio signal.
    Type: Grant
    Filed: December 21, 2018
    Date of Patent: August 4, 2020
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10720172
    Abstract: An encoder for encoding an audio signal, audio transmission system and method for determining correction values includes an analyzer for analyzing the audio signal and for determining analysis prediction coefficients from the audio signal. Including a converter for deriving converted prediction coefficients from the analysis prediction coefficients, a memory for storing a multitude of correction values and a calculator. The calculator includes a processor for processing the converted prediction coefficients to obtain spectral weighting factors and a combiner for combining the spectral weighting factors and the multitude of correction values to obtain corrected weighting factors. A quantizer of the calculator is configured for quantizing the converted prediction coefficients using the corrected weighting factors obtaining a quantized representation of the converted prediction coefficients.
    Type: Grant
    Filed: February 7, 2019
    Date of Patent: July 21, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Konstantin Schmidt, Guillaume Fuchs, Matthias Neusinger, Martin Dietz
  • Patent number: 10714107
    Abstract: The purpose of the present invention is to estimate, with a small amount of computation, a linear prediction synthesis filter after conversion of an internal sampling frequency. A linear prediction coefficient conversion device is a device that converts first linear prediction coefficients calculated at a first sampling frequency to second linear prediction coefficients at a second sampling frequency different from the first sampling frequency, which includes a means for calculating, on the real axis of the unit circle, a power spectrum corresponding to the second linear prediction coefficients at the second sampling frequency based on the first linear prediction coefficients or an equivalent parameter, a means for calculating, on the real axis of the unit circle, autocorrelation coefficients from the power spectrum, and a means for converting the autocorrelation coefficients to the second linear prediction coefficients at the second sampling frequency.
    Type: Grant
    Filed: November 14, 2018
    Date of Patent: July 14, 2020
    Assignee: NTT DOCOMO, INC.
    Inventors: Nobuhiko Naka, Vesa Ruoppila
  • Patent number: 10714108
    Abstract: The purpose of the present invention is to estimate, with a small amount of computation, a linear prediction synthesis filter after conversion of an internal sampling frequency. A linear prediction coefficient conversion device is a device that converts first linear prediction coefficients calculated at a first sampling frequency to second linear prediction coefficients at a second sampling frequency different from the first sampling frequency, which includes a means for calculating, on the real axis of the unit circle, a power spectrum corresponding to the second linear prediction coefficients at the second sampling frequency based on the first linear prediction coefficients or an equivalent parameter, a means for calculating, on the real axis of the unit circle, autocorrelation coefficients from the power spectrum, and a means for converting the autocorrelation coefficients to the second linear prediction coefficients at the second sampling frequency.
    Type: Grant
    Filed: November 14, 2018
    Date of Patent: July 14, 2020
    Assignee: NTT DOCOMO, INC.
    Inventors: Nobuhiko Naka, Vesa Ruoppila
  • Patent number: 10686466
    Abstract: A method for differentiator-based compression of digital data includes (a) using a subtraction module, subtracting a predicted signal from a sample of an original signal to obtain an error signal, (b) using a quantization module, quantizing the error signal to obtain a quantized error signal, and (c) generating the predicted signal using a least means square (LMS)-based filtering method.
    Type: Grant
    Filed: July 3, 2019
    Date of Patent: June 16, 2020
    Assignee: CABLE TELEVISION LABORATORIES, INC.
    Inventors: Mu Xu, Zhensheng Jia, Jing Wang, Luis Alberto Campos
  • Patent number: 10679638
    Abstract: The coding efficiency of an audio codec using a controllable—switchable or even adjustable—harmonic filter tool is improved by performing the harmonicity-dependent controlling of this tool using a temporal structure measure in addition to a measure of harmonicity in order to control the harmonic filter tool. In particular, the temporal structure of the audio signal is evaluated in a manner which depends on the pitch. This enables to achieve a situation-adapted control of the harmonic filter tool so that in situations where a control made solely based on the measure of harmonicity would decide against or reduce the usage of this tool, although using the harmonic filter tool would, in that situation, increase the coding efficiency, the harmonic filter tool is applied, while in other situations where the harmonic filter tool may be inefficient or even destructive, the control reduces the appliance of the harmonic filter tool appropriately.
    Type: Grant
    Filed: August 30, 2018
    Date of Patent: June 9, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Goran Markovic, Christian Helmrich, Emmanuel Ravelli, Manuel Jander, Stefan Doehla
  • Patent number: 10657973
    Abstract: A method including decomposing a magnitude part of a signal spectrum of a mixture signal into spectral components, each spectral component including a frequency part and a time activation part; and clustering the spectral components to obtain one or more clusters of spectral components, wherein the clustering of the spectral components is computed in the time domain.
    Type: Grant
    Filed: September 29, 2015
    Date of Patent: May 19, 2020
    Assignee: SONY CORPORATION
    Inventors: Xin Guo, Stefan Uhlich, Yuhki Mitsufuji
  • Patent number: 10657937
    Abstract: The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular, a system configured to generate a high frequency component of a signal from a low frequency component of the signal is described. The system may comprise an analysis filter bank (501) configured to provide a set of analysis subband signals from the low frequency component of the signal; wherein the set of analysis subband signals comprises at least two analysis subband signals; wherein the analysis filter bank (501) has a frequency resolution of ?f.
    Type: Grant
    Filed: April 5, 2019
    Date of Patent: May 19, 2020
    Assignee: Dolby International AB
    Inventors: Per Ekstrand, Lars Villemoes, Per Hedelin
  • Patent number: 10636421
    Abstract: A speech-based human-machine interface that parses words spoken to detect a complete parse and, responsive to so detecting, computes a hypothesis as to whether the words are a prefix to another complete parse. The duration of no voice activity period to determine an end of a sentence depends on the prefix hypothesis. The user's typical speech speed profile and a short-term measure of speech speed also scale the period. Speech speed is measured by the time between words, and the period scaling uses a continuously adaptive algorithm. The system uses a longer cut-off period after a system wake-up event but before it detects any voice activity.
    Type: Grant
    Filed: December 27, 2017
    Date of Patent: April 28, 2020
    Assignee: SOUNDHOUND, INC.
    Inventors: Jennifer Hee Young Zhang, Patricia Pozon Aguayo, Jonah Probell
  • Patent number: 10635068
    Abstract: The current disclosure provides a method for transmitting encoded information signals through a control system and to a decoder. The encoded information signals are transmitted along with control signals as an encoded message. The information signals are encoded based at least in part on a control-coding capacity of the control system.
    Type: Grant
    Filed: March 16, 2017
    Date of Patent: April 28, 2020
    Inventor: Charalambos D. Charalambous
  • Patent number: 10622005
    Abstract: A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the input narrowband audio signal. Other embodiments are disclosed.
    Type: Grant
    Filed: July 27, 2018
    Date of Patent: April 14, 2020
    Assignee: Staton Techiya, LLC
    Inventors: John Usher, Dan Ellis
  • Patent number: 10622001
    Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
    Type: Grant
    Filed: May 15, 2018
    Date of Patent: April 14, 2020
    Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
    Inventors: Seungkwon Beack, Tae Jin Lee, Min Je Kim, Kyeongok Kang, Dae Young Jang, Jeongil Seo, Jin Woo Hong, Chieteuk Ahn, Ho Chong Park, Young-cheol Park
  • Patent number: 10614339
    Abstract: According to an example aspect of the present invention, there is provided an apparatus comprising at least one processing core, at least one memory including computer program code, the at least one memory and the computer program code being configured to, with the at least one processing core, cause the apparatus at least to provide an input data item to a first convolutional layer of an artificial neural network comprising a set of convolutional layers, process the input data item in the set of convolutional layers, define, in a feature map output from a last convolutional layer of the set of convolutional layers, a first feature map patch and a second feature map patch, and provide the first feature map patch to a first classifier and the second feature map patch to a second classifier.
    Type: Grant
    Filed: July 29, 2015
    Date of Patent: April 7, 2020
    Assignee: Nokia Technologies Oy
    Inventor: Xiaoheng Jiang
  • Patent number: 10607620
    Abstract: A decoder for processing an audio signal receives an audio bitstream, decodes the bitstream to obtain a set of spectral frequency parameters that are arranged in an order of frequencies, determines a minimum spectral frequency parameter difference from a plurality of calculated spectral frequency parameter differences, determines a start frequency bin for predicting a high band excitation signal according to the minimum spectral frequency parameter difference, generates the high band excitation signal by selecting a frequency band with a preset bandwidth selected from a low band excitation signal according to the start frequency bin, and synthesizes a wideband signal based on the generated high band excitation signal.
    Type: Grant
    Filed: May 20, 2019
    Date of Patent: March 31, 2020
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Zexin Liu, Lei Miao
  • Patent number: 10607619
    Abstract: An encoder for encoding an audio signal has: an analyzer configured for deriving prediction coefficients and a residual signal from an unvoiced frame of the audio signal; a gain parameter calculator configured for calculating a first gain parameter information for defining a first excitation signal related to a deterministic codebook and for calculating a second gain parameter information for defining a second excitation signal related to a noise-like signal for the unvoiced frame; and a bitstream former configured for forming an output signal based on an information related to a voiced signal frame, the first gain parameter information and the second gain parameter information.
    Type: Grant
    Filed: April 1, 2019
    Date of Patent: March 31, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Guillaume Fuchs, Markus Multrus, Emmanuel Ravelli, Markus Schnell
  • Patent number: 10595025
    Abstract: A transmitting device for generating a plurality of encoded portions of a video to be transmitted to a receiving device over a network configured to: receive an error message over a feedback channel from the receiving device indicating at least one of said plurality of encoded portions that has been lost at the receiving device; encode a recovery portion responsive to said receiving said error message; and transmit said recovery portion to the receiving device over said network; wherein said error message includes information pertaining to a decoded portion successfully decoded at the receiving device and said recovery portion is encoded relative to said decoded portion.
    Type: Grant
    Filed: September 8, 2015
    Date of Patent: March 17, 2020
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Ming-Chieh Lee, Amy Lu, Pontus Carlsson, Mattias Dan Nilsson, Sergey Sablin, Sergey Silkin, David Yuheng Zhao, Magnus Hemmendorff, Sergei Nikiforov
  • Patent number: 10580425
    Abstract: Proposed is a method and apparatus for determining a weighting function for quantizing a linear predictive coding (LPC) coefficient and having a low complexity. The weighting function determination apparatus may convert an LPC coefficient of a mid-subframe of an input signal to one of a immittance spectral frequency (ISF) coefficient and a line spectral frequency (LSF) coefficient, and may determine a weighting function associated with an importance of the ISF coefficient or the LSF coefficient based on the converted ISF coefficient or LSF coefficient.
    Type: Grant
    Filed: August 28, 2017
    Date of Patent: March 3, 2020
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ho Sang Sung, Eun Mi Oh
  • Patent number: 10580415
    Abstract: An apparatus for generating a bandwidth extended signal from a bandwidth limited audio signal, the bandwidth limited audio signal The patch generator is configured to perform a harmonic patching algorithm to obtain the patched signal. The signal manipulator is configured for manipulating a signal before patching or the patched signal. The timely preceding bandwidth limited time block timely precedes the current bandwidth limited time block in the plurality of consecutive bandwidth limited time blocks of the bandwidth limited audio signal. The combiner is configured for combining the bandwidth limited audio signal having the core frequency band and the manipulated patched signal having the upper frequency band to obtain the bandwidth extended signal.
    Type: Grant
    Filed: May 14, 2018
    Date of Patent: March 3, 2020
    Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
    Inventors: Frederik Nagel, Stephan Wilde
  • Patent number: 10579684
    Abstract: A computer-implemented method for determining a relevance of a node in a network. A digital representation of a local neighborhood structure of the node in the network is obtained in a computer-readable non-volatile memory. A numerical value characteristic of the node's relevance is determined, and output to a user. The numerical value is determined based on the neighborhood structure of the node.
    Type: Grant
    Filed: January 30, 2015
    Date of Patent: March 3, 2020
    Assignee: Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V.
    Inventor: Glenn Lawyer
  • Patent number: 10573326
    Abstract: A method includes decoding a low-band mid channel bitstream to generate a low-band mid signal and a low-band mid excitation signal. The method further includes decoding a high-band mid channel bandwidth extension bitstream to generate a synthesized high-band mid signal. The method also includes determining an inter-channel bandwidth extension (ICBWE) gain mapping parameter corresponding to the synthesized high-band mid signal. The ICBWE gain mapping parameter is based on a selected frequency-domain gain parameter that is extracted from a stereo downmix/upmix parameter bitstream. The method further includes performing a gain scaling operation on the synthesized high-band mid signal based on the ICBWE gain mapping parameter to generate a reference high-band channel and a target high-band channel. The method includes outputting a first audio channel and a second audio channel. The first audio channel is based on the reference high-band channel, and the second audio channel is based on target high-band channel.
    Type: Grant
    Filed: March 26, 2018
    Date of Patent: February 25, 2020
    Assignee: Qualcomm Incorporated
    Inventors: Venkata Subrahmanyam Chandra Sekhar Chebiyyam, Venkatraman Atti
  • Patent number: 10553228
    Abstract: Disclosed are some examples of systems, apparatus, methods and computer program products implementing techniques for extending the range of a set of decoded parameter values for a sequence of frequency bands in an identifiable time frame of an audio signal. In some implementations, the parameter values vary in relation to a sequence of time frames of the audio signal and in relation to a sequence of frequency bands in each time frame. In some implementations, it is determined that a decoded value corresponds to a minimum of a first range of values of a first coding protocol of a set of coding protocols. The determined value is modified to be below the minimum of the first range of values to produce an extended value. A modified set of decoded values including one or more extended values can thus be provided.
    Type: Grant
    Filed: April 1, 2016
    Date of Patent: February 4, 2020
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Per Ekstrand, Harald Mundt, Klaus Peichl
  • Patent number: 10552457
    Abstract: Systems and methods for the matching of datasets, such as input audio segments, with known datasets in a database are disclosed. In an illustrative embodiment, the use of the presently disclosed systems and methods is described in conjunction with recognizing known network message recordings encountered during an outbound telephone call. The methodologies include creation of a ternary fingerprint bitmap to make the comparison process more efficient. Also disclosed are automated methodologies for creating the database of known datasets from a larger collection of datasets.
    Type: Grant
    Filed: January 19, 2018
    Date of Patent: February 4, 2020
    Inventors: Kevin Vlack, Felix Immanuel Wyss