Patents by Inventor Seung Kwon Beack

Seung Kwon Beack has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11508386
    Abstract: An inventive concept relates to an audio coding method to which CNN-based frequency spectrum recovery is applied. An inventive concept transmits a part of frequency spectral coefficients generated in transform coding to a decoder and the decoder recovers the frequency spectral coefficient not transmitted. Furthermore, the signs of frequency spectral coefficient are transmitted from an encoder to the decoder depending on a sign transmission rule.
    Type: Grant
    Filed: April 8, 2020
    Date of Patent: November 22, 2022
    Assignees: Electronics and Telecommunications Research Institute, Kwangwoon University Industry-Academic Collaboration Foundation
    Inventors: Hochong Park, Seung Kwon Beack, Jongmo Sung, Seong-Hyeon Shin, Mi Suk Lee, Tae Jin Lee, Jin Soo Choi
  • Patent number: 11508385
    Abstract: Disclosed is a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network and performs learning based on a result obtained by mapping a node of an output layer of the neural network and a quantization level of index of the residual signal.
    Type: Grant
    Filed: November 18, 2019
    Date of Patent: November 22, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hui Yong Kim
  • Publication number: 20220369058
    Abstract: Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal of N channels by down-mixing an M-channel audio signal of M channels; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.
    Type: Application
    Filed: July 29, 2022
    Publication date: November 17, 2022
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Yong Ju LEE, Jeong Il SEO, Seung Kwon BEACK, Kyeong Ok KANG, Jin Woong KIM, Jae Hyoun YOO
  • Publication number: 20220358940
    Abstract: Disclosed are methods of encoding and decoding an audio signal using side information, and an encoder and a decoder for performing the methods. The method of encoding an audio signal using side information includes identifying an input signal, the input signal being an original audio signal, extracting side information from the input signal using a learning model trained to extract side information from a feature vector of the input signal, encoding the input signal, and generating a bitstream by combining the encoded input signal and the side information.
    Type: Application
    Filed: November 16, 2021
    Publication date: November 10, 2022
    Inventors: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG, Jong Won SHIN, Soojoong HWANG, Youngju CHEON, Sangwook HAN
  • Patent number: 11488613
    Abstract: Disclosed are a method for coding a residual signal of LPC coefficients based on collaborative quantization and a computing device for performing the method. The residual signal coding method includes: generating encoded LPC coefficients and LPC residual signals by performing LPC analysis and quantization on an input speech; Determining a predicted LPC residual signal by applying the LPC residual signal to cross module residual learning; Performing LPC synthesis using the coded LPC coefficients and the predicted LPC residual signal; It may include the step of determining an output speech that is a synthesized output according to a result of performing the LPC synthesis.
    Type: Grant
    Filed: November 13, 2020
    Date of Patent: November 1, 2022
    Assignees: Electronics and Telecommunications Research Institute, The Trustees of Indiana University
    Inventors: Minje Kim, Kai Zhen, Mi Suk Lee, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Jin Soo Choi
  • Publication number: 20220335963
    Abstract: An audio signal encoding and decoding method using a neural network model, and an encoder and decoder for performing the same are disclosed. A method of encoding an audio signal using a neural network model, the method may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model encoding the input signal, and generating a bitstream corresponding to the quantized latent vector, wherein the neural network model may include i) a feature extraction layer generating a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks downsampling the latent vector, and iii) a plurality of quantization blocks performing quantization of a downsampled latent vector.
    Type: Application
    Filed: February 11, 2022
    Publication date: October 20, 2022
    Inventors: Inseon JANG, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Woo-taek LIM, Hong-Goo KANG, Jihyun LEE, Chanwoo LEE, Hyungseob LIM
  • Patent number: 11456001
    Abstract: Disclosed are a method of encoding a high band of an audio, a method of decoding a high band of an audio, and an encoder and a decoder for performing the methods. The method of decoding a high band of an audio, the method performed by a decoder, includes identifying a parameter extracted through a first neural network, identifying side information extracted through a second neural network, and restoring a high band of an audio by applying the parameter and the side information to a third neural network.
    Type: Grant
    Filed: March 10, 2020
    Date of Patent: September 27, 2022
    Assignees: Electronics and Telecommunications Research Institute, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
    Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hochong Park
  • Patent number: 11430457
    Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
    Type: Grant
    Filed: April 10, 2020
    Date of Patent: August 30, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Tae Jin Lee, Min Je Kim, Kyeongok Kang, Dae Young Jang, Jin Woo Hong, Jeongil Seo, Chieteuk Ahn, Hochong Park, Young-Cheol Park
  • Publication number: 20220262378
    Abstract: An audio signal encoding and decoding method using a learning model, a training method of the learning model, and an encoder and decoder that perform the method, are disclosed. The audio signal decoding method may include extracting a first residual signal and a first linear prediction coefficient by decoding a bitstream received from an encoder, generating a first audio signal from the first residual signal using the first linear prediction coefficient, generating a second linear prediction coefficients and a second residual signal from the first audio signal, obtaining a third linear prediction coefficient by inputting the second linear prediction coefficient into a trained learning model, and generating a second audio signal from the second residual signal using the third linear prediction coefficient.
    Type: Application
    Filed: February 15, 2022
    Publication date: August 18, 2022
    Inventors: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG
  • Patent number: 11412225
    Abstract: Disclosed herein is a context-adaptive entropy model for end-to-end optimized image compression. The entropy model exploits two types of contexts. The two types of contexts are a bit-consuming context and a bit-free context, respectively, and these contexts are classified depending on the corresponding context requires the allocation of additional bits. Based on these contexts, the entropy model may more accurately estimate the distribution of each latent representation using a more generalized form of entropy models, thus improving compression performance.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: August 9, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Joo-Young Lee, Seung-Hyun Cho, Seung-Kwon Beack, Hyunsuk Ko, Youn-Hee Kim, Jong-Ho Kim, Jin-Wuk Seok, Woong Lim, Se-Yoon Jeong, Hui-Yong Kim, Jin-Soo Choi
  • Patent number: 11405738
    Abstract: Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal of N channels by down-mixing an M-channel audio signal of M channels; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.
    Type: Grant
    Filed: December 4, 2019
    Date of Patent: August 2, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Yong Ju Lee, Jeong Il Seo, Seung Kwon Beack, Kyeong Ok Kang, Jin Woong Kim, Jae Hyoun Yoo
  • Publication number: 20220238126
    Abstract: Methods of encoding and decoding an audio signal using a learning model and an encoder and a decoder for performing the methods are disclosed. A method of encoding an audio signal using a learning model may include extracting pitch information of the audio signal, determining a dilation factor of a receptive field of a first expandable neural network block to extract a feature map from the audio signal based on the pitch information, generating a first feature map of the audio signal using the first expandable neural network block in which the dilation factor is determined, determining a second feature map by inputting the first feature map into a second expandable neural network block to process the first feature map, and converting the second feature map and the pitch information into a bitstream.
    Type: Application
    Filed: January 7, 2022
    Publication date: July 28, 2022
    Inventors: Jongmo SUNG, Seung Kwon BEACK, Tae Jin LEE, Woo-taek LIM, Inseon JANG
  • Publication number: 20220223159
    Abstract: An encoding/decoding apparatus and method for controlling a channel signal is disclosed, wherein the encoding apparatus may include an encoder to encode an object signal, a channel signal, and rendering information for the channel signal, and a bit stream generator to generate, as a bit stream, the encoded object signal, the encoded channel signal, and the encoded rendering information for the channel signal.
    Type: Application
    Filed: March 28, 2022
    Publication date: July 14, 2022
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Jeong Il SEO, Seung Kwon BEACK, Dae Young JANG, Kyeong Ok KANG, Tae Jin PARK, Yong Ju LEE, Keun Woo CHOI, Jin Woong KIM
  • Patent number: 11375331
    Abstract: The present research relates to controlling rendering of multi-object or multi-channel audio signals. The present research provides a method and apparatus for controlling rendering of multi-object or multi-channel audio signals based on spatial cues in a process of decoding the multi-object or multi-channel audio signals. To achieve the purpose, the method suggested in the research controls rendering in a spatial cue domain in the process of decoding the multi-object or multi-channel audio signals.
    Type: Grant
    Filed: May 8, 2020
    Date of Patent: June 28, 2022
    Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventors: Seung-Kwon Beack, Jeong-Il Seo, Dae-Young Jang, Tae-Jin Lee, Yong-Ju Lee, Jin-Woo Hong, Jin-Woong Kim
  • Publication number: 20220157326
    Abstract: A method of generating a residual signal performed by an encoder includes identifying an input signal including an audio sample, generating a first residual signal from the input signal using linear predictive coding (LPC), generating a second residual signal having a less information amount than the first residual signal by transforming the first residual signal, transforming the second residual signal into a frequency domain, and generating a third residual signal having a less information amount than the second residual signal from the transformed second residual signal using frequency-domain prediction (FDP) coding.
    Type: Application
    Filed: October 21, 2021
    Publication date: May 19, 2022
    Inventors: Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Woo-taek LIM, Inseon JANG
  • Patent number: 11328734
    Abstract: An encoding method for a multi-channel audio signal, an encoding apparatus for performing the encoding method, and a decoding method for a multi-channel audio signal and a decoding apparatus for performing the decoding method are disclosed. A method and apparatus of bypassing an MPEG Surround (MPS) standard operation and using an arbitrary tree when a number of audio signals of N channels exceeds a channel number defined in an MPS standard, is disclosed.
    Type: Grant
    Filed: January 6, 2020
    Date of Patent: May 10, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Jeong Il Seo, Jong Mo Sung, Tae Jin Lee, Jin Soo Choi
  • Patent number: 11330274
    Abstract: The present invention discloses an encoding apparatus using a Discrete Cosine Transform (DCT) scanning, which includes a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
    Type: Grant
    Filed: June 12, 2019
    Date of Patent: May 10, 2022
    Assignees: Electronics and Telecommunications Research Institute, Kwangwoon University Research Institute for Industry Cooperation, Industry-Academia Cooperation Group of Sejong University
    Inventors: Se-Yoon Jeong, Hae-Chul Choi, Jeong-Il Seo, Seung-Kwon Beack, In-Seon Jang, Jae-Gon Kim, Kyung-Ae Moon, Dae-Young Jang, Jin-Woo Hong, Jin-Woong Kim, Yung-Lyul Lee, Dong-Gyu Sim, Seoung-Jun Oh, Chang-Beom Ahn, Dae-Yeon Kim, Dong-Kyun Kim
  • Patent number: 11310615
    Abstract: An audio encoding apparatus and method that encodes hybrid contents including an object sound, a background sound, and metadata, and an audio decoding apparatus and method that decodes the encoded hybrid contents are provided. The audio encoding apparatus may include a mixing unit to generate an intermediate channel signal by mixing a background sound and an object sound, a matrix information encoding unit to encode matrix information used for the mixing, an audio encoding unit to encode the intermediate channel signal, and a metadata encoding unit to encode metadata including control information of the object sound.
    Type: Grant
    Filed: January 20, 2020
    Date of Patent: April 19, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Tae Jin Lee, Jong Mo Sung, Kyeong Ok Kang, Jeong Il Seo, Dae Young Jang, Yong Ju Lee, Jin Woong Kim
  • Patent number: 11289105
    Abstract: An encoding/decoding apparatus and method for controlling a channel signal is disclosed, wherein the encoding apparatus may include an encoder to encode an object signal, a channel signal, and rendering information for the channel signal, and a bit stream generator to generate, as a bit stream, the encoded object signal, the encoded channel signal, and the encoded rendering information for the channel signal.
    Type: Grant
    Filed: June 20, 2019
    Date of Patent: March 29, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Jeong Il Seo, Seung Kwon Beack, Dae Young Jang, Kyeong Ok Kang, Tae Jin Park, Yong Ju Lee, Keun Woo Choi, Jin Woong Kim
  • Publication number: 20220020385
    Abstract: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block using frequency-domain linear predictive coding (LPC), generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
    Type: Application
    Filed: July 15, 2021
    Publication date: January 20, 2022
    Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang, Jin Soo Choi