Patents by Inventor Jongmo Sung

Jongmo Sung has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210233547
    Abstract: A method and apparatus for processing an audio signal are disclosed. According to an example embodiment, a method of processing an audio signal may include acquiring a final audio signal for an initial audio signal using a plurality of neural network models generating output audio signals by encoding and decoding input audio signals, calculating a difference between the initial audio signal and the final audio signal in a time domain, converting the initial audio signal and the final audio signal into Mel-spectra, calculating a difference between the Mel-spectra of the initial audio signal and the final audio signal in a frequency domain, training the plurality of neural network models based on results calculated in the time domain and the frequency domain, and generating a new final audio signal distinguished from the final audio signal from the initial audio signal using the trained neural network models.
    Type: Application
    Filed: January 22, 2021
    Publication date: July 29, 2021
    Applicants: Electronics and Telecommunications Research Institute, The Trustees of Indiana University
    Inventors: Mi Suk LEE, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Jin Soo CHOI, Minje KIM, Kai ZHEN
  • Publication number: 20210174815
    Abstract: Disclosed are a quantizing method for a latent vector and a computing device for performing the quantizing method. A quantizing method of a latent vector includes performing information shaping on the latent vector resulting from reduction in a dimension of an input signal using a target neural network; clamping a residual signal of the latent vector derived based on the information shaping; performing rescaling on the clamped residual signal; and performing quantization on the rescaled residual signal.
    Type: Application
    Filed: December 4, 2020
    Publication date: June 10, 2021
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon BEACK, Jooyoung LEE, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Seunghyun CHO, Jin Soo CHOI
  • Publication number: 20210166701
    Abstract: An audio signal encoding/decoding device and method using a filter bank is disclosed. The audio signal encoding method includes generating a plurality of first audio signals by performing filtering on an input audio signal using an analysis filter bank, generating a plurality of second audio signals by performing downsampling on the first audio signals, and outputting a bitstream by encoding and quantizing the second audio signals.
    Type: Application
    Filed: November 25, 2020
    Publication date: June 3, 2021
    Inventors: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE
  • Publication number: 20210166706
    Abstract: Disclosed is an apparatus and method for encoding/decoding an audio signal using information of a previous frame. An audio signal encoding method includes: generating a current latent vector by reducing dimension of a current frame of an audio signal; generating a concatenation vector by concatenating a previous latent vector generated by reducing dimension of a previous frame of the audio signal with the current latent vector; and encoding and quantizing the concatenation vector.
    Type: Application
    Filed: November 27, 2020
    Publication date: June 3, 2021
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE
  • Publication number: 20210142812
    Abstract: Disclosed are a method for coding a residual signal of LPC coefficients based on collaborative quantization and a computing device for performing the method. The residual signal coding method includes: generating encoded LPC coefficients and LPC residual signals by performing LPC analysis and quantization on an input speech; determining a predicted LPC residual signal by applying the LPC residual signal to cross-module residual learning; performing LPC synthesis using the encoded LPC coefficients and the predicted LPC residual signal; and determining an output speech that is a synthesized output according to a result of performing the LPC synthesis.
    Type: Application
    Filed: November 13, 2020
    Publication date: May 13, 2021
    Applicants: Electronics and Telecommunications Research Institute, The Trustees of Indiana University
    Inventors: Minje KIM, Kai ZHEN, Mi Suk LEE, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Jin Soo CHOI
  • Publication number: 20210074306
    Abstract: Provided are an audio encoding method, an audio decoding method, an audio encoding apparatus, and an audio decoding apparatus using dynamic model parameters. The audio encoding method using dynamic model parameters may use a dynamic model parameter corresponding to each of the levels of the encoding network when reducing the dimension of an audio signal in the encoding network. In addition, the audio decoding method using dynamic model parameters may use a dynamic model parameter corresponding to each of the levels of the decoding network when extending the dimension of an audio signal in the decoding network.
    Type: Application
    Filed: September 10, 2020
    Publication date: March 11, 2021
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Jongmo SUNG, Seung Kwon BEACK, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Jin Soo CHOI
  • Publication number: 20210005208
    Abstract: Disclosed are a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network, and performs learning based on a result obtained by mapping a node of an output layer of the neural network to a quantization level of an index of the residual signal.
    Type: Application
    Filed: November 18, 2019
    Publication date: January 7, 2021
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hui Yong Kim
  • Publication number: 20210005209
    Abstract: Disclosed are a method of encoding a high band of an audio, a method of decoding a high band of an audio, and an encoder and a decoder for performing the methods. The method of decoding a high band of an audio, the method performed by a decoder, includes identifying a parameter extracted through a first neural network, identifying side information extracted through a second neural network, and restoring a high band of an audio by applying the parameter and the side information to a third neural network.
    Type: Application
    Filed: March 10, 2020
    Publication date: January 7, 2021
    Applicants: Electronics and Telecommunications Research Institute, Kwangwoon University Industry-Academic Collaboration Foundation
    Inventors: Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Hochong PARK
  • Patent number: 10839819
    Abstract: Provided are an apparatus and method for encoding/decoding audio based on a block. A method of encoding an audio signal may include dividing each frame of an input signal constituting an audio signal into a plurality of subframes; transforming the subframes to a frequency domain; determining a two-dimensional (2D) intra block using the subframes transformed to the frequency domain; and encoding the 2D intra block. The 2D intra block may be a block that two-dimensionally displays frequency coefficients of the subframes transformed to the frequency domain using a time and a frequency.
    Type: Grant
    Filed: March 21, 2017
    Date of Patent: November 17, 2020
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Tae Jin Lee, Jongmo Sung, Mi Suk Lee, Dae Young Jang, Jin Soo Choi
  • Publication number: 20200349959
    Abstract: The inventive concept relates to an audio coding method to which CNN-based frequency spectrum recovery is applied. A part of the frequency spectral coefficients generated in transform coding is transmitted to a decoder, and the decoder recovers the frequency spectral coefficients that were not transmitted. Furthermore, the signs of the frequency spectral coefficients are transmitted from an encoder to the decoder depending on a sign transmission rule.
    Type: Application
    Filed: April 8, 2020
    Publication date: November 5, 2020
    Applicants: Electronics and Telecommunications Research Institute, Kwangwoon University Industry-Academic Collaboration Foundation
    Inventors: Hochong PARK, Seung Kwon BEACK, Jongmo SUNG, Seong-Hyeon SHIN, Mi Suk LEE, Tae Jin LEE, Jin Soo CHOI
  • Publication number: 20200135220
    Abstract: Disclosed are an audio signal encoding method and audio signal decoding method, and an encoder and decoder performing the same. The audio signal encoding method includes applying an audio signal to a training model including N autoencoders provided in a cascade structure, encoding an output result derived through the training model, and generating a bitstream with respect to the audio signal based on the encoded output result.
    Type: Application
    Filed: August 16, 2019
    Publication date: April 30, 2020
    Applicants: Electronics and Telecommunications Research Institute, THE TRUSTEES OF INDIANA UNIVERSITY
    Inventors: Mi Suk LEE, Jongmo SUNG, Minje KIM, Kai ZHEN
  • Publication number: 20200111501
    Abstract: Disclosed are an audio signal encoding method and device, and an audio signal decoding method and device. The encoding method includes transforming an original test signal of a time domain being an audio signal into a frequency domain, binarizing a coefficient of the original test signal of the frequency domain, performing an encoding layer feedforward using the binarized coefficient and a training model parameter derived through a training process, and performing an entropy encoding based on a result of performing the encoding layer feedforward.
    Type: Application
    Filed: August 15, 2019
    Publication date: April 9, 2020
    Applicants: Electronics and Telecommunications Research Institute, The Trustees of Indiana University
    Inventors: Jongmo SUNG, Seung Kwon BEACK, Mi Suk LEE, Tae Jin LEE, Minje KIM
  • Publication number: 20190180763
    Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.
    Type: Application
    Filed: November 5, 2018
    Publication date: June 13, 2019
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon BEACK, Woo-taek LIM, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Hui Yong KIM
  • Publication number: 20190164052
    Abstract: Provided is a training method of a neural network that is applied to an audio signal encoding method using an audio signal encoding apparatus, the training method including generating a masking threshold of a first audio signal before training is performed, calculating a weight matrix to be applied to a frequency component of the first audio signal based on the masking threshold, generating a weighted error function obtained by correcting a preset error function using the weight matrix, and generating a second audio signal by applying a parameter learned using the weighted error function to the first audio signal.
    Type: Application
    Filed: September 5, 2018
    Publication date: May 30, 2019
    Applicants: Electronics and Telecommunications Research Institute, THE TRUSTEES OF INDIANA UNIVERSITY
    Inventors: Jongmo SUNG, Minje KIM, Aswin Sivaraman, Kai Zhen
  • Publication number: 20190035412
    Abstract: Provided are an apparatus and method for encoding/decoding audio based on a block. A method of encoding an audio signal may include dividing each frame of an input signal constituting an audio signal into a plurality of subframes; transforming the subframes to a frequency domain; determining a two-dimensional (2D) intra block using the subframes transformed to the frequency domain; and encoding the 2D intra block. The 2D intra block may be a block that two-dimensionally displays frequency coefficients of the subframes transformed to the frequency domain using a time and a frequency.
    Type: Application
    Filed: March 21, 2017
    Publication date: January 31, 2019
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon BEACK, Tae Jin LEE, Jongmo SUNG, Mi Suk LEE, Dae Young JANG, Jin Soo CHOI
  • Publication number: 20180144755
    Abstract: Disclosed is an audio watermark insertion method. The audio watermark insertion method includes performing a modulated complex lapped transform (MCLT) on a first audio signal, inserting a bit string of a watermark in the first audio signal obtained by performing the MCLT, performing an inverse modified discrete cosine transform (IMDCT) on the first audio signal in which the bit string is inserted, and obtaining a second audio signal, which is the first audio signal in which the watermark is inserted, by performing an overlap-add on a signal obtained by performing the IMDCT and a neighbor frame signal.
    Type: Application
    Filed: September 20, 2017
    Publication date: May 24, 2018
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Mi Suk LEE, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE
  • Publication number: 20180144757
    Abstract: Disclosed is a bitstream generation method performed by an acoustic data transmission (ADT) encoder, the method including receiving a first audio signal, receiving additional information converted into a bitstream, and transmitting a second audio signal obtained by inserting the bitstream into the first audio signal, to an ADT decoder.
    Type: Application
    Filed: November 22, 2017
    Publication date: May 24, 2018
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Young Ho Jeong, Tae Jin Lee, Sang Won Suh
  • Patent number: 9424857
    Abstract: An encoding method of an encoder is provided. The encoder generates first MDCT coefficients by transforming an input signal, and generates MDCT indices by quantizing the first MDCT coefficients. The encoder generates second MDCT coefficients by dequantizing the MDCT indices, and calculates MDCT residual coefficients using differences between the first MDCT coefficients and the second MDCT coefficients. The encoder generates a residual index by encoding the MDCT residual coefficients, and generates gain indices corresponding to gains from the first MDCT coefficients and the second MDCT coefficients.
    Type: Grant
    Filed: March 31, 2011
    Date of Patent: August 23, 2016
    Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventors: Jongmo Sung, Hyun Woo Kim, Hyun Joo Bae
  • Patent number: 9111535
    Abstract: Provided are a method and an apparatus for decoding an audio signal. A method for decoding an audio signal encoded by a layered sinusoidal pulse coding scheme using one or more sinusoidal pulses includes decoding the encoded audio signal, setting a smoothing frequency band of the decoded audio signal according to a layer structure of the layered sinusoidal pulse coding scheme, dividing the smoothing frequency band into one or more subbands, and smoothing the decoded audio signal on a subband-by-subband basis. Accordingly, a decoding operation time can be reduced and the quality of a synthesized signal can be improved by variably setting a frequency band to be smoothed, when decoding an audio signal encoded by a layered sinusoidal pulse coding scheme using one or more sinusoidal pulses.
    Type: Grant
    Filed: January 21, 2011
    Date of Patent: August 18, 2015
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Heesik Yang, Mi-Suk Lee, Hyun-Woo Kim, Jongmo Sung, Hyun-Joo Bae, Byung-Sun Lee
  • Publication number: 20140324417
    Abstract: Provided are a method and an apparatus for encoding and decoding an audio signal. A method for encoding an audio signal includes receiving a transformed audio signal, dividing the transformed audio signal into a plurality of subbands, performing a first sinusoidal pulse coding operation on the subbands, determining a performance region of a second sinusoidal pulse coding operation among the subbands on the basis of coding information of the first sinusoidal pulse coding operation, and performing the second sinusoidal pulse coding operation on the determined performance region, wherein the first sinusoidal pulse coding operation is performed variably according to the coding information. Accordingly, it is possible to further improve the quality of a synthesized signal by considering the sinusoidal pulse coding of a lower layer when encoding or decoding an audio signal in an upper layer by a layered sinusoidal pulse coding scheme.
    Type: Application
    Filed: July 8, 2014
    Publication date: October 30, 2014
    Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventors: Mi-Suk LEE, Heesik YANG, Hyun-Woo KIM, Jongmo SUNG, Hyun-Joo BAE, Byung-Sun LEE
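
As an illustration of the quantize, dequantize, and residual steps described in the abstract of Patent number 9424857 above, the following is a minimal sketch and not the patented method. It assumes a uniform scalar quantizer with a fixed step size, uses a short list of made-up numbers in place of real MDCT coefficients, and omits the residual-index and gain-index encoding. The function names `quantize`, `dequantize`, and `residual` are hypothetical.

```python
def quantize(coeffs, step):
    """Map each coefficient to an integer index (assumed uniform scalar quantizer)."""
    return [round(c / step) for c in coeffs]


def dequantize(indices, step):
    """Reconstruct approximate coefficients from the integer indices."""
    return [i * step for i in indices]


def residual(first, second):
    """Element-wise difference between original and reconstructed coefficients."""
    return [a - b for a, b in zip(first, second)]


# Stand-in values for the "first MDCT coefficients" (illustrative only).
first_coeffs = [0.9, -2.2, 3.6, 0.1]
step = 0.5  # assumed quantizer step size

indices = quantize(first_coeffs, step)          # plays the role of the MDCT indices
second_coeffs = dequantize(indices, step)       # plays the role of the second MDCT coefficients
res = residual(first_coeffs, second_coeffs)     # plays the role of the MDCT residual coefficients

print(indices)
print(second_coeffs)
print(res)
```

With a uniform quantizer, each residual value is bounded in magnitude by half the step size, which is why a second coding stage applied to the residual can work with a much smaller dynamic range than the original coefficients.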