Patents by Inventor Woo Taek LIM

Woo Taek LIM has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11562757
    Abstract: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted, using frequency-domain linear predictive coding (LPC), from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block are combined, generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
    Type: Grant
    Filed: July 15, 2021
    Date of Patent: January 24, 2023
    Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang, Jin Soo Choi
  • Patent number: 11545163
    Abstract: A loss function of a signal including an audio signal is determined. A loss function determining system for an audio signal is provided. A loss function is determined by: determining a reference quantization index by quantizing an original input signal; inputting the original input signal to a neural network classifier and applying an activation function to an output layer of the neural network classifier; and determining a total loss function for the neural network classifier using an output of the activation function and the reference quantization index.
    Type: Grant
    Filed: December 27, 2019
    Date of Patent: January 3, 2023
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Woo-taek Lim, Tae Jin Lee
  • Publication number: 20220375483
    Abstract: Disclosed are methods of encoding and decoding an audio signal, and an encoder and a decoder for performing the methods. The method of encoding an audio signal includes identifying an input signal corresponding to a low frequency band of the audio signal, windowing the input signal, generating a first latent vector by inputting the windowed input signal to a first encoding model, transforming the windowed input signal into a frequency domain, generating a second latent vector by inputting the transformed input signal to a second encoding model, generating a final latent vector by combining the first latent vector and the second latent vector, and generating a bitstream corresponding to the final latent vector.
    Type: Application
    Filed: November 8, 2021
    Publication date: November 24, 2022
    Inventors: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG, Jong-won SEOK, Yunsu KIM
  • Publication number: 20220358940
    Abstract: Disclosed are methods of encoding and decoding an audio signal using side information, and an encoder and a decoder for performing the methods. The method of encoding an audio signal using side information includes identifying an input signal, the input signal being an original audio signal, extracting side information from the input signal using a learning model trained to extract side information from a feature vector of the input signal, encoding the input signal, and generating a bitstream by combining the encoded input signal and the side information.
    Type: Application
    Filed: November 16, 2021
    Publication date: November 10, 2022
    Inventors: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG, Jong Won SHIN, Soojoong HWANG, Youngju CHEON, Sangwook HAN
  • Publication number: 20220355664
    Abstract: A vehicle includes a plurality of tactile input devices configured to receive a tactile input for controlling a function of the vehicle; a microphone configured to receive an audio input; and a voice recognition system configured to control the function of the vehicle based on the audio input, where the voice recognition system is configured to determine a target object to be controlled based on the tactile input, determine a control instruction for the target object based on the audio input, and control the target object based on the control instruction.
    Type: Application
    Filed: February 14, 2022
    Publication date: November 10, 2022
    Inventors: Sungwang Kim, Woo Taek Lim, Minjae Park, Donghyeon Lee
  • Publication number: 20220335963
    Abstract: An audio signal encoding and decoding method using a neural network model, and an encoder and decoder for performing the same are disclosed. A method of encoding an audio signal using a neural network model may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model encoding the input signal, and generating a bitstream corresponding to the quantized latent vector, wherein the neural network model may include i) a feature extraction layer generating a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks downsampling the latent vector, and iii) a plurality of quantization blocks performing quantization of a downsampled latent vector.
    Type: Application
    Filed: February 11, 2022
    Publication date: October 20, 2022
    Inventors: Inseon JANG, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Woo-taek LIM, Hong-Goo KANG, Jihyun LEE, Chanwoo LEE, Hyungseob LIM
  • Publication number: 20220262378
    Abstract: An audio signal encoding and decoding method using a learning model, a training method of the learning model, and an encoder and decoder that perform the method, are disclosed. The audio signal decoding method may include extracting a first residual signal and a first linear prediction coefficient by decoding a bitstream received from an encoder, generating a first audio signal from the first residual signal using the first linear prediction coefficient, generating a second linear prediction coefficient and a second residual signal from the first audio signal, obtaining a third linear prediction coefficient by inputting the second linear prediction coefficient into a trained learning model, and generating a second audio signal from the second residual signal using the third linear prediction coefficient.
    Type: Application
    Filed: February 15, 2022
    Publication date: August 18, 2022
    Inventors: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Inseon JANG
  • Publication number: 20220238126
    Abstract: Methods of encoding and decoding an audio signal using a learning model and an encoder and a decoder for performing the methods are disclosed. A method of encoding an audio signal using a learning model may include extracting pitch information of the audio signal, determining a dilation factor of a receptive field of a first expandable neural network block to extract a feature map from the audio signal based on the pitch information, generating a first feature map of the audio signal using the first expandable neural network block in which the dilation factor is determined, determining a second feature map by inputting the first feature map into a second expandable neural network block to process the first feature map, and converting the second feature map and the pitch information into a bitstream.
    Type: Application
    Filed: January 7, 2022
    Publication date: July 28, 2022
    Inventors: Jongmo SUNG, Seung Kwon BEACK, Tae Jin LEE, Woo-taek LIM, Inseon JANG
  • Publication number: 20220157326
    Abstract: A method of generating a residual signal performed by an encoder includes identifying an input signal including an audio sample, generating a first residual signal from the input signal using linear predictive coding (LPC), generating a second residual signal having a smaller amount of information than the first residual signal by transforming the first residual signal, transforming the second residual signal into a frequency domain, and generating a third residual signal having a smaller amount of information than the second residual signal from the transformed second residual signal using frequency-domain prediction (FDP) coding.
    Type: Application
    Filed: October 21, 2021
    Publication date: May 19, 2022
    Inventors: Seung Kwon BEACK, Jongmo SUNG, Tae Jin LEE, Woo-taek LIM, Inseon JANG
  • Publication number: 20220020385
    Abstract: An audio signal encoding method performed by an encoder includes identifying a time-domain audio signal in a unit of blocks, quantizing a linear prediction coefficient extracted, using frequency-domain linear predictive coding (LPC), from a combined block in which a current original block of the audio signal and a previous original block chronologically adjacent to the current original block are combined, generating a temporal envelope by dequantizing the quantized linear prediction coefficient, extracting a residual signal from the combined block based on the temporal envelope, quantizing the residual signal by one of time-domain quantization and frequency-domain quantization, and transforming the quantized residual signal and the quantized linear prediction coefficient into a bitstream.
    Type: Application
    Filed: July 15, 2021
    Publication date: January 20, 2022
    Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang, Jin Soo Choi
  • Publication number: 20220005487
    Abstract: An audio signal encoding and decoding method using a neural network model, a method of training the neural network model, and an encoder and decoder performing the methods are disclosed. The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, computing an output signal from the first feature information using a recurrent decoding model, calculating a residual signal by subtracting the output signal from the input signal, computing the second feature information of the residual signal using a nonrecurrent encoding model, and converting the first feature information and the second feature information to a bitstream.
    Type: Application
    Filed: July 6, 2021
    Publication date: January 6, 2022
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Jongmo SUNG, Seung Kwon BEACK, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Inseon JANG
  • Publication number: 20220005488
    Abstract: The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, quantizing the first feature information and producing the first feature bitstream, computing the first output signal from the quantized first feature information using a recurrent decoding model, computing the second feature information of the input signal using a nonrecurrent encoding model, quantizing the second feature information and producing the second feature bitstream, computing the second output signal from the quantized second feature information using a nonrecurrent decoding model, determining an encoding mode based on the input signal, the first and second output signals, and the first and second feature bitstreams, and outputting an overall bitstream by multiplexing an encoding mode bit and one of the first feature bitstream and the second feature bitstream depending on the encoding mode.
    Type: Application
    Filed: July 6, 2021
    Publication date: January 6, 2022
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Jongmo SUNG, Seung Kwon BEACK, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Inseon JANG
  • Publication number: 20210398547
    Abstract: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.
    Type: Application
    Filed: May 26, 2021
    Publication date: December 23, 2021
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Inseon JANG
  • Patent number: 11205442
    Abstract: Provided is a sound event recognition method that may improve sound event recognition performance using a correlation between different sound signal feature parameters based on a neural network. In detail, the method may extract a sound signal feature parameter from a sound signal including a sound event, and recognize the sound event included in the sound signal by applying a convolutional neural network (CNN) trained using the sound signal feature parameter.
    Type: Grant
    Filed: September 5, 2019
    Date of Patent: December 21, 2021
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Young Ho Jeong, Sang Won Suh, Tae Jin Lee, Woo-taek Lim, Hui Yong Kim
  • Publication number: 20210390967
    Abstract: Disclosed is a method of encoding and decoding an audio signal using linear predictive coding (LPC) and an encoder and a decoder that perform the method. The method of encoding an audio signal to be performed by the encoder includes identifying a time-domain audio signal block-wise, quantizing a linear prediction coefficient obtained from a block of the audio signal through the LPC, generating an envelope based on the quantized linear prediction coefficient, extracting a residual signal based on the envelope and a result of converting the block into a frequency domain, grouping the residual signal by each sub-band and determining a scale factor for quantizing the grouped residual signal, quantizing the residual signal using the scale factor, and converting the quantized residual signal and the quantized linear prediction coefficient into a bitstream and transmitting the bitstream to a decoder.
    Type: Application
    Filed: April 28, 2021
    Publication date: December 16, 2021
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Inseon JANG, Jin Soo CHOI
  • Publication number: 20210366497
    Abstract: Methods of encoding and decoding a speech signal using a neural network model that recognizes sound sources, and encoding and decoding apparatuses for performing the methods are provided. A method of encoding a speech signal includes identifying an input signal for a plurality of sound sources; generating a latent signal by encoding the input signal; obtaining a plurality of sound source signals by separating the latent signal for each of the plurality of sound sources; determining a number of bits used for quantization of each of the plurality of sound source signals according to a type of each of the plurality of sound sources; quantizing each of the plurality of sound source signals based on the determined number of bits; and generating a bitstream by combining the plurality of quantized sound source signals.
    Type: Application
    Filed: May 20, 2021
    Publication date: November 25, 2021
    Applicants: Electronics and Telecommunications Research Institute, The Trustees of Indiana University
    Inventors: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Inseon JANG, Minje KIM, Haici YANG
  • Patent number: 11133015
    Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.
    Type: Grant
    Filed: November 5, 2018
    Date of Patent: September 28, 2021
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Woo-taek Lim, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hui Yong Kim
  • Publication number: 20210174815
    Abstract: Disclosed are a quantizing method for a latent vector and a computing device for performing the quantization method. A quantizing method of a latent vector includes performing information shaping on the latent vector resulting from reduction in a dimension of an input signal using a target neural network; clamping a residual signal of the latent vector derived based on the information shaping; performing rescaling on the clamped residual signal; and performing quantization on the rescaled residual signal.
    Type: Application
    Filed: December 4, 2020
    Publication date: June 10, 2021
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon BEACK, Jooyoung LEE, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE, Woo-taek LIM, Seunghyun CHO, Jin Soo CHOI
  • Publication number: 20210174252
    Abstract: Disclosed is an apparatus and method for augmenting training data using a notch filter. The method may include obtaining original data, and obtaining training data having a modified frequency component from the original data by filtering the original data using a filter configured to remove a component of a predetermined frequency band.
    Type: Application
    Filed: July 13, 2020
    Publication date: June 10, 2021
    Applicants: Electronics and Telecommunications Research Institute, Kyungpook National University Industry-Academic Cooperation Foundation
    Inventors: Young Ho JEONG, Soo Young PARK, Sang Won SUH, Woo-taek LIM, Minhan KIM, Seokjin LEE
  • Publication number: 20210166706
    Abstract: Disclosed is an apparatus and method for encoding/decoding an audio signal using information of a previous frame. An audio signal encoding method includes: generating a current latent vector by reducing dimension of a current frame of an audio signal; generating a concatenation vector by concatenating a previous latent vector generated by reducing dimension of a previous frame of the audio signal with the current latent vector; and encoding and quantizing the concatenation vector.
    Type: Application
    Filed: November 27, 2020
    Publication date: June 3, 2021
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Woo-taek LIM, Seung Kwon BEACK, Jongmo SUNG, Mi Suk LEE, Tae Jin LEE
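
The data flow described in the last abstract above (publication 20210166706) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the method claimed in the application: the chunk-averaging `reduce_dimension` stands in for whatever learned dimension reduction the application uses, the uniform `quantize` is an assumed quantizer, the zero vector used before the first frame is an assumption, and the separate encoding of the concatenation vector is omitted. All function names and parameters are hypothetical.

```python
from typing import List, Sequence


def reduce_dimension(frame: Sequence[float], out_dim: int) -> List[float]:
    """Hypothetical stand-in for a learned encoder: average equal-sized chunks."""
    if len(frame) % out_dim != 0:
        raise ValueError("frame length must be a multiple of out_dim")
    chunk = len(frame) // out_dim
    return [sum(frame[i * chunk:(i + 1) * chunk]) / chunk for i in range(out_dim)]


def quantize(vector: Sequence[float], step: float) -> List[int]:
    """Uniform scalar quantization (an assumed, deliberately simple quantizer)."""
    return [round(v / step) for v in vector]


def encode_frames(frames: Sequence[Sequence[float]],
                  out_dim: int = 2,
                  step: float = 0.25) -> List[List[int]]:
    """Quantize [previous latent | current latent] for each frame in order."""
    # Assumption: before the first frame, the previous latent vector is all zeros.
    previous = [0.0] * out_dim
    codes = []
    for frame in frames:
        current = reduce_dimension(frame, out_dim)
        # Concatenate the previous latent vector with the current one.
        concatenated = previous + current
        codes.append(quantize(concatenated, step))
        previous = current
    return codes


if __name__ == "__main__":
    frames = [[1.0, 1.0, 0.5, 0.5], [0.0, 0.5, 1.0, 1.0]]
    for code in encode_frames(frames):
        print(code)
```

Each output vector is twice the latent dimension, because it carries the previous frame's latent values alongside the current ones.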