Patents by Inventor Hong-kook Kim

Hong-kook Kim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Stereo extension apparatus and method

Patent number: 9288602

Abstract: Disclosed herein are a stereo extension apparatus and method. The apparatus includes a database that stores predetermined information as a result of Gaussian mixture model (GMM) training or hidden Markov model (HMM) training; a modified discrete cosine transform (MDCT) transformer that transforms a mono signal through MDCT, a feature parameter extractor that extracts a feature parameter of the mono signal from an MDCT coefficient output from the MDCT transformer, a side signal energy estimator that estimates subband energy of a side signal with reference to information stored in the database based on the feature parameter; an energy controller that obtains the MDCT coefficient of a side signal estimated from the subband energy of the estimated side signal, an inverse MDCT transformer that obtains an estimated side signal by transforming the MDCT coefficient of the estimated side signal through inverse MDCT.

Type: Grant

Filed: June 11, 2014

Date of Patent: March 15, 2016

Assignee: GWANGJU INSTITUTE OF SCIENCE AND TECHNOLOGY

Inventors: Hong Kook Kim, Nam In Park
Packet loss concealment for bandwidth extension of speech signals

Patent number: 9280978

Abstract: Disclosed is a speech receiving apparatus. A low-band PLC module and a synthesis filter reconstructs a low-band speech signal of a lost frame from a previous good frame. A high-band PLC module reconstructs a high-band speech signal of the lost frame from the previous good frame. A transforming part transforms the low-band speech signal into a frequency range. A bandwidth extending part generates at least an extended MDCT coefficient as information for the high-band speech signal from the low-band speech signal transformed by the transforming part. A smoothing part smoothes the extended MDCT coefficient. An inverse transforming part inversely transforms the extended MDCT coefficient smoothed by the smoothing part to a time domain. A synthesizing part synthesizes the low-band speech signal, and the high-band speech signal which is inverse-transformed by the inverse transforming part and reconstructed, to output a wideband speech signal.

Type: Grant

Filed: March 27, 2013

Date of Patent: March 8, 2016

Assignee: Gwangju Institute of Science and Technology

Inventors: Hong-Kook Kim, Nam-In Park
Method and apparatus for audio encoding for noise reduction

Patent number: 9202454

Abstract: A method and apparatus for audio signal encoding for noise reduction are provided. The method includes: receiving an audio signal and performing modified discrete cosine transformation (MDCT) on the audio signal to convert the audio signal into a long block or a short block; reducing noise included in the audio signal in accordance with the long block or the short block; and performing advanced audio coding (AAC) on the long block or the short block in which noise is reduced.

Type: Grant

Filed: January 31, 2013

Date of Patent: December 1, 2015

Assignees: SAMSUNG ELECTRONICS CO., LTD., GWANGJU INSTITUTE OF SCIENCE AND TECHNOLOGY

Inventors: Myung-kyu Choi, Sang-ryong Kim, Seong-woon Kim, Ung-sik Kim, Kwang-il Hwang, Duk-soo Kim, Hong-kook Kim, Nam-in Park, Kwang-myung Jeon
DEVICE AND METHOD FOR PLAYING SOUND

Publication number: 20150271618

Abstract: The present invention extracts azimuth information on a sound source, read the touch state of a touch screen on which an image is displayed, and enables a sound source having azimuth information corresponding to a place touched on the image to be synthesized so as to be distinguished from other sound sources. According to the present invention, since it is possible to listen to the distinguished sound of a desired location on an image, a user may be provided with more satisfaction.

Type: Application

Filed: October 4, 2013

Publication date: September 24, 2015

Inventors: Hong Kook Kim, Chan Jun Chun
Apparatus and method for eliminating noise

Patent number: 9123347

Abstract: Provided are an apparatus and method for eliminating noise. The method includes: detecting a speech section from a noise speech signal including a noise signal; separating the speech section into a consonant section and a vowel section on the basis of a VOP at the speech section; calculating a transfer function of a filter for eliminating the noise signal to allow the degree of noise elimination to be different in the consonant section and the vowel section; and eliminating the noise signal from the noise speech signal on the basis of the transfer function.

Type: Grant

Filed: August 29, 2012

Date of Patent: September 1, 2015

Assignee: Gwangju Institute of Science and Technology

Inventors: Hong Kook Kim, Ji Hun Park, Woo Kyeong Seong
Apparatus for correcting error in speech recognition

Patent number: 9099082

Abstract: An apparatus for correcting errors in speech recognition is provided. The apparatus includes a feature vector extracting unit extracting feature vectors from a received speech. A speech recognizing unit recognizes the received speech as a word sequence on the basis of the extracted feature vectors. A phoneme weighted finite state transducer (WFST)-based converting unit converts the recognized word sequence recognized by the speech recognizing unit into a phoneme WFST. A speech recognition error correcting unit corrects errors in the converted phoneme WFST. The speech recognition error correcting unit includes a WFST synthesizing unit modeling a phoneme WFST transferred from the phoneme WFST-based converting unit as pronunciation variation on the basis of a Kullback-Leibler (KL) distance matrix.

Type: Grant

Filed: May 16, 2013

Date of Patent: August 4, 2015

Assignee: Gwangju Institute of Science and Technology

Inventors: Hong-Kook Kim, Woo-Kyeong Seong, Ji-Hun Park
APPARATUS AND METHOD FOR EXTENDING BANDWIDTH OF SOUND SIGNAL

Publication number: 20150112692

Abstract: Disclosed is an apparatus for extending a bandwidth of a sound signal. The apparatus includes a database that stores predetermined training information as a result of at least one of Gaussian mixture model (GMM) training and hidden Markov model (HMM) training; a modified discrete cosine transform (MDCT) transformer that transforms a first band signal through MDCT, a feature extractor that extracts a feature parameter of the first band signal from an MDCT coefficient output from the MDCT transformer; an extender that provides an extended MDCT coefficient for a second band signal based on the MDCT coefficient of the first band signal output from the MDCT transformer, a subband energy estimator that estimates subband energy of the second band signal with reference to information stored in the database based on the feature parameter.

Type: Application

Filed: June 11, 2014

Publication date: April 23, 2015

Inventors: Hong Kook KIM, Nam In PARK
STEREO EXTENSION APPARATUS AND METHOD

Publication number: 20150071445

Abstract: Disclosed herein are a stereo extension apparatus and method. The apparatus includes a database that stores predetermined information as a result of Gaussian mixture model (GMM) training or hidden Markov model (HMM) training; a modified discrete cosine transform (MDCT) transformer that transforms a mono signal through MDCT, a feature parameter extractor that extracts a feature parameter of the mono signal from an MDCT coefficient output from the MDCT transformer, a side signal energy estimator that estimates subband energy of a side signal with reference to information stored in the database based on the feature parameter; an energy controller that obtains the MDCT coefficient of a side signal estimated from the subband energy of the estimated side signal, an inverse MDCT transformer that obtains an estimated side signal by transforming the MDCT coefficient of the estimated side signal through inverse MDCT.

Type: Application

Filed: June 11, 2014

Publication date: March 12, 2015

Inventors: Hong Kook KIM, Nam In Park
Method and device for extending bandwidth of speech signal

Patent number: 8909539

Abstract: A method for extending a bandwidth of a speech signal received, according to an embodiment of the present invention, includes: transforming the received speech signal into a frequency domain by decoding the received speech signal; normalizing the transformed speech signal; differentiating a voiced sound period or unvoiced sound period from the received speech signal; extracting, from the normalized speech signal, a first period including a harmonic component of the voiced sound period on the basis of the voiced sound period; extracting, from the normalized speech signal, a second period on the basis of correlation between the unvoiced sound period and the normalized speech signal; generating a high-band speech signal on the basis of the first period and the second period; and synthesizing the generated high-band speech signal and the transformed speech signal to output a wideband speech signal.

Type: Grant

Filed: December 7, 2012

Date of Patent: December 9, 2014

Assignee: Gwangju Institute of Science and Technology

Inventors: Hong Kook Kim, Nam In Park
FRAME ERASURE CONCEALMENT TECHNIQUE FOR A BITSTREAM-BASED FEATURE EXTRACTOR

Publication number: 20140330564

Abstract: A frame erasure concealment technique for a bitstream-based feature extractors in a speech recognition system particularly suited for use in a wireless communication system operates to “delete” each frame in which an erasure is declared. The deletions thus reduce the length of the observation sequence, but have been found to provide for sufficient speech recognition based on both single word and “string” tests of the deletion technique.

Type: Application

Filed: May 19, 2014

Publication date: November 6, 2014

Applicant: AT&T Intellectual Property II, L.P.

Inventors: Richard Vandervoort Cox, Hong Kook Kim
Apparatus and method for coding signal in a communication system

Patent number: 8751225

Abstract: Provided is an apparatus and method for encoding a voice and audio signal by expanding a modified discrete cosine transform (MDCT) based CODEC to a wideband and a super-wideband in a communication system. The apparatus for encoding a signal in a communication system, includes a converter configured to convert a time domain signal corresponding to a service to be provided to users to a frequency domain signal, a quantization and normalization unit configured to calculate and quantize gain of each subband in the converted frequency domain signal and normalize a frequency coefficient of the each subband, a search unit configured to search patch information of each subband in the converted frequency domain signal using the normalized frequency coefficient, and a packetizer configured to packetize the quantized gain and the searched patch information and encode gain information of each subband in the frequency domain signal.

Type: Grant

Filed: May 12, 2011

Date of Patent: June 10, 2014

Assignee: Electronics and Telecommunications Research Institute

Inventors: Mi-Suk Lee, Hong-Kook Kim, Young-Han Lee
Frame erasure concealment technique for a bitstream-based feature extractor

Patent number: 8731921

Abstract: A frame erasure concealment technique for a bitstream-based feature extractor in a speech recognition system particularly suited for use in a wireless communication system operates to “delete” each frame in which an erasure is declared. The deletions thus reduce the length of the observation sequence, but have been found to provide for sufficient speech recognition based on both single word and “string” tests of the deletion technique.

Type: Grant

Filed: November 30, 2012

Date of Patent: May 20, 2014

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Richard Vandervoort Cox, Hong Kook Kim
METHOD AND DEVICE FOR BANDWIDTH EXTENSION

Publication number: 20130317812

Abstract: Method and device of extending a signal band of a voice or audio signal are provided. The bandwidth extension method includes the steps of: performing a modified discrete cosine transform (MDCT) process on an input signal to generate a first transform signal; generating a second transform signal and a third transform signal on the basis of the first transform signal; generating normalized components and energy components of the first transform signal, the second transform signal, and the third transform signal therefrom; generating an extended normalized component from the normalized components and generating an extended energy component from the energy components; generating an extended transform signal on the basis of the extended normalized component and the extended energy component; and performing an inverse MDCT (IMDCT) process on the extended transform signal.

Type: Application

Filed: February 8, 2012

Publication date: November 28, 2013

Applicant: LG Electronics Inc.

Inventors: Gyu Hyeok Jeong, Young Han Lee, Hye Jeong Jeon, Hong Kook Kim, In Gyu Kang, Lag Young Kim
APPARATUS FOR CORRECTING ERROR IN SPEECH RECOGNITION

Publication number: 20130311182

Abstract: An apparatus for correcting errors in speech recognition is provided. The apparatus includes a feature vector extracting unit extracting feature vectors from a received speech. A speech recognizing unit recognizes the received speech as a word sequence on the basis of the extracted feature vectors. A phoneme weighted finite state transducer (WFST)-based converting unit converts the recognized word sequence recognized by the speech recognizing unit into a phoneme WFST. A speech recognition error correcting unit corrects errors in the converted phoneme WFST. The speech recognition error correcting unit includes a WFST synthesizing unit modeling a phoneme WFST transferred from the phoneme WFST-based converting unit as pronunciation variation on the basis of a Kullback-Leibler (KL) distance matrix.

Type: Application

Filed: May 16, 2013

Publication date: November 21, 2013

Inventors: Hong-Kook KIM, Woo-Kyeong SEONG, Ji-Hun PARK
SPEECH RECEIVING APPARATUS, AND SPEECH RECEIVING METHOD

Publication number: 20130262122

Abstract: Disclosed is a speech receiving apparatus. A low-band PLC module and a synthesis filter reconstructs a low-band speech signal of a lost frame from a previous good frame. A high-band PLC module reconstructs a high-band speech signal of the lost frame from the previous good frame. A transforming part transforms the low-band speech signal into a frequency range. A bandwidth extending part generates at least an extended MDCT coefficient as information for the high-band speech signal from the low-band speech signal transformed by the transforming part. A smoothing part smoothes the extended MDCT coefficient. An inverse transforming part inversely transforms the extended MDCT coefficient smoothed by the smoothing part to a time domain. A synthesizing part synthesizes the low-band speech signal, and the high-band speech signal which is inverse-transformed by the inverse transforming part and reconstructed, to output a wideband speech signal.

Type: Application

Filed: March 27, 2013

Publication date: October 3, 2013

Applicant: GWANGJU INSTITUTE OF SCIENCE AND TECHNOLOGY

Inventors: Hong-Kook KIM, Nam-In PARK
VOICE ANALYSIS APPARATUS, VOICE SYNTHESIS APPARATUS, VOICE ANALYSIS SYNTHESIS SYSTEM

Publication number: 20130262098

Abstract: A speech analysis apparatus is provided. An F0 extraction part extracts a pitch value from speech information. A spectrum extraction part extracts spectrum information from the speech information. An MVF extraction part extract a maximum voiced frequency and allows boundary information for respectively filtering a harmonic component and a non-harmonic component to be obtained. According to the speech analysis apparatus, speech synthesis apparatus, and speech analysis synthesis system of the present invention, speech that is closer to the original voice and is more natural may be synthesized. Also, speech may be represented with less data capacity.

Type: Application

Filed: March 27, 2013

Publication date: October 3, 2013

Applicant: GWANGJU INSTITUTE OF SCIENCE AND TECHNOLOGY

Inventors: Hong-Kook KIM, Kwang-Myung JEON
METHOD AND APPARATUS FOR AUDIO ENCODING FOR NOISE REDUCTION

Publication number: 20130262129

Abstract: A method and apparatus for audio signal encoding for noise reduction are provided. The method includes: receiving an audio signal and performing modified discrete cosine transformation (MDCT) on the audio signal to convert the audio signal into a long block or a short block; reducing noise included in the audio signal in accordance with the long block or the short block; and performing advanced audio coding (AAC) on the long block or the short block in which noise is reduced.

Type: Application

Filed: January 31, 2013

Publication date: October 3, 2013

Applicants: GWANGJU INSTITUTE OF SCIENCE AND TECHNOLOGY, SAMSUNG ELECTRONICS CO., LTD.

Inventors: Myung-kyu Choi, Sang-ryong Kim, Seong-woon Kim, Ung-sik Kim, Kwang-il Hwang, Duk-soo Kim, Hong-kook Kim, Nam-in Park, Kwang-myung Jeon
Method and apparatus for predicting word accuracy in automatic speech recognition systems

Patent number: 8538752

Abstract: The invention comprises a method and apparatus for predicting word accuracy. Specifically, the method comprises obtaining an utterance in speech data where the utterance comprises an actual word string, processing the utterance for generating an interpretation of the actual word string, processing the utterance to identify at least one utterance frame, and predicting a word accuracy associated with the interpretation according to at least one stationary signal-to-noise ratio and at least one non-stationary signal to noise ratio, wherein the at least one stationary signal-to-noise ratio and the at least one non-stationary signal to noise ratio are determined according to a frame energy associated with each of the at least one utterance frame.

Type: Grant

Filed: May 7, 2012

Date of Patent: September 17, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Mazin Gilbert, Hong Kook Kim
Acoustic model adaptation methods based on pronunciation variability analysis for enhancing the recognition of voice of non-native speaker and apparatus thereof

Patent number: 8515753

Abstract: The example embodiment of the present invention provides an acoustic model adaptation method for enhancing recognition performance for a non-native speaker's speech. In order to adapt acoustic models, first, pronunciation variations are examined by analyzing a non-native speaker's speech. Thereafter, based on variation pronunciation of a non-native speaker's speech, acoustic models are adapted in a state-tying step during a training process of acoustic models. When the present invention for adapting acoustic models and a conventional acoustic model adaptation scheme are combined, more-enhanced recognition performance can be obtained. The example embodiment of the present invention enhances recognition performance for a non-native speaker's speech while reducing the degradation of recognition performance for a native speaker's speech.

Type: Grant

Filed: March 30, 2007

Date of Patent: August 20, 2013

Assignee: Gwangju Institute of Science and Technology

Inventors: Hong Kook Kim, Yoo Rhee Oh, Jae Sam Yoon
APPARATUS AND METHOD FOR ELIMINATING NOISE

Publication number: 20130054234

Abstract: Provided are an apparatus and method for eliminating noise. The method includes: detecting a speech section from a noise speech signal including a noise signal; separating the speech section into a consonant section and a vowel section on the basis of a VOP at the speech section; calculating a transfer function of a filter for eliminating the noise signal to allow the degree of noise elimination to be different in the consonant section and the vowel section; and eliminating the noise signal from the noise speech signal on the basis of the transfer function.

Type: Application

Filed: August 29, 2012

Publication date: February 28, 2013

Applicant: GWANGJU INSTITUTE OF SCIENCE AND TECHNOLOGY

Inventors: Hong Kook KIM, Ji Hun PARK, Woo Kyeong SEONG

prev 1 2 3 4 next