Abstract: Provided is a method of enhancing the quality of audio data, the method comprising: obtaining a spectrum of mixed audio data including noise; inputting two-dimensional (2D) input data corresponding to the spectrum to a convolutional network including a downsampling process and an upsampling process to obtain output data of the convolutional network; generating, based on the obtained output data, a mask for removing the noise included in the audio data; and removing the noise from the mixed audio data using the generated mask, wherein, in the convolutional network, the downsampling process and the upsampling process are performed only on a first axis of the 2D input data, and the remaining processes other than the downsampling process and the upsampling process are performed on both the first axis and a second axis.
Type:
Grant
Filed:
November 20, 2020
Date of Patent:
November 28, 2023
Assignees:
DEEPHEARING INC., The Industry & Academic Cooperation in Chungnam National University (IAC)
Abstract: Provided is a method of enhancing the quality of audio data, the method comprising: obtaining a spectrum of mixed audio data including noise; inputting two-dimensional (2D) input data corresponding to the spectrum to a convolutional network including a downsampling process and an upsampling process to obtain output data of the convolutional network; generating, based on the obtained output data, a mask for removing the noise included in the audio data; and removing the noise from the mixed audio data using the generated mask, wherein, in the convolutional network, the downsampling process and the upsampling process are performed only on a first axis of the 2D input data, and the remaining processes other than the downsampling process and the upsampling process are performed on both the first axis and a second axis.
Type:
Application
Filed:
November 20, 2020
Publication date:
August 31, 2023
Applicants:
DEEPHEARING INC., The Industry & Academic Cooperation in Chungnam National University (IAC)
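
The data flow described in the abstracts above can be illustrated with a minimal pure-Python sketch. It is not the patented implementation: the 3x3 kernel, the single down/up level, stride-2 row dropping, row repetition for upsampling, the sigmoid mask, and all function names are assumptions chosen for illustration, and the weights are untrained placeholders. The sketch only shows that resampling touches the first axis (rows) while the convolutions span both axes.

```python
import math

def conv2d_same(x, kernel):
    """3x3 cross-correlation with zero padding; spans BOTH axes, keeps shape."""
    rows, cols = len(x), len(x[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            acc = 0.0
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols:
                        acc += x[rr][cc] * kernel[dr + 1][dc + 1]
            out[r][c] = acc
    return out

def downsample_first_axis(x):
    """Halve the first axis only (keep every second row); columns untouched."""
    return [row[:] for row in x[::2]]

def upsample_first_axis(x):
    """Double the first axis only (repeat each row); columns untouched."""
    return [row[:] for row in x for _ in range(2)]

def estimate_mask(spectrum, kernel):
    """Conv -> downsample (axis 0) -> conv -> upsample (axis 0) -> conv -> sigmoid."""
    h = conv2d_same(spectrum, kernel)
    h = downsample_first_axis(h)
    h = conv2d_same(h, kernel)
    h = upsample_first_axis(h)
    h = conv2d_same(h, kernel)
    return [[1.0 / (1.0 + math.exp(-v)) for v in row] for row in h]

def apply_mask(spectrum, mask):
    """Element-wise product of the noisy spectrum and the mask."""
    return [[s * m for s, m in zip(s_row, m_row)]
            for s_row, m_row in zip(spectrum, mask)]

# Hypothetical, untrained kernel and a toy 4x6 magnitude "spectrum".
# The row count is assumed even so upsampling restores the original shape.
KERNEL = [[0.0, 0.1, 0.0], [0.1, 0.2, 0.1], [0.0, 0.1, 0.0]]
spectrum = [[float((r + c) % 3) for c in range(6)] for r in range(4)]

mask = estimate_mask(spectrum, KERNEL)
enhanced = apply_mask(spectrum, mask)
```

Because only the first axis is resampled, the second axis keeps its full length at every stage of the network, so the mask lines up with the input spectrum along that axis without any interpolation.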