Patents by Inventor Yongwook Nam

Yongwook Nam has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11776528
    Abstract: This application relates to a method of synthesizing a speech of which a speed and a pitch are changed. In one aspect, the method includes a spectrogram may be generated by performing a short-time Fourier transformation on a first speech signal based on a first hop length and a first window length, and speech signals of sections having a second window length at the interval of a second hop length from the spectrogram. A ratio between the first hop length and the second hop length may be set to be equal to the value of a playback rate and a ratio between the first window length and the second window length may be set to be equal to the value of a pitch change rate, thereby generating a second speech signal of which the speed and the pitch are changed.
    Type: Grant
    Filed: July 20, 2021
    Date of Patent: October 3, 2023
    Assignee: Xinapse Co., Ltd.
    Inventors: Jinbeom Kang, Dong Won Joo, Yongwook Nam
  • Publication number: 20230037892
    Abstract: A computer-implemented method of generating speech training data is proposed. The method may include generating, at a processor, a recording script corresponding to particular text. The method may also include generating, at the processor, recorded data by performing recording by a speaker based on the recording script. The method may further include labeling, at the processor, the recorded data. Various embodiments can generate a large amount of speech training data for training an artificial neural network model while minimizing a worker's inconvenience and time consumption.
    Type: Application
    Filed: July 25, 2022
    Publication date: February 9, 2023
    Inventors: Dong Won JOO, Jinbeom KANG, Yongwook NAM, Jung Hoon LEE
  • Publication number: 20230037541
    Abstract: A method of synthesizing speeches by scoring the speeches is proposed. The method may include generating a spectrogram based on utterer information and a text and generating a plurality of sub-speeches corresponding to the spectrogram. The method may also include selecting one of the plurality of sub-speeches and generating a final speech by using the selected sub-speech.
    Type: Application
    Filed: July 19, 2022
    Publication date: February 9, 2023
    Inventors: Dong Won JOO, Jinbeom KANG, Yongwook NAM, Jung Hoon Lee
  • Publication number: 20220165247
    Abstract: This application relates to a speech synthesis system. In one aspect, the system includes an encoder configured to generate a speaker embedding vector corresponding to a verbal speech based on a first speech signal corresponding to a verbal utterance. The system may also include a synthesizer configured to perform at least once the cycle including generating a plurality of spectrograms corresponding to verbal utterance of the sequence of the text, based on the speaker embedding vector and a sequence of a text written in a particular natural language and selecting a first spectrogram from among the spectrograms, to output the first spectrogram. The system may further include a vocoder configured to generate a second speech signal corresponding to the sequence of the text based on the first spectrogram.
    Type: Application
    Filed: July 20, 2021
    Publication date: May 26, 2022
    Inventors: Jinbeom Kang, Dong Won Joo, Yongwook Nam, Seung Jae Lee
  • Publication number: 20220165250
    Abstract: This application relates to a method of synthesizing a speech of which a speed and a pitch are changed. In one aspect, the method includes a spectrogram may be generated by performing a short-time Fourier transformation on a first speech signal based on a first hop length and a first window length, and speech signals of sections having a second window length at the interval of a second hop length from the spectrogram. A ratio between the first hop length and the second hop length may be set to be equal to the value of a playback rate and a ratio between the first window length and the second window length may be set to be equal to the value of a pitch change rate, thereby generating a second speech signal of which the speed and the pitch are changed.
    Type: Application
    Filed: July 20, 2021
    Publication date: May 26, 2022
    Inventors: Jinbeom Kang, Dong Won Joo, Yongwook Nam, Seung Jae Lee