Patents by Inventor Seungdo CHOI

Seungdo CHOI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11942077
    Abstract: An electronic device for providing a text-to-speech (TTS) service and an operating method therefor are provided. The operating method of the electronic device includes obtaining target voice data based on an utterance input of a specific speaker, determining a number of learning steps of the target voice data, based on data features including a data amount of the target voice data, generating a target model by training a pre-trained model pre-trained to convert text into an audio signal, by using the target voice data as training data, based on the determined number of learning steps, generating output data obtained by converting input text into an audio signal, by using the generated target model, and outputting the generated output data.
    Type: Grant
    Filed: September 21, 2022
    Date of Patent: March 26, 2024
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Kyoungbo Min, Seungdo Choi, Doohwa Hong
  • Patent number: 11830473
    Abstract: A system for synthesising expressive speech includes: an interface configured to receive an input text for conversion to speech; a memory; and at least one processor coupled to the memory. The processor is configured to generate, using an expressivity characterisation module, a plurality of expression vectors, wherein each expression vector is a representation of prosodic information in a reference audio style file, and synthesise expressive speech from the input text, using an expressive acoustic model comprising a deep convolutional neural network that is conditioned by at least one of the plurality of expression vectors.
    Type: Grant
    Filed: September 29, 2020
    Date of Patent: November 28, 2023
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Jesus Monge Alvarez, Holly Francois, Hosang Sung, Seungdo Choi, Kihyun Choo, Sangjun Park
  • Patent number: 11763799
    Abstract: An electronic apparatus and a controlling method thereof are provided. The electronic apparatus includes a microphone; a memory configured to store a text-to-speech (TTS) model and a plurality of evaluation texts; and a processor configured to: obtain a first reference vector of a user speech spoken by a user based on the user speech being received through the microphone, generate a plurality of candidate reference vectors based on the first reference vector, obtain a plurality of synthesized sounds by inputting the plurality of candidate reference vectors and the plurality of evaluation texts to the TTS model, identify at least one synthesized sound of the plurality of synthesized sounds based on a similarity between characteristics of the plurality of synthesized sounds and the user speech, and store a second reference vector of the at least one synthesized sound in the memory as a reference vector corresponding to the user for the TTS model.
    Type: Grant
    Filed: December 17, 2021
    Date of Patent: September 19, 2023
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Sangjun Park, Kyoungbo Min, Kihyun Choo, Seungdo Choi
  • Publication number: 20230230569
    Abstract: An electronic apparatus includes an inner microphone provided on a first surface of the electronic apparatus; an outer microphone disposed on a second surface opposite the first surface; and a processor configured to: receive a voice signal of a counterpart and a voice signal of a wearer of the electronic apparatus that are input through the inner microphone and the outer microphone, based on a size of the voice signal of the wearer input through the inner microphone being greater than or equal to a predetermined threshold, remove the voice signal of the wearer input through the outer microphone based on the voice signal of the wearer input through the inner microphone, and amplify the voice signal of the counterpart input through the outer microphone and from which the voice signal of the wearer is removed and output the amplified voice signal, wherein the size of the voice signal of the wearer input through the inner microphone is greater than a size of the voice signal of the wearer input through the outer
    Type: Application
    Filed: November 18, 2022
    Publication date: July 20, 2023
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Seungdo CHOI, Kyoungbo MIN, Sooyeon PARK
  • Publication number: 20230017302
    Abstract: An electronic device for providing a text-to-speech (TTS) service and an operating method therefor are provided. The operating method of the electronic device includes obtaining target voice data based on an utterance input of a specific speaker, determining a number of learning steps of the target voice data, based on data features including a data amount of the target voice data, generating a target model by training a pre-trained model pre-trained to convert text into an audio signal, by using the target voice data as training data, based on the determined number of learning steps, generating output data obtained by converting input text into an audio signal, by using the generated target model, and outputting the generated output data.
    Type: Application
    Filed: September 21, 2022
    Publication date: January 19, 2023
    Inventors: Kyoungbo MIN, Seungdo CHOI, Doohwa HONG
  • Patent number: 11475878
    Abstract: An electronic device for providing a text-to-speech (TTS) service and an operating method therefor are provided. The operating method of the electronic device includes obtaining target voice data based on an utterance input of a specific speaker, determining a number of learning steps of the target voice data, based on data features including a data amount of the target voice data, generating a target model by training a pre-trained model pre-trained to convert text into an audio signal, by using the target voice data as training data, based on the determined number of learning steps, generating output data obtained by converting input text into an audio signal, by using the generated target model, and outputting the generated output data.
    Type: Grant
    Filed: October 27, 2020
    Date of Patent: October 18, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Kyoungbo Min, Seungdo Choi, Doohwa Hong
  • Patent number: 11404045
    Abstract: A speech synthesis method, performed by an electronic apparatus to synthesize speech from text, includes: obtaining text input to the electronic apparatus; obtaining a text representation by encoding the text using a text encoder of the electronic apparatus; obtaining an audio representation of a first audio frame set from an audio encoder of the electronic apparatus, based on the text representation; obtaining an audio representation of a second audio frame set based on the text representation and the audio representation of the first audio frame set; obtaining an audio feature of the second audio frame set by decoding the audio representation of the second audio frame set; and synthesizing speech based on an audio feature of the first audio frame set and the audio feature of the second audio frame set.
    Type: Grant
    Filed: August 31, 2020
    Date of Patent: August 2, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Seungdo Choi, Kyoungbo Min, Sangjun Park, Kihyun Choo
  • Publication number: 20220148562
    Abstract: An electronic apparatus and a controlling method thereof are provided. The electronic apparatus includes a microphone; a memory configured to store a text-to-speech (TTS) model and a plurality of evaluation texts; and a processor configured to: obtain a first reference vector of a user speech spoken by a user based on the user speech being received through the microphone, generate a plurality of candidate reference vectors based on the first reference vector, obtain a plurality of synthesized sounds by inputting the plurality of candidate reference vectors and the plurality of evaluation texts to the TTS model, identify at least one synthesized sound of the plurality of synthesized sounds based on a similarity between characteristics of the plurality of synthesized sounds and the user speech, and store a second reference vector of the at least one synthesized sound in the memory as a reference vector corresponding to the user for the TTS model.
    Type: Application
    Filed: December 17, 2021
    Publication date: May 12, 2022
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Sangjun Park, Kyoungbo Min, Kihyun Choo, Seungdo Choi
  • Publication number: 20210225358
    Abstract: A system for synthesising expressive speech includes: an interface configured to receive an input text for conversion to speech; a memory; and at least one processor coupled to the memory. The processor is configured to generate, using an expressivity characterisation module, a plurality of expression vectors, wherein each expression vector is a representation of prosodic information in a reference audio style file, and synthesise expressive speech from the input text, using an expressive acoustic model comprising a deep convolutional neural network that is conditioned by at least one of the plurality of expression vectors.
    Type: Application
    Filed: September 29, 2020
    Publication date: July 22, 2021
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Jesus Monge Alvarez, Holly Francois, Hosang Sung, Seungdo Choi, Kihyun Choo, Sangjun Park
  • Publication number: 20210134269
    Abstract: An electronic device for providing a text-to-speech (TTS) service and an operating method therefor are provided. The operating method of the electronic device includes obtaining target voice data based on an utterance input of a specific speaker, determining a number of learning steps of the target voice data, based on data features including a data amount of the target voice data, generating a target model by training a pre-trained model pre-trained to convert text into an audio signal, by using the target voice data as training data, based on the determined number of learning steps, generating output data obtained by converting input text into an audio signal, by using the generated target model, and outputting the generated output data.
    Type: Application
    Filed: October 27, 2020
    Publication date: May 6, 2021
    Inventors: Kyoungbo MIN, Seungdo CHOI, Doohwa HONG
  • Publication number: 20210065678
    Abstract: A speech synthesis method, performed by an electronic apparatus to synthesize speech from text, includes: obtaining text input to the electronic apparatus; obtaining a text representation by encoding the text using a text encoder of the electronic apparatus; obtaining an audio representation of a first audio frame set from an audio encoder of the electronic apparatus, based on the text representation; obtaining an audio representation of a second audio frame set based on the text representation and the audio representation of the first audio frame set; obtaining an audio feature of the second audio frame set by decoding the audio representation of the second audio frame set; and synthesizing speech based on an audio feature of the first audio frame set and the audio feature of the second audio frame set.
    Type: Application
    Filed: August 31, 2020
    Publication date: March 4, 2021
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Seungdo CHOI, Kyoungbo MIN, Sangjun PARK, Kihyun CHOO