Patents by Inventor Kyuyeon HWANG

Kyuyeon HWANG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11475898
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example, a method includes receiving mixed speech data representing utterances of a target speaker and utterances of one or more interfering audio sources. The method further includes obtaining a target speaker representation, which represents speech characteristics of the target speaker; and determining, using a learning network, probability distributions of phonetic elements directly from the mixed speech data. The inputs of the learning network include the mixed speech data and the target speaker representation. An output of the learning network includes the probability distributions of phonetic elements. The method further includes generating text corresponding to the utterances of the target speaker based on the probability distributions of the phonetic elements; and providing a response to the target speaker based on the text corresponding to the utterances of the target speaker.
    Type: Grant
    Filed: August 7, 2019
    Date of Patent: October 18, 2022
    Assignee: Apple Inc.
    Inventors: Masood Delfarah, Ossama A. Abdelhamid, Kyuyeon Hwang, Donald R. McAllaster, Sabato Marco Siniscalchi
  • Publication number: 20200135209
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example, a method includes receiving mixed speech data representing utterances of a target speaker and utterances of one or more interfering audio sources. The method further includes obtaining a target speaker representation, which represents speech characteristics of the target speaker; and determining, using a learning network, probability distributions of phonetic elements directly from the mixed speech data. The inputs of the learning network include the mixed speech data and the target speaker representation. An output of the learning network includes the probability distributions of phonetic elements. The method further includes generating text corresponding to the utterances of the target speaker based on the probability distributions of the phonetic elements; and providing a response to the target speaker based on the text corresponding to the utterances of the target speaker.
    Type: Application
    Filed: August 7, 2019
    Publication date: April 30, 2020
    Inventors: Masood DELFARAH, Ossama A. ABDELHAMID, Kyuyeon HWANG, Donald R. MCALLASTER, Sabato Marco SINISCALCHI
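
Illustrative sketch (not part of the patent record): the data flow described in the abstracts above could be outlined roughly as follows. This is only a toy, assuming a single randomly initialized linear layer in place of the learning network, a hypothetical five-symbol phonetic inventory, and a greedy collapse-repeats decoding step; none of these details come from the patent, and all names (ToyLearningNetwork, greedy_decode, respond, run_pipeline) are invented for illustration.

```python
import math
import random

# Hypothetical phonetic inventory; "_" stands for a blank/silence symbol.
PHONES = ["_", "h", "e", "l", "o"]


def softmax(scores):
    """Turn raw scores into a probability distribution."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class ToyLearningNetwork:
    """Stand-in for the learning network: one linear layer plus softmax.

    Assumption: the real network is far more complex; this only shows
    that the inputs are the mixed speech data AND the speaker representation.
    """

    def __init__(self, frame_dim, speaker_dim, seed=0):
        rng = random.Random(seed)
        in_dim = frame_dim + speaker_dim
        self.weights = [
            [rng.uniform(-1.0, 1.0) for _ in range(in_dim)] for _ in PHONES
        ]

    def forward(self, mixed_frames, speaker_repr):
        """Return one distribution over PHONES per frame of mixed speech."""
        distributions = []
        for frame in mixed_frames:
            x = list(frame) + list(speaker_repr)  # condition on the target speaker
            scores = [sum(w * v for w, v in zip(row, x)) for row in self.weights]
            distributions.append(softmax(scores))
        return distributions


def greedy_decode(distributions):
    """Assumed decoding step: take the most likely phone per frame,
    collapse consecutive repeats, and drop blanks."""
    out = []
    prev = None
    for dist in distributions:
        best = PHONES[dist.index(max(dist))]
        if best != prev and best != "_":
            out.append(best)
        prev = best
    return "".join(out)


def respond(text):
    """Placeholder for the assistant's response to the recognized text."""
    return f"Heard: {text!r}"


def run_pipeline(mixed_frames, speaker_repr, network):
    """Mixed speech + speaker representation -> distributions -> text -> response."""
    distributions = network.forward(mixed_frames, speaker_repr)
    text = greedy_decode(distributions)
    return distributions, text, respond(text)
```

Note that the sketch never separates the target speaker's audio from the interfering sources before recognition: the phonetic distributions are computed directly from the mixed frames, with the speaker representation supplied as an extra input, which is the point the abstracts emphasize.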