Patents by Inventor Dhananjaya N. GOWDA

Dhananjaya N. GOWDA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11961522
    Abstract: The disclosure relates to an electronic apparatus for recognizing user voice and a method of recognizing, by the electronic apparatus, the user voice. According to an embodiment, the method of recognizing the user voice includes obtaining an audio signal segmented into a plurality of frame units, determining an energy component for each filter bank by applying a filter bank distributed according to a preset scale to a frequency spectrum of the audio signal segmented into the frame units, smoothing the determined energy component for each filter bank, extracting a feature vector of the audio signal based on the smoothed energy component for each filter bank, and recognizing the user voice in the audio signal by inputting the extracted feature vector to a voice recognition model.
    Type: Grant
    Filed: November 22, 2019
    Date of Patent: April 16, 2024
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Chanwoo Kim, Dhananjaya N. Gowda, Sungsoo Kim, Minkyu Shin, Larry Paul Heck, Abhinav Garg, Kwangyoun Kim, Mehul Kumar
  • Patent number: 11532310
    Abstract: Provided is a system and method for recognizing a user's speech. A method, performed by a server, of providing a text string for a speech signal input to a device includes: receiving, from the device, an encoder output value derived from an encoder of an end-to-end automatic speech recognition (ASR) model included in the device; identifying a domain corresponding to the received encoder output value; selecting a decoder corresponding to the identified domain from among a plurality of decoders of an end-to-end ASR model included in the server; obtaining a text string from the received encoder output value using the selected decoder; and providing the obtained text string to the device.
    Type: Grant
    Filed: August 10, 2020
    Date of Patent: December 20, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Chanwoo Kim, Dhananjaya N. Gowda, Kwangyoun Kim, Kyungmin Lee
  • Patent number: 11521619
    Abstract: Provided are a system and method for modifying a speech recognition result. The method includes: receiving, from a device, text output from an automatic speech recognition (ASR) model of the device; identifying at least one domain related to the received text; selecting, from among a plurality of text modification models included in the server, at least one text modification model corresponding to the identified at least one domain; and modifying the received text by using the selected at least one text modification model.
    Type: Grant
    Filed: August 11, 2020
    Date of Patent: December 6, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Chanwoo Kim, Dhananjaya N. Gowda, Abhinav Garg, Kyungmin Lee
  • Patent number: 11514916
    Abstract: A server for supporting speech recognition of a device and an operation method of the server. The server and method identify a plurality of estimated character strings from the first character string and obtain a second character string, based on the plurality of estimated character strings, and transmit the second character string to the device. The first character string is output from a speech signal input to the device, via speech recognition.
    Type: Grant
    Filed: August 13, 2020
    Date of Patent: November 29, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Chanwoo Kim, Sichen Jin, Kyungmin Lee, Dhananjaya N. Gowda, Kwangyoun Kim
  • Patent number: 11475896
    Abstract: Provided is a system and method for recognizing a user's speech. A method, performed by a server, of providing a text string for a speech signal input to a device includes: receiving, from the device, an encoder output value derived from an encoder of an end-to-end automatic speech recognition (ASR) model included in the device; identifying a domain corresponding to the received encoder output value; selecting a decoder corresponding to the identified domain from among a plurality of decoders of an end-to-end ASR model included in the server; obtaining a text string from the received encoder output value using the selected decoder; and providing the obtained text string to the device.
    Type: Grant
    Filed: August 10, 2020
    Date of Patent: October 18, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Chanwoo Kim, Dhananjaya N. Gowda, Kwangyoun Kim, Kyungmin Lee
  • Patent number: 11302331
    Abstract: Provided are an electronic device for recognizing speech of a user, and a method, performed by the electronic device, of recognizing speech. The method includes obtaining an audio signal based on a speech input based on the audio signal being input, obtaining an output value of a first automatic speech recognition (ASR) model that outputs a character string at a first level; obtaining an output value of a second ASR model that outputs a character string at a second level corresponding to the audio signal based on the output value of the first ASR model based on the audio signal being input; and recognizing the speech from the output value of the second ASR model.
    Type: Grant
    Filed: January 23, 2020
    Date of Patent: April 12, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Dhananjaya N. Gowda, Kwangyoun Kim, Abhinav Garg, Chanwoo Kim
  • Publication number: 20220005481
    Abstract: The disclosure relates to an electronic apparatus for recognizing user voice and a method of recognizing, by the electronic apparatus, the user voice. According to an embodiment, the method of recognizing the user voice includes obtaining an audio signal segmented into a plurality of frame units, determining an energy component for each filter bank by applying a filter bank distributed according to a preset scale to a frequency spectrum of the audio signal segmented into the frame units, smoothing the determined energy component for each filter bank, extracting a feature vector of the audio signal based on the smoothed energy component for each filter bank, and recognizing the user voice in the audio signal by inputting the extracted feature vector to a voice recognition model.
    Type: Application
    Filed: November 22, 2019
    Publication date: January 6, 2022
    Inventors: Chanwoo KIM, Dhananjaya N. GOWDA, Sungsoo KIM, Minkyu SHIN, Larry Paul HECK, Abhinav GARG, Kwangyoun KIM, Mehul KUMAR
  • Publication number: 20210050017
    Abstract: Provided are a system and method for modifying a speech recognition result. The method includes: receiving, from a device, text output from an automatic speech recognition (ASR) model of the device; identifying at least one domain related to the received text; selecting, from among a plurality of text modification models included in the server, at least one text modification model corresponding to the identified at least one domain; and modifying the received text by using the selected at least one text modification model.
    Type: Application
    Filed: August 11, 2020
    Publication date: February 18, 2021
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Chanwoo KIM, Dhananjaya N. GOWDA, Abhinav GARG, Kyungmin LEE
  • Publication number: 20210050018
    Abstract: A server for supporting speech recognition of a device and an operation method of the server. The server and method identify a plurality of estimated character strings from the first character string and obtain a second character string, based on the plurality of estimated character strings, and transmit the second character string to the device. The first character string is output from a speech signal input to the device, via speech recognition.
    Type: Application
    Filed: August 13, 2020
    Publication date: February 18, 2021
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Chanwoo KIM, Sichen JIN, Kyungmin LEE, Dhananjaya N. GOWDA, Kwangyoun KIM
  • Publication number: 20210050016
    Abstract: Provided is a system and method for recognizing a user's speech. A method, performed by a server, of providing a text string for a speech signal input to a device includes: receiving, from the device, an encoder output value derived from an encoder of an end-to-end automatic speech recognition (ASR) model included in the device; identifying a domain corresponding to the received encoder output value; selecting a decoder corresponding to the identified domain from among a plurality of decoders of an end-to-end ASR model included in the server; obtaining a text string from the received encoder output value using the selected decoder; and providing the obtained text string to the device.
    Type: Application
    Filed: August 10, 2020
    Publication date: February 18, 2021
    Inventors: Chanwoo KIM, Dhananjaya N. GOWDA, Kwangyoun KIM, Kyungmin LEE
  • Publication number: 20200234713
    Abstract: Provided are an electronic device for recognizing speech of a user, and a method, performed by the electronic device, of recognizing speech. The method includes obtaining an audio signal based on a speech input based on the audio signal being input, obtaining an output value of a first automatic speech recognition (ASR) model that outputs a character string at a first level; obtaining an output value of a second ASR model that outputs a character string at a second level corresponding to the audio signal based on the output value of the first ASR model based on the audio signal being input; and recognizing the speech from the output value of the second ASR model.
    Type: Application
    Filed: January 23, 2020
    Publication date: July 23, 2020
    Inventors: Dhananjaya N. GOWDA, Kwangyoun KIM, Abhinav GARG, Chanwoo KIM