Patents by Inventor KAPIL DHAWAN

KAPIL DHAWAN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11361780
    Abstract: Information loss in speech to text conversion and Inability to preserve vocal emotion information without changing the artificial intelligence model infrastructure in a conventional speech to speech translation system are essential drawback of the conventional techniques. Embodiments of the invention provide direct speech to speech translation system is disclosed. Direct speech to speech translation system uses a one-tier approach, creating a unified-model for whole application. The single-model ecosystem takes in audio (mel spectrogram) as an input and gives out audio (mel spectrogram) as an output. This solves the bottleneck problem by not converting speech directly to text but having text as a byproduct of speech to speech translation, preserving phonetic information along the way. This model also uses pre-processing and post-processing scripts but only for the whole model. This model needs parallel audio samples in two languages.
    Type: Grant
    Filed: December 24, 2021
    Date of Patent: June 14, 2022
    Inventors: Sandeep Dhawan, Kapil Dhawan, Dennis Reutter, Chris Beckman, Ahsan Memon
  • Publication number: 20220115028
    Abstract: Information loss in speech to text conversion and Inability to preserve vocal emotion information without changing the artificial intelligence model infrastructure in a conventional speech to speech translation system are essential drawback of the conventional techniques. Embodiments of the invention provide direct speech to speech translation system is disclosed. Direct speech to speech translation system uses a one-tier approach, creating a unified-model for whole application. The single-model ecosystem takes in audio (mel spectrogram) as an input and gives out audio (mel spectrogram) as an output. This solves the bottleneck problem by not converting speech directly to text but having text as a byproduct of speech to speech translation, preserving phonetic information along the way. This model also uses pre-processing and post-processing scripts but only for the whole model. This model needs parallel audio samples in two languages.
    Type: Application
    Filed: December 24, 2021
    Publication date: April 14, 2022
    Inventors: Sandeep Dhawan, Kapil Dhawan, Dennis Reutter, Chris Beckman, Ahsan Memon
  • Publication number: 20140193140
    Abstract: A method for slow motion display of audiovisual content on a mobile device comprises storing a plurality of videos in a memory; providing a first video window configured to include a start/pause control, frame control and video display area for display of a first video; and determining, by a display orientation sensor, whether the touchscreen is in portrait or landscape orientation. If in portrait orientation, the first video window occupies substantially the entire viewing area. If in landscape orientation, the first video window occupies a first portion of the viewing area and an analysis window occupies a second portion of the viewing area. The analysis window includes either a menu displaying a list of videos for selection as a second video, or if a second video has been selected, a second video window including independent start/pause control, frame control, and video display area for independent display of the second video.
    Type: Application
    Filed: August 12, 2013
    Publication date: July 10, 2014
    Inventors: SANDY FLIDERMAN, KAPIL DHAWAN