Patents by Inventor KAPIL DHAWAN

KAPIL DHAWAN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Real-time speech-to-speech generation (RSSG) apparatus, method and a system therefore

Patent number: 11361780

Abstract: Information loss in speech to text conversion and Inability to preserve vocal emotion information without changing the artificial intelligence model infrastructure in a conventional speech to speech translation system are essential drawback of the conventional techniques. Embodiments of the invention provide direct speech to speech translation system is disclosed. Direct speech to speech translation system uses a one-tier approach, creating a unified-model for whole application. The single-model ecosystem takes in audio (mel spectrogram) as an input and gives out audio (mel spectrogram) as an output. This solves the bottleneck problem by not converting speech directly to text but having text as a byproduct of speech to speech translation, preserving phonetic information along the way. This model also uses pre-processing and post-processing scripts but only for the whole model. This model needs parallel audio samples in two languages.

Type: Grant

Filed: December 24, 2021

Date of Patent: June 14, 2022

Inventors: Sandeep Dhawan, Kapil Dhawan, Dennis Reutter, Chris Beckman, Ahsan Memon
REAL-TIME SPEECH-TO-SPEECH GENERATION (RSSG) APPARATUS, METHOD AND A SYSTEM THEREFORE

Publication number: 20220115028

Abstract: Information loss in speech to text conversion and Inability to preserve vocal emotion information without changing the artificial intelligence model infrastructure in a conventional speech to speech translation system are essential drawback of the conventional techniques. Embodiments of the invention provide direct speech to speech translation system is disclosed. Direct speech to speech translation system uses a one-tier approach, creating a unified-model for whole application. The single-model ecosystem takes in audio (mel spectrogram) as an input and gives out audio (mel spectrogram) as an output. This solves the bottleneck problem by not converting speech directly to text but having text as a byproduct of speech to speech translation, preserving phonetic information along the way. This model also uses pre-processing and post-processing scripts but only for the whole model. This model needs parallel audio samples in two languages.

Type: Application

Filed: December 24, 2021

Publication date: April 14, 2022

Inventors: Sandeep Dhawan, Kapil Dhawan, Dennis Reutter, Chris Beckman, Ahsan Memon
SYSTEM AND METHOD FOR SLOW MOTION DISPLAY, ANALYSIS AND/OR EDITING OF AUDIOVISUAL CONTENT ON A MOBILE DEVICE

Publication number: 20140193140

Abstract: A method for slow motion display of audiovisual content on a mobile device comprises storing a plurality of videos in a memory; providing a first video window configured to include a start/pause control, frame control and video display area for display of a first video; and determining, by a display orientation sensor, whether the touchscreen is in portrait or landscape orientation. If in portrait orientation, the first video window occupies substantially the entire viewing area. If in landscape orientation, the first video window occupies a first portion of the viewing area and an analysis window occupies a second portion of the viewing area. The analysis window includes either a menu displaying a list of videos for selection as a second video, or if a second video has been selected, a second video window including independent start/pause control, frame control, and video display area for independent display of the second video.

Type: Application

Filed: August 12, 2013

Publication date: July 10, 2014

Inventors: SANDY FLIDERMAN, KAPIL DHAWAN

Real-time speech-to-speech generation (RSSG) apparatus, method and a system therefore

REAL-TIME SPEECH-TO-SPEECH GENERATION (RSSG) APPARATUS, METHOD AND A SYSTEM THEREFORE

SYSTEM AND METHOD FOR SLOW MOTION DISPLAY, ANALYSIS AND/OR EDITING OF AUDIOVISUAL CONTENT ON A MOBILE DEVICE