Patents by Inventor Ashwin P. Rao

Ashwin P. Rao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Multiple-Sensory User Interface for Computing

Publication number: 20190384489

Abstract: The disclosure describes a generalized system and method to incorporate speak-n-touch UI and multi-sense UI into applications. Several examples are considered. Additionally, several new user experiences across a variety of applications are also presented. Our approach has potential to shift the interface paradigm and propel a new wave of artificial intelligence based general purpose computing.

Type: Application

Filed: April 18, 2019

Publication date: December 19, 2019

Inventor: Ashwin P Rao
USER INTERFACES FOR KEYBOARDS

Publication number: 20190079668

Abstract: The disclosure describes keyboard user-interfaces UIs. A system including a modified keyboard having fewer keys than a standard qwerty keyboard is disclosed. The modified keyboard appears on a display of the system. The system also including a keyboard UI module for processing inputs from the modified keyboard, wherein the keyboard UI module implements directional double taps for specified symbols, swipe gestures for changing modes that are displayed as the modified keyboard and for incorporating special keys of the modified keyboard, and status indicator changes using color coding schemes on the display. The keyboard UI module for implements wavetags for blind input of symbols and shortcuts on the modified keyboard.

Type: Application

Filed: June 29, 2018

Publication date: March 14, 2019

Inventor: Ashwin P Rao
METHOD FOR MULTI-SENSE FUSION USING SYNCHRONY

Publication number: 20180336191

Abstract: The disclosure describes an overall system and method for designing and building multi-sense systems using a generalized synchrony based fusion technique. By treating “time” as a common thread that runs across all the modes, the invention uses trigger inputs to form several groups of inputs across all modes. The best group is selected as the one that has the maximize synchrony as determined by a combination of a weight for the group and a timing correlation of inputs within that group. The features from each mode within the best group are then used individually or jointly for pattern recognition or understanding. The result is a robust practically implementable generalized theory of multi-sense fusion. The proposed invention has immense potential to leap frog single input systems and revolutionize human-machine interactions.

Type: Application

Filed: May 17, 2018

Publication date: November 22, 2018

Inventor: Ashwin P. Rao
System and method for multimodal utterance detection

Patent number: 9922640

Abstract: The disclosure describe a system and method for detecting one or more segments of desired speech utterances from an audio stream using timings of events from other modes that are correlated to the timings of the desired segments of speech. The redundant information from other modes results in a highly accurate and robust utterance detection.

Type: Grant

Filed: February 3, 2014

Date of Patent: March 20, 2018

Inventor: Ashwin P Rao
Speak and touch auto correction interface

Patent number: 9830912

Abstract: The disclosure describes an overall system/method for developing a “speak and touch auto correction interface” referred to as STACI which is far more superior to existing user interfaces including the widely adopted qwerty. Using STACI a user speaks and types a word at the same time. The redundant information from the two modes, namely speech and the letters typed, enables the user to sloppily and partially type the words. The result is a very fast and accurate enhanced keyboard interface enabling document production on computing devices like phones and tablets.

Type: Grant

Filed: March 15, 2013

Date of Patent: November 28, 2017

Inventor: Ashwin P Rao
System and Method for Multimodal Utterance Detection

Publication number: 20140222430

Abstract: The disclosure describe a system and method for detecting one or more segments of desired speech utterances from an audio stream using timings of events from other modes that are correlated to the timings of the desired segments of speech. The redundant information from other modes results in a highly accurate and robust utterance detection.

Type: Application

Filed: February 3, 2014

Publication date: August 7, 2014

Inventor: Ashwin P. Rao
Detecting segments of speech from an audio stream

Patent number: 8645131

Abstract: The disclosure describes a speech detection system for detecting one or more desired speech segments in an audio stream. The speech detection system includes an audio stream input and a speech detection technique. The speech detection technique may be performed in various ways, such as using pattern matching and/or signal processing. The pattern matching implementation may extract features representing types of sounds as in phrases, words, syllables, phonemes and so on. The signal processing implementation may extract spectrally-localized frequency-based features, amplitude-based features, and combinations of the frequency-based and amplitude-based features. Metrics may be obtained and used to determine a desired word in the audio stream. In addition, a keypad stream having keypad entries may be used in determining the desired word.

Type: Grant

Filed: October 16, 2009

Date of Patent: February 4, 2014

Inventors: Ashwin P. Rao, Gregory M. Aronov, Marat V. Garafutdinov
SPEAK AND TOUCH AUTO CORRECTION INTERFACE

Publication number: 20130289993

Abstract: The disclosure describes an overall system/method for developing a “speak and touch auto correction interface” referred to as STACI which is far more superior to existing user interfaces including the widely adopted qwerty. Using STACI a user speaks and types a word at the same time. The redundant information from the two modes, namely speech and the letters typed, enables the user to sloppily and partially type the words. The result is a very fast and accurate enhanced keyboard interface enabling document production on computing devices like phones and tablets.

Type: Application

Filed: March 15, 2013

Publication date: October 31, 2013

Inventor: Ashwin P. Rao
Multimodal interface for input of text

Patent number: 8571862

Abstract: The disclosure describes an overall system/method for text-input using a multimodal interface with a combination of speech recognition and text prediction. Specifically, an “always listening” mode for entering words is combined with a push-to-speak mode for entering symbols and phrases. In addition, these two modes are further combined with keypad based text prediction. Finally, the overall user-interface of the proposed system is designed such that it enhances existing standard text-input methods; thereby minimizing the behavior change for mobile users.

Type: Grant

Filed: October 13, 2009

Date of Patent: October 29, 2013

Inventors: Ashwin P. Rao, Gregory M. Aronov, Marat V. Garafutdinov
Multimodal speech recognition system

Patent number: 8355915

Abstract: The disclosure describes an overall system/method for text-input using a multimodal interface with speech recognition. Specifically, pluralities of modes interact with the main speech mode to provide the speech-recognition system with partial knowledge of the text corresponding to the spoken utterance forming the input to the speech recognition system. The knowledge from other modes is used to dynamically change the ASR system's active vocabulary thereby significantly increasing recognition accuracy and significantly reducing processing requirements. Additionally, the speech recognition system is configured using three different system configurations (always listening, partially listening, and push-to-speak) and for each one of those three different user-interfaces are proposed (speak-and-type, type-and-speak, and speak-while-typing).

Type: Grant

Filed: November 30, 2007

Date of Patent: January 15, 2013

Inventor: Ashwin P. Rao
Predictive speech-to-text input

Patent number: 7904298

Abstract: This disclosure describes a practical system/method for predicting spoken text (a spoken word or a spoken sentence/phrase) given that text's partial spelling (example, initial characters forming the spelling of a word/sentence). The partial spelling may be given using “Speech” or may be inputted using the keyboard/keypad or may be obtained using other input methods. The disclosed system is an alternative method for inputting text into devices; the method is faster (especially for long words or phrases) compared to existing predictive-text-input and/or word-completion methods.

Type: Grant

Filed: November 16, 2007

Date of Patent: March 8, 2011

Inventor: Ashwin P. Rao
Detecting Segments of Speech from an Audio Stream

Publication number: 20100100382

Abstract: The disclosure describes a speech detection system for detecting one or more desired speech segments in an audio stream. The speech detection system includes an audio stream input and a speech detection technique. The speech detection technique may be performed in various ways, such as using pattern matching and/or signal processing. The pattern matching implementation may extract features representing types of sounds as in phrases, words, syllables, phonemes and so on. The signal processing implementation may extract spectrally-localized frequency-based features, amplitude-based features, and combinations of the frequency-based and amplitude-based features. Metrics may be obtained and used to determine a desired word in the audio stream. In addition, a keypad stream having keypad entries may be used in determining the desired word.

Type: Application

Filed: October 16, 2009

Publication date: April 22, 2010

Inventors: Ashwin P Rao, Gregory M. Aronov, Marat V. Garafutdinov
MULTIMODAL INTERFACE FOR INPUT OF TEXT

Publication number: 20100031143

Abstract: The disclosure describes an overall system/method for text-input using a multimodal interface with a combination of speech recognition and text prediction. Specifically, an “always listening” mode for entering words is combined with a push-to-speak mode for entering symbols and phrases. In addition, these two modes are further combined with keypad based text prediction. Finally, the overall user-interface of the proposed system is designed such that it enhances existing standard text-input methods; thereby minimizing the behavior change for mobile users.

Type: Application

Filed: October 13, 2009

Publication date: February 4, 2010

Inventors: Ashwin P. Rao, Gregory M. Aronov, Marat V. Garafutdinov
Multimodal speech recognition system

Publication number: 20080133228

Abstract: The disclosure describes an overall system/method for text-input using a multimodal interface with speech recognition. Specifically, pluralities of modes interact with the main speech mode to provide the speech-recognition system with partial knowledge of the text corresponding to the spoken utterance forming the input to the speech recognition system. The knowledge from other modes is used to dynamically change the ASR system's active vocabulary thereby significantly increasing recognition accuracy and significantly reducing processing requirements. Additionally, the speech recognition system is configured using three different system configurations (always listening, partially listening, and push-to-speak) and for each one of those three different user-interfaces are proposed (speak-and-type, type-and-speak, and speak-while-typing).

Type: Application

Filed: November 30, 2007

Publication date: June 5, 2008

Inventor: Ashwin P. Rao
PREDICTIVE SPEECH-TO-TEXT INPUT

Publication number: 20080120102

Abstract: This disclosure describes a practical system/method for predicting spoken text (a spoken word or a spoken sentence/phrase) given that text's partial spelling (example, initial characters forming the spelling of a word/sentence). The partial spelling may be given using “Speech” or may be inputted using the keyboard/keypad or may be obtained using other input methods. The disclosed system is an alternative method for inputting text into devices; the method is faster (especially for long words or phrases) compared to existing predictive-text-input and/or word-completion methods.

Type: Application

Filed: November 16, 2007

Publication date: May 22, 2008

Inventor: Ashwin P. Rao