Patents by Inventor Daniel Roth
Daniel Roth has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20050159948
Abstract: The invention relates to the combination of speech recognition with handwriting and/or character recognition. This includes the innovation of selecting one or more best-scoring recognition candidates as a function of recognition of both handwritten and spoken representations of a sequence of one or more words to be recognized. It also includes the innovation of using character or handwriting recognition of one or more letters to alphabetically filter speech recognition of one or more words. It also includes the innovations of using speech recognition of one or more letter-identifying words to alphabetically filter handwriting recognition, and of using speech recognition to correct handwriting recognition of one or more words.
Type: Application
Filed: December 5, 2004
Publication date: July 21, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Edward Porter
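The abstract above describes choosing the best-scoring candidates from both spoken and handwritten input, and using recognized letters as an alphabetic filter. The following is only an illustrative sketch of that general idea, not the method claimed in the application. The function names, the log-score dictionaries, the additive combination rule, and the `floor` penalty are all assumptions made for this example.

```python
def combine_candidates(speech_scores, handwriting_scores, floor=-20.0):
    """Rank words by the sum of their speech and handwriting log-scores.

    Assumption: each recognizer returns a dict of word -> log-score.
    A word missing from one recognizer's list gets `floor`, an assumed penalty.
    """
    words = set(speech_scores) | set(handwriting_scores)
    combined = {
        w: speech_scores.get(w, floor) + handwriting_scores.get(w, floor)
        for w in words
    }
    # Sort by score (best first), then alphabetically for a deterministic order.
    return sorted(combined.items(), key=lambda kv: (-kv[1], kv[0]))


def filter_by_letters(candidates, letters):
    """Alphabetic filter: keep candidates that start with the recognized letters."""
    prefix = letters.lower()
    return [w for w in candidates if w.lower().startswith(prefix)]


# Hypothetical recognizer outputs for one utterance and one handwritten word.
speech = {"write": -1.0, "right": -1.2, "rite": -3.0}
handwriting = {"right": -0.5, "night": -0.9, "write": -4.0}

ranked = combine_candidates(speech, handwriting)
best_word = ranked[0][0]
filtered = filter_by_letters(["write", "right", "rite"], "ri")
```

Here "right" ranks first because it scores reasonably well in both recognizers, while "write" and "night" each score well in only one.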
-
Publication number: 20050149327
Abstract: A method of constructing a text message on a mobile communications device, the method involving: storing a plurality of text phrases; for each of the text phrases, storing a representation that is derived from that text phrase; receiving a spoken phrase from a user; from the received spoken phrase generating an acoustic representation thereof; based on the acoustic representation, searching among the stored representations to identify a stored text phrase that best matches the spoken phrase; and inserting into an electronic document the text phrase that is identified from searching.
Type: Application
Filed: September 7, 2004
Publication date: July 7, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Jordan Cohen
-
Publication number: 20050143970
Abstract: A method of generating an alternative pronunciation for a word or phrase, given an initial pronunciation and a spoken example of the word or phrase, includes providing the initial pronunciation of the word or phrase, and generating the alternative pronunciation by searching a neighborhood of pronunciations about the initial pronunciation via a constrained hypothesis, wherein the neighborhood includes pronunciations that differ from the initial pronunciation by at most one phoneme. The method further includes selecting a highest scoring pronunciation within the neighborhood of pronunciations.
Type: Application
Filed: September 13, 2004
Publication date: June 30, 2005
Inventors: Daniel Roth, Laurence Gillick, Michael Shire
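The neighborhood search in the abstract above can be illustrated with a small sketch. It assumes that "differ by at most one phoneme" means a single substitution, insertion, or deletion, which the abstract does not spell out. The phoneme inventory, the function names, and the toy scoring function (a stand-in for scoring against the spoken example) are hypothetical.

```python
def one_phoneme_neighborhood(pron, inventory):
    """Return every pronunciation within one phoneme edit of `pron`.

    Assumption: one edit is a substitution, insertion, or deletion.
    The initial pronunciation itself is included in the result.
    """
    pron = tuple(pron)
    neighbors = {pron}
    for i in range(len(pron)):
        if len(pron) > 1:
            neighbors.add(pron[:i] + pron[i + 1:])            # deletion
        for p in inventory:
            neighbors.add(pron[:i] + (p,) + pron[i + 1:])     # substitution
    for i in range(len(pron) + 1):
        for p in inventory:
            neighbors.add(pron[:i] + (p,) + pron[i:])         # insertion
    return neighbors


def best_pronunciation(pron, inventory, score):
    """Select the highest-scoring pronunciation in the neighborhood."""
    # Sorting first makes tie-breaking deterministic.
    return max(sorted(one_phoneme_neighborhood(pron, inventory)), key=score)


# Toy setup: the "spoken example" is represented directly as a phoneme tuple.
inventory = ["t", "ah", "m", "ey", "aa", "ow"]
initial = ("t", "ah", "m", "ey", "t", "ow")
heard = ("t", "ah", "m", "aa", "t", "ow")


def toy_score(candidate):
    """Hypothetical score: fewer positional mismatches with `heard` is better."""
    mismatches = sum(a != b for a, b in zip(candidate, heard))
    return -(mismatches + abs(len(candidate) - len(heard)))


alternative = best_pronunciation(initial, inventory, toy_score)
```

In a real recognizer the score would come from aligning each candidate pronunciation against the recorded audio; the toy function only keeps the example self-contained.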
-
Publication number: 20050137878
Abstract: A method of operating a device that includes speech recognition capabilities includes implementing on a device a plurality of user interfaces, wherein at least one of said user interfaces is a voice interface. The method also includes launching a first application, and as part of launching the first application, launching a second application, the second application optionally presenting to a user at least one query using the voice interface and populating an address field in the first application in response to the query using the speech recognition capabilities. The second application is launched either simultaneously with or subsequent to the launching of the first application. Populating the address field comprises accessing address information from a plurality of databases resident in the device.
Type: Application
Filed: September 10, 2004
Publication date: June 23, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Laurence Gillick, Jordan Cohen, William Barton
-
Publication number: 20050131685
Abstract: A method including: providing a mobile device (e.g. cellular phone) with a core engine for performing speech recognition; providing a plurality of sets of language-specific modules, each set of the plurality of sets for enabling the core engine to recognize a different language; selecting one set of language-specific modules among the plurality of sets of language-specific modules; and loading into memory within the mobile communication device the selected set of language-specific modules so as to enable the mobile communication device to recognize speech spoken in the language of the selected set.
Type: Application
Filed: November 15, 2004
Publication date: June 16, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Jordan Cohen, William Barton
-
Publication number: 20050049880
Abstract: The present invention relates to speech recognition using selectable recognition modes. This includes innovations such as: large vocabulary speech recognition programming that supplies recognized words to an external program as they are recognized, and allows a user to select between large vocabulary recognition of an utterance with and without language context from the prior utterance independently of the state of the external program; allowing a user to select between continuous and discrete speech recognition that use substantially the same vocabulary; allowing a user to select between continuous and discrete large-vocabulary speech recognition modes; allowing a user to select between at least two different alphabetic entry speech recognition modes; and allowing a user to select from among four or more of the following recognition modes when creating text: a large-vocabulary mode, an alphabetic entry mode, a number entry mode, and a punctuation entry mode.
Type: Application
Filed: September 24, 2004
Publication date: March 3, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Jordan Cohen, David Johnston, Manfred Grabherr
-
Publication number: 20050043954
Abstract: Large vocabulary speech recognition can automatically turn recognition off in one or more ways. A user command can turn on recognition that is automatically turned off after the next end of utterance. A plurality of buttons can each be associated with a different speech mode and the touch of a given button can turn on, and then automatically turn off, the given button's associated speech recognition mode. These selectable modes can include large vocabulary and alphabetic entry modes, or continuous and discrete modes. A first user input can start recognition that allows a sequence of vocabulary words to be recognized and a second user input can start recognition that turns off after one word has been recognized. A first user input can start recognition that allows a sequence of utterances to be recognized and a second user input can start recognition that allows only a single utterance to be recognized.
Type: Application
Filed: September 24, 2004
Publication date: February 24, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Jordan Cohen, David Johnston
-
Publication number: 20050043949
Abstract: One aspect of the invention involves word recognition that uses scrollable choice lists in which choices are listed in character-order. Another aspect relates to a scrollable, visually-displayed word recognition choice list, where the recognition candidates on the choice list are each associated with a choice-selecting symbol the user can use to select a desired recognition candidate by pressing an associated button, and where the same choice-selecting symbol is used for different choices displayed on the display at different times as a result of scrolling. Another aspect of the invention relates to providing a choice list of best scoring characters for a particular character position in the spelling of a filter that is used to filter word recognition. Another aspect of the invention relates to a choice list used in word recognition in which the choice list can be scrolled horizontally.
Type: Application
Filed: September 24, 2004
Publication date: February 24, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Jordan Cohen, David Johnston, Edward Porter
-
Publication number: 20050043947
Abstract: Alphabetic filtering of the speech recognition of words uses a key press to indicate a desired character in an alphabetic filter string, where each key press represents two or more letters. The key presses can be disambiguated by recognizing a key-disambiguation utterance in association with a given key press. A user can select a desired recognition candidate from a choice list produced by such filtered word recognition. Ambiguous alphabetic filtering can be performed iteratively in response to the addition of successive ambiguous key presses. A user can select to re-recognize the utterance using filtering based on ambiguous key input after seeing the results of recognition without such filtering. Unambiguous alphabetic filtering can be performed by using multiple presses of an ambiguous key to disambiguate which letter is intended. A user can select between entering text by either large vocabulary speech recognition or by spelling text by pressing phone keys.
Type: Application
Filed: September 24, 2004
Publication date: February 24, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Jordan Cohen, David Johnston
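Ambiguous alphabetic filtering, as described in the abstract above, can be sketched with the standard telephone keypad layout, where each digit key stands for several letters. This is a minimal illustration under stated assumptions, not the implementation in the application: the function names and the candidate list are hypothetical, and the filter is assumed to apply to the leading letters of each candidate word.

```python
# Standard letter groups on a telephone keypad.
KEYPAD = {
    "2": "abc", "3": "def", "4": "ghi", "5": "jkl",
    "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz",
}


def matches_keys(word, keys):
    """True if each leading letter of `word` is on the corresponding pressed key."""
    word = word.lower()
    if len(word) < len(keys):
        return False
    return all(ch in KEYPAD.get(k, "") for ch, k in zip(word, keys))


def filter_candidates(candidates, keys):
    """Keep recognition candidates consistent with the ambiguous key sequence."""
    return [w for w in candidates if matches_keys(w, keys)]


# Hypothetical recognition candidates, in recognizer score order.
candidates = ["cat", "bat", "act", "dog", "cab"]

# Iterative filtering: the list narrows as each key press is added.
after_one_key = filter_candidates(candidates, "2")
after_three_keys = filter_candidates(candidates, "228")
```

After the single key "2", four candidates remain; after "228", "cab" drops out because "b" is not on the "8" key.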
-
Publication number: 20050038657
Abstract: Text-to-speech (TTS) generation is used in conjunction with large vocabulary speech recognition to say words selected by the speech recognition. The software for performing the large vocabulary speech recognition can share speech modeling data with the TTS software. TTS or recorded audio can be used to automatically say both recognized text and the names of recognized commands after their recognition. The TTS can automatically repeat text recognized by the speech recognition after each of a succession of end of utterance detections. A user can move a cursor back or forward in recognized text, and the TTS can speak one or more words at the cursor location after each such move. The speech recognition can be used to produce a choice list of possible recognition candidates and the TTS can be used to provide spoken output of one or more of the candidates on the choice list.
Type: Application
Filed: September 24, 2004
Publication date: February 17, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Jordan Cohen, David Johnston, Manfred Grabherr, Edward Porter
-
Publication number: 20050038653
Abstract: Word recognition enables a user to have a selected transformation performed on a given word produced by word recognition. In one aspect of the invention, a selectable transformation changes the given word to a differently spelled word having the same word root. In another, a selectable transformation changes a given word to one or more of its homonyms. In yet another, a selectable transformation changes the given word between a representation that spells the word with letters and one that does not. In one aspect of the invention a user can select to display a choice list of transformed words corresponding to a given recognized word and then select to have one of the listed transformed words replace the given word. In another aspect of the invention word recognition favors recognition of words corresponding to a user selected part of speech.
Type: Application
Filed: September 24, 2004
Publication date: February 17, 2005
Applicant: Voice Signal Technologies, Inc.
Inventors: Daniel Roth, Jordan Cohen, David Johnston
-
Patent number: D361281
Type: Grant
Filed: January 14, 1994
Date of Patent: August 15, 1995
Assignee: Daniel Roth SA
Inventor: Daniel Roth